annotate man/man1/gennap.1 @ 0:5242703e91d3 tip

Initial checkin for AIM92 aimR8.2 (last updated May 1997).
author tomwalters
date Fri, 20 May 2011 15:19:45 +0100
parents
children
rev   line source
tomwalters@0 1 .TH GENNAP 1 "8 April 1994"
tomwalters@0 2 .LP
tomwalters@0 3 .SH NAME
tomwalters@0 4 .LP
tomwalters@0 5 gennap \- generate neural activity pattern
tomwalters@0 6 .LP
tomwalters@0 7 .SH SYNOPSIS
tomwalters@0 8 .LP
tomwalters@0 9 gennap [ option=value | -option ] [ filename ]
tomwalters@0 10 .LP
tomwalters@0 11 .SH DESCRIPTION
tomwalters@0 12 .LP
tomwalters@0 13 The gennap module of the AIM software converts an input wave into a
tomwalters@0 14 simulated neural activity pattern (NAP), which is AIM's representation
tomwalters@0 15 of the pattern of information in the auditory nerve at about the level
tomwalters@0 16 of the cochlear nucleus. Gennap begins by calculating the basilar
tomwalters@0 17 membrane motion (BMM) associated with the input wave using the genbmm
tomwalters@0 18 module, and then it applies several additional transforms that we know
tomwalters@0 19 occur in some form during the neural transduction process. AIM
tomwalters@0 20 provides two alternative methods for generating the NAP, a
tomwalters@0 21 two-dimensional adaptive thresholding mechanism (Holdsworth and
tomwalters@0 22 Patterson, 1993), and an array of inner haircell simulators based
tomwalters@0 23 (Meddis et al., 1990; Giguere and Woodland, 1994). The adaptive
tomwalters@0 24 thresholding mechanism applies rectification, log compression,
tomwalters@0 25 adaptation in time, and suppression across frequency; its purpose is
tomwalters@0 26 to stabilise the level of the membrane activity with compression and
tomwalters@0 27 then sharpen the features that appear in the compressed membrane
tomwalters@0 28 motion. Together, the gammatone filterbank and adaptive thresholding
tomwalters@0 29 form a 'functional' cochlea simulation. The Meddis module applies
tomwalters@0 30 level-dependant compression and adaptation that simulate the response
tomwalters@0 31 of inner haircells to membrane motion. The cells are not coupled and
tomwalters@0 32 so there is no frequency sharpening in this module. Together, the
tomwalters@0 33 transmission-line filterbank and the Meddis module form a
tomwalters@0 34 'physiological' cochlea simulation.
tomwalters@0 35 .LP
tomwalters@0 36 .SH OPTIONS
tomwalters@0 37 .LP
tomwalters@0 38 The options for gennap are grouped according to the functions they
tomwalters@0 39 control. The adaptive thresholding options are identified by the
tomwalters@0 40 common suffix _at; the Meddis module options are identified by the
tomwalters@0 41 common suffix _med. These two groups of options are the subject of
tomwalters@0 42 this manual entry, together with two additional options that specify
tomwalters@0 43 whether rectification and compression operations are required before
tomwalters@0 44 the transduction stage. There is also an option to specify the choice
tomwalters@0 45 of the transduction function.
tomwalters@0 46 .LP
tomwalters@0 47 .SH RECTIFICATION AND COMPRESSION
tomwalters@0 48 .LP
tomwalters@0 49 The adaptive thresholding process begins with rectification and log
tomwalters@0 50 compression of the BMM. It is occasionally useful to have these
tomwalters@0 51 functions available separately and so the options 'rectify' and
tomwalters@0 52 'compress' are presented separately in the options list before the
tomwalters@0 53 neural transduction options.
tomwalters@0 54 .RE
tomwalters@0 55 .LP
tomwalters@0 56 .TP 13
tomwalters@0 57 rectify
tomwalters@0 58 Rectification switch
tomwalters@0 59 .RS
tomwalters@0 60 Switch. Default value: off.
tomwalters@0 61 .RE
tomwalters@0 62 .RS
tomwalters@0 63 .LP
tomwalters@0 64 If rectify is on, the BMM is half-wave rectified.
tomwalters@0 65 The compression operation also performs half-wave rectification (to
tomwalters@0 66 avoid taking logs of negative numbers). So the rectify option is
tomwalters@0 67 really here just to provide for rectified BMM in the absence of
tomwalters@0 68 compression. As a result, the default for option rectify is
tomwalters@0 69 off. (Note: Full wave rectification is produced if rectify is set to
tomwalters@0 70 2. This is useful when calculating envelopes with genasa.)
tomwalters@0 71 .RE
tomwalters@0 72 .LP
tomwalters@0 73 .TP 13
tomwalters@0 74 compress
tomwalters@0 75 Compression switch
tomwalters@0 76 .RS
tomwalters@0 77 Switch. Default value: on.
tomwalters@0 78 .RE
tomwalters@0 79 .RS
tomwalters@0 80 .LP
tomwalters@0 81 The compressor is strictly logarithmic and so to this point, the
tomwalters@0 82 functional cochlea simulation is level independent. In the auditory
tomwalters@0 83 system, the compressor is logarithmic over the lower part of its range
tomwalters@0 84 and then it asymptotes to a soft limit. The default for option
tomwalters@0 85 compress is on (note that the compressor also performs half-wave
tomwalters@0 86 rectification).
tomwalters@0 87 .RE
tomwalters@0 88 .LP
tomwalters@0 89 Important: The default value for option compress is 'on' which assumes
tomwalters@0 90 that the transduction module is adaptive thresholding (the default for
tomwalters@0 91 the transduction option described below). If the Meddis transduction
tomwalters@0 92 module is selected (transduction=med), compress should be set to 'off'
tomwalters@0 93 to obtain the operation described in Giguerre and Woodland
tomwalters@0 94 (1994). This can be done on the command line (see EXAMPLES) or in the
tomwalters@0 95 appropriate .gen???rc files.
tomwalters@0 96 .RE
tomwalters@0 97 .LP
tomwalters@0 98 .SH NEURAL TRANSDUCTION
tomwalters@0 99 .LP
tomwalters@0 100 The neural transduction is performed either by two-dimensional
tomwalters@0 101 adaptive thresholding or an array of Meddis haircells. The choice is
tomwalters@0 102 controlled by the option 'transduction'.
tomwalters@0 103 .LP
tomwalters@0 104 .TP 13
tomwalters@0 105 transduction
tomwalters@0 106 The transduction function
tomwalters@0 107 .RS
tomwalters@0 108 Switch. Default value: at. Choices: at, med, off.
tomwalters@0 109 .RE
tomwalters@0 110 .LP
tomwalters@0 111 If adaptive thresholding is specified (at), the options with suffix
tomwalters@0 112 _at below apply; if the Meddis module is specified (med), the options
tomwalters@0 113 with suffix _med below apply. If off is specified, no transduction
tomwalters@0 114 function is applied. The default is at.
tomwalters@0 115 .RE
tomwalters@0 116 .LP
tomwalters@0 117 .SS "Two-dimensional adaptive thresholding: _at "
tomwalters@0 118 .PP
tomwalters@0 119 The adaptive thresholding mechanism is a functional model of neural
tomwalters@0 120 encoding. Its purpose is to enhance the contrast of the larger
tomwalters@0 121 features that appear in the surface of the BMM and reduce those
tomwalters@0 122 aspects of the representation which are just a direct consequence of
tomwalters@0 123 the filtering and compression processes (Holdsworth and Patterson,
tomwalters@0 124 1993). The process begins with rectification and compression of the
tomwalters@0 125 BMM. The tail of the envelope of the impulse response of the
tomwalters@0 126 gammatone filter is exponential. As a result, logarithmic compression
tomwalters@0 127 is used, since this makes the filter decay function linear in NAP
tomwalters@0 128 coordinates. Following compression, adaptation is applied in time and
tomwalters@0 129 suppression is applied across frequency.
tomwalters@0 130 .LP
tomwalters@0 131 Briefly, an adaptive threshold value is maintained for each channel
tomwalters@0 132 and updated at the sampling rate. The new value is the largest of a)
tomwalters@0 133 the previous value reduced by a fast-acting temporal decay factor
tomwalters@0 134 (t1recovery_at), b) the previous value reduced by a longer-term
tomwalters@0 135 temporal decay factor (t2recovery_at), c) the adapted level in the
tomwalters@0 136 channel immediately above, reduced by a frequency spread factor
tomwalters@0 137 (frecovery_at), d) the adapted level in the channel immediately below,
tomwalters@0 138 reduced by the same frequency spread factor, or e) a floor level that
tomwalters@0 139 precludes the mechanism listening to its own internal noise
tomwalters@0 140 (reclimit_at). The mechanism produces output whenever the input
tomwalters@0 141 exceeds the adaptive threshold, and the output level is the difference
tomwalters@0 142 between the input and the adaptive threshold. The adaptation and
tomwalters@0 143 suppression are coupled, and they jointly sharpen features like vowel
tomwalters@0 144 formants which appear smeared in compressed BMM.
tomwalters@0 145 .LP
tomwalters@0 146 .TP 13
tomwalters@0 147 trise_at
tomwalters@0 148 Threshold rise rate
tomwalters@0 149 .RS
tomwalters@0 150 Default value: 1000.
tomwalters@0 151 .RE
tomwalters@0 152 .RS
tomwalters@0 153 .LP
tomwalters@0 154 Upward Adaptation: This option specifies the rate at which the
tomwalters@0 155 adaptive threshold will rise in response to a rise in signal
tomwalters@0 156 level. The default value, 1000, means that the adaptive threshold
tomwalters@0 157 responds very quickly to increases in the input wave; essentially, it
tomwalters@0 158 follows the envelope of any rise in signal amplitude.
tomwalters@0 159 .RE
tomwalters@0 160 .LP
tomwalters@0 161 Downward Adaptation: Following the cessation of sound, or a rapid drop
tomwalters@0 162 in input level, temporal adaptation occurs in two stages as determined
tomwalters@0 163 by t1recovery_at, t2recovery_at and propt2t1_at: If the default values
tomwalters@0 164 are used, the mechanism initially adapts at a rate slightly slower
tomwalters@0 165 than the decay rate of the gammatone filter in the given channel, and
tomwalters@0 166 this represses much of the ringing of the impulse response of the
tomwalters@0 167 filter. Later the adaptation switches to a slower rate more in line
tomwalters@0 168 with data on auditory forward masking. The option propt2t1_at
tomwalters@0 169 determines the point at which the initial fast rate of decay gives way
tomwalters@0 170 to the slower limiting decay rate.
tomwalters@0 171 .RE
tomwalters@0 172 .LP
tomwalters@0 173 .TP 13
tomwalters@0 174 t1recovery_at
tomwalters@0 175 The initial rate of threshold recovery relative to filter decay rate
tomwalters@0 176 .RS
tomwalters@0 177 Default value: 0.6.
tomwalters@0 178 .RE
tomwalters@0 179 .RS
tomwalters@0 180 .LP
tomwalters@0 181 This option determines the initial rate of decay of the adaptive
tomwalters@0 182 threshold relative to the rate of decay of the auditory filter,
tomwalters@0 183 provided propt2t1_at is less than unity. Values of t1recovery_at less
tomwalters@0 184 than unity cause the adaptive threshold to decay more slowly than the
tomwalters@0 185 auditory filter and thereby to remove the filter response from the
tomwalters@0 186 representation when it is the sole reason for BMM activity. The rate
tomwalters@0 187 of decay is linear with respect to the log-compressed BMM, so it is
tomwalters@0 188 like an exponential decay with respect to the BMM.
tomwalters@0 189 .RE
tomwalters@0 190 .LP
tomwalters@0 191 .TP 13
tomwalters@0 192 t2recovery_at
tomwalters@0 193 The secondary threshold recovery rate
tomwalters@0 194 .RS
tomwalters@0 195 Default value: 0.2.
tomwalters@0 196 .RE
tomwalters@0 197 .RS
tomwalters@0 198 .LP
tomwalters@0 199 This option determines the limiting rate of decay of the adaptive
tomwalters@0 200 threshold when the sound ceases provided propt2t1_at is less than
tomwalters@0 201 unity. The default value causes the adaptive threshold to decay more
tomwalters@0 202 slowly than the initial rate as observed in auditory forward masking.
tomwalters@0 203 Note, however, that the system to this point is level independent,
tomwalters@0 204 whereas auditory forward masking is level dependent.
tomwalters@0 205 .RE
tomwalters@0 206 .LP
tomwalters@0 207 .TP 13
tomwalters@0 208 propt2t1_at
tomwalters@0 209 The point at which t1recovery_at gives way to t2_recovery_at
tomwalters@0 210 .RS
tomwalters@0 211 Default value: 0.5.
tomwalters@0 212 .RE
tomwalters@0 213 .RS
tomwalters@0 214 .LP
tomwalters@0 215 This option determines the point at which the initial fast rate of
tomwalters@0 216 decay (t1recovery_at) gives way to the slower limiting decay rate
tomwalters@0 217 (t2recovery_at). The nomanclature assumes that propt2t1_at is a value
tomwalters@0 218 less than unity. Otherwise the the roles of the initial and limiting
tomwalters@0 219 decays are reversed.
tomwalters@0 220 .RE
tomwalters@0 221 .LP
tomwalters@0 222 .TP 13
tomwalters@0 223 frecovery_at
tomwalters@0 224 Recovery rate across frequency
tomwalters@0 225 .RS
tomwalters@0 226 Default value: 20.
tomwalters@0 227 .RE
tomwalters@0 228 .RS
tomwalters@0 229 .LP
tomwalters@0 230 This parameter specifies the rate at which a threshold value in one channel
tomwalters@0 231 propagates to influence threshold in neighbouring channels.
tomwalters@0 232 .RE
tomwalters@0 233 .LP
tomwalters@0 234 .TP 13
tomwalters@0 235 reclimit_at
tomwalters@0 236 Limitation on recovery level
tomwalters@0 237 .RS
tomwalters@0 238 Default units: mB. Default value: 500 mB. (mB=milliBells)
tomwalters@0 239 .RE
tomwalters@0 240 .RS
tomwalters@0 241 .LP
tomwalters@0 242 In order to prevent the mechanism from encountering system noise,
tomwalters@0 243 or alternately, to reduce sensitivity to stimulus noise, there is a
tomwalters@0 244 limit placed on the recovery that the adaptive threshold can achieve.
tomwalters@0 245 The limit, reclimit_at, is the limit of the sensitivity of the system.
tomwalters@0 246 .RE
tomwalters@0 247 .LP
tomwalters@0 248 .TP 13
tomwalters@0 249 gain_at
tomwalters@0 250 Output gain
tomwalters@0 251 .RS
tomwalters@0 252 Default units: scalar. Default value: 1.
tomwalters@0 253 .RE
tomwalters@0 254 .LP
tomwalters@0 255 .SS "Meddis haircell model: _med "
tomwalters@0 256 .PP
tomwalters@0 257 The purpose of the Meddis module is to simulate neural transduction of
tomwalters@0 258 BMM as performed by the inner haircells of the cochlea. There is one
tomwalters@0 259 haircell simulation unit for each output channel of the filterbank.
tomwalters@0 260 The haircell equations (Meddis et al., 1990) are solved using the wave
tomwalters@0 261 digital filter algorithm described in Giguere and Woodland (1994). The
tomwalters@0 262 characteristics of the haircell are controlled by options: fiber_med,
tomwalters@0 263 thresh_med, and gain_med.
tomwalters@0 264 .LP
tomwalters@0 265 .TP 13
tomwalters@0 266 fiber_med
tomwalters@0 267 The spontaneous-rate of the simulated fiber
tomwalters@0 268 .RS
tomwalters@0 269 Default value: medium. Choices: medium, high.
tomwalters@0 270 .RE
tomwalters@0 271 .RS
tomwalters@0 272 .LP
tomwalters@0 273 If medium is specified, a medium spontaneous-rate haircell fiber is
tomwalters@0 274 simulated. If high is specified, a high spontaneous-rate
tomwalters@0 275 fiber is simulated. The properties of these two types of fibers
tomwalters@0 276 are listed in Table II in Meddis et al. (1990).
tomwalters@0 277 The default value is medium.
tomwalters@0 278 .RE
tomwalters@0 279 .LP
tomwalters@0 280 .TP 13
tomwalters@0 281 thresh_med
tomwalters@0 282 The threshold shift of the fiber
tomwalters@0 283 .RS
tomwalters@0 284 Default Units: dB. Default value: 0.
tomwalters@0 285 .RE
tomwalters@0 286 .RS
tomwalters@0 287 .LP
tomwalters@0 288 This option shifts the entire rate-intensity function of the haircell
tomwalters@0 289 fiber horizontally to a higher or lower level, to accomodate changes
tomwalters@0 290 in the scaling of the input wave. A positive (negative) value
tomwalters@0 291 increases (decreases) the rate- and saturation-thresholds of the fiber
tomwalters@0 292 by that amount. This operation does not change the dynamic range, the
tomwalters@0 293 spontaneous and saturation rates, or the adaptation time constants or
tomwalters@0 294 synchronization index of the fiber.
tomwalters@0 295 .RE
tomwalters@0 296 .LP
tomwalters@0 297 .TP 13
tomwalters@0 298 gain_med
tomwalters@0 299 Output gain
tomwalters@0 300 .RS
tomwalters@0 301 Default units: scalar. Default value: 1.
tomwalters@0 302 .RE
tomwalters@0 303 .RS
tomwalters@0 304 .LP
tomwalters@0 305 Note: There is an internal gain of 20.0 within the software of
tomwalters@0 306 the Meddis haircell model itself. The total gain is therefore
tomwalters@0 307 20.0 times the value for gain_med.
tomwalters@0 308 .RE
tomwalters@0 309 .LP
tomwalters@0 310 .SH REFERENCES
tomwalters@0 311 .LP
tomwalters@0 312 .RE
tomwalters@0 313 .TP 4
tomwalters@0 314 Giguere, C. and Woodland, P.C. (1994). A computational model of
tomwalters@0 315 the auditory periphery for speech and hearing research. I. Ascending
tomwalters@0 316 path. J.Acoust. Soc. Am. 95: 331-342.
tomwalters@0 317 .RE
tomwalters@0 318 .LP
tomwalters@0 319 .TP 4
tomwalters@0 320 Holdsworth, J. (1990). Two-Dimensional adaptive thresholding.
tomwalters@0 321 Annex 4 of AAM-HAP Report 1, APU contract Report.
tomwalters@0 322 .RE
tomwalters@0 323 .LP
tomwalters@0 324 .TP 4
tomwalters@0 325 Holdsworth, J. and Patterson, R.D. (1993). "Analysis of waveforms,"
tomwalters@0 326 UK Patent GB 2234078B.
tomwalters@0 327 .LP
tomwalters@0 328 .TP 4
tomwalters@0 329 Meddis, R., Hewitt, M. and Shackleton, T. (1990). Implementation
tomwalters@0 330 details of a computational model of the inner-haircell/auditory-nerve
tomwalters@0 331 synapse. J.Acoust. Soc. Am. 87: 1813-1816.
tomwalters@0 332 .RE
tomwalters@0 333 .LP
tomwalters@0 334 .SH EXAMPLES
tomwalters@0 335 .LP
tomwalters@0 336 The following command generates the neural activity pattern using the
tomwalters@0 337 gammatone auditory filterbank (the default) and the adaptive
tomwalters@0 338 thresholding (the default) for an input file named cegc:
tomwalters@0 339 .RE
tomwalters@0 340 .LP
tomwalters@0 341 example% gennap cegc
tomwalters@0 342 .RE
tomwalters@0 343 .LP
tomwalters@0 344 The following command generates the neural activity pattern using the
tomwalters@0 345 gammatone filterbank (the default) and Meddis haircell
tomwalters@0 346 transduction for input cegc:
tomwalters@0 347 .RE
tomwalters@0 348 .LP
tomwalters@0 349 example% gennap compress=off transduction=meddis cegc
tomwalters@0 350 .RE
tomwalters@0 351 .LP
tomwalters@0 352 The following command generates the neural activity pattern using the
tomwalters@0 353 transmission line filterbank and Meddis haircell transduction for cegc:
tomwalters@0 354 .RE
tomwalters@0 355 .LP
tomwalters@0 356 example% gennap filter=tlf compress=off transduction=meddis cegc
tomwalters@0 357 .LP
tomwalters@0 358 .SH FILES
tomwalters@0 359 .LP
tomwalters@0 360 .TP 13
tomwalters@0 361 .gennaprc
tomwalters@0 362 The options file for gennap.
tomwalters@0 363 .LP
tomwalters@0 364 .SH SEE ALSO
tomwalters@0 365 .LP
tomwalters@0 366 genepn, gencgm, genbmm
tomwalters@0 367 .LP
tomwalters@0 368 .SH COPYRIGHT
tomwalters@0 369 .LP
tomwalters@0 370 Copyright (c) Applied Psychology Unit, Medical Research Council, 1995
tomwalters@0 371 .LP
tomwalters@0 372 Permission to use, copy, modify, and distribute this software without fee
tomwalters@0 373 is hereby granted for research purposes, provided that this copyright
tomwalters@0 374 notice appears in all copies and in all supporting documentation, and that
tomwalters@0 375 the software is not redistributed for any fee (except for a nominal
tomwalters@0 376 shipping charge). Anyone wanting to incorporate all or part of this
tomwalters@0 377 software in a commercial product must obtain a license from the Medical
tomwalters@0 378 Research Council.
tomwalters@0 379 .LP
tomwalters@0 380 The MRC makes no representations about the suitability of this
tomwalters@0 381 software for any purpose. It is provided "as is" without express or
tomwalters@0 382 implied warranty.
tomwalters@0 383 .LP
tomwalters@0 384 THE MRC DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING
tomwalters@0 385 ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS, IN NO EVENT SHALL
tomwalters@0 386 THE A.P.U. BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES
tomwalters@0 387 OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS,
tomwalters@0 388 WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION,
tomwalters@0 389 ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS
tomwalters@0 390 SOFTWARE.
tomwalters@0 391 .LP
tomwalters@0 392 .SH ACKNOWLEDGEMENTS
tomwalters@0 393 .LP
tomwalters@0 394 The AIM software was developed for Unix workstations by John
tomwalters@0 395 Holdsworth and Mike Allerhand of the MRC APU, under the direction of
tomwalters@0 396 Roy Patterson. The physiological version of AIM was developed by
tomwalters@0 397 Christian Giguere. The options handler is by Paul Manson. The revised
tomwalters@0 398 SAI module is by Jay Datta. Michael Akeroyd extended the postscript
tomwalters@0 399 facilites and developed the xreview routine for auditory image
tomwalters@0 400 cartoons.
tomwalters@0 401 .LP
tomwalters@0 402 The project was supported by the MRC and grants from the U.K. Defense
tomwalters@0 403 Research Agency, Farnborough (Research Contract 2239); the EEC Esprit
tomwalters@0 404 BR Porgramme, Project ACTS (3207); and the U.K. Hearing Research Trust.
tomwalters@0 405