comparison man/man1/gennap.1 @ 0:5242703e91d3 tip

Initial checkin for AIM92 aimR8.2 (last updated May 1997).
author tomwalters
date Fri, 20 May 2011 15:19:45 +0100
parents
children
comparison
equal deleted inserted replaced
-1:000000000000 0:5242703e91d3
1 .TH GENNAP 1 "8 April 1994"
2 .LP
3 .SH NAME
4 .LP
5 gennap \- generate neural activity pattern
6 .LP
7 .SH SYNOPSIS
8 .LP
9 gennap [ option=value | -option ] [ filename ]
10 .LP
11 .SH DESCRIPTION
12 .LP
13 The gennap module of the AIM software converts an input wave into a
14 simulated neural activity pattern (NAP), which is AIM's representation
15 of the pattern of information in the auditory nerve at about the level
16 of the cochlear nucleus. Gennap begins by calculating the basilar
17 membrane motion (BMM) associated with the input wave using the genbmm
18 module, and then it applies several additional transforms that we know
19 occur in some form during the neural transduction process. AIM
20 provides two alternative methods for generating the NAP, a
21 two-dimensional adaptive thresholding mechanism (Holdsworth and
22 Patterson, 1993), and an array of inner haircell simulators based
23 (Meddis et al., 1990; Giguere and Woodland, 1994). The adaptive
24 thresholding mechanism applies rectification, log compression,
25 adaptation in time, and suppression across frequency; its purpose is
26 to stabilise the level of the membrane activity with compression and
27 then sharpen the features that appear in the compressed membrane
28 motion. Together, the gammatone filterbank and adaptive thresholding
29 form a 'functional' cochlea simulation. The Meddis module applies
30 level-dependant compression and adaptation that simulate the response
31 of inner haircells to membrane motion. The cells are not coupled and
32 so there is no frequency sharpening in this module. Together, the
33 transmission-line filterbank and the Meddis module form a
34 'physiological' cochlea simulation.
35 .LP
36 .SH OPTIONS
37 .LP
38 The options for gennap are grouped according to the functions they
39 control. The adaptive thresholding options are identified by the
40 common suffix _at; the Meddis module options are identified by the
41 common suffix _med. These two groups of options are the subject of
42 this manual entry, together with two additional options that specify
43 whether rectification and compression operations are required before
44 the transduction stage. There is also an option to specify the choice
45 of the transduction function.
46 .LP
47 .SH RECTIFICATION AND COMPRESSION
48 .LP
49 The adaptive thresholding process begins with rectification and log
50 compression of the BMM. It is occasionally useful to have these
51 functions available separately and so the options 'rectify' and
52 'compress' are presented separately in the options list before the
53 neural transduction options.
54 .RE
55 .LP
56 .TP 13
57 rectify
58 Rectification switch
59 .RS
60 Switch. Default value: off.
61 .RE
62 .RS
63 .LP
64 If rectify is on, the BMM is half-wave rectified.
65 The compression operation also performs half-wave rectification (to
66 avoid taking logs of negative numbers). So the rectify option is
67 really here just to provide for rectified BMM in the absence of
68 compression. As a result, the default for option rectify is
69 off. (Note: Full wave rectification is produced if rectify is set to
70 2. This is useful when calculating envelopes with genasa.)
71 .RE
72 .LP
73 .TP 13
74 compress
75 Compression switch
76 .RS
77 Switch. Default value: on.
78 .RE
79 .RS
80 .LP
81 The compressor is strictly logarithmic and so to this point, the
82 functional cochlea simulation is level independent. In the auditory
83 system, the compressor is logarithmic over the lower part of its range
84 and then it asymptotes to a soft limit. The default for option
85 compress is on (note that the compressor also performs half-wave
86 rectification).
87 .RE
88 .LP
89 Important: The default value for option compress is 'on' which assumes
90 that the transduction module is adaptive thresholding (the default for
91 the transduction option described below). If the Meddis transduction
92 module is selected (transduction=med), compress should be set to 'off'
93 to obtain the operation described in Giguerre and Woodland
94 (1994). This can be done on the command line (see EXAMPLES) or in the
95 appropriate .gen???rc files.
96 .RE
97 .LP
98 .SH NEURAL TRANSDUCTION
99 .LP
100 The neural transduction is performed either by two-dimensional
101 adaptive thresholding or an array of Meddis haircells. The choice is
102 controlled by the option 'transduction'.
103 .LP
104 .TP 13
105 transduction
106 The transduction function
107 .RS
108 Switch. Default value: at. Choices: at, med, off.
109 .RE
110 .LP
111 If adaptive thresholding is specified (at), the options with suffix
112 _at below apply; if the Meddis module is specified (med), the options
113 with suffix _med below apply. If off is specified, no transduction
114 function is applied. The default is at.
115 .RE
116 .LP
117 .SS "Two-dimensional adaptive thresholding: _at "
118 .PP
119 The adaptive thresholding mechanism is a functional model of neural
120 encoding. Its purpose is to enhance the contrast of the larger
121 features that appear in the surface of the BMM and reduce those
122 aspects of the representation which are just a direct consequence of
123 the filtering and compression processes (Holdsworth and Patterson,
124 1993). The process begins with rectification and compression of the
125 BMM. The tail of the envelope of the impulse response of the
126 gammatone filter is exponential. As a result, logarithmic compression
127 is used, since this makes the filter decay function linear in NAP
128 coordinates. Following compression, adaptation is applied in time and
129 suppression is applied across frequency.
130 .LP
131 Briefly, an adaptive threshold value is maintained for each channel
132 and updated at the sampling rate. The new value is the largest of a)
133 the previous value reduced by a fast-acting temporal decay factor
134 (t1recovery_at), b) the previous value reduced by a longer-term
135 temporal decay factor (t2recovery_at), c) the adapted level in the
136 channel immediately above, reduced by a frequency spread factor
137 (frecovery_at), d) the adapted level in the channel immediately below,
138 reduced by the same frequency spread factor, or e) a floor level that
139 precludes the mechanism listening to its own internal noise
140 (reclimit_at). The mechanism produces output whenever the input
141 exceeds the adaptive threshold, and the output level is the difference
142 between the input and the adaptive threshold. The adaptation and
143 suppression are coupled, and they jointly sharpen features like vowel
144 formants which appear smeared in compressed BMM.
145 .LP
146 .TP 13
147 trise_at
148 Threshold rise rate
149 .RS
150 Default value: 1000.
151 .RE
152 .RS
153 .LP
154 Upward Adaptation: This option specifies the rate at which the
155 adaptive threshold will rise in response to a rise in signal
156 level. The default value, 1000, means that the adaptive threshold
157 responds very quickly to increases in the input wave; essentially, it
158 follows the envelope of any rise in signal amplitude.
159 .RE
160 .LP
161 Downward Adaptation: Following the cessation of sound, or a rapid drop
162 in input level, temporal adaptation occurs in two stages as determined
163 by t1recovery_at, t2recovery_at and propt2t1_at: If the default values
164 are used, the mechanism initially adapts at a rate slightly slower
165 than the decay rate of the gammatone filter in the given channel, and
166 this represses much of the ringing of the impulse response of the
167 filter. Later the adaptation switches to a slower rate more in line
168 with data on auditory forward masking. The option propt2t1_at
169 determines the point at which the initial fast rate of decay gives way
170 to the slower limiting decay rate.
171 .RE
172 .LP
173 .TP 13
174 t1recovery_at
175 The initial rate of threshold recovery relative to filter decay rate
176 .RS
177 Default value: 0.6.
178 .RE
179 .RS
180 .LP
181 This option determines the initial rate of decay of the adaptive
182 threshold relative to the rate of decay of the auditory filter,
183 provided propt2t1_at is less than unity. Values of t1recovery_at less
184 than unity cause the adaptive threshold to decay more slowly than the
185 auditory filter and thereby to remove the filter response from the
186 representation when it is the sole reason for BMM activity. The rate
187 of decay is linear with respect to the log-compressed BMM, so it is
188 like an exponential decay with respect to the BMM.
189 .RE
190 .LP
191 .TP 13
192 t2recovery_at
193 The secondary threshold recovery rate
194 .RS
195 Default value: 0.2.
196 .RE
197 .RS
198 .LP
199 This option determines the limiting rate of decay of the adaptive
200 threshold when the sound ceases provided propt2t1_at is less than
201 unity. The default value causes the adaptive threshold to decay more
202 slowly than the initial rate as observed in auditory forward masking.
203 Note, however, that the system to this point is level independent,
204 whereas auditory forward masking is level dependent.
205 .RE
206 .LP
207 .TP 13
208 propt2t1_at
209 The point at which t1recovery_at gives way to t2_recovery_at
210 .RS
211 Default value: 0.5.
212 .RE
213 .RS
214 .LP
215 This option determines the point at which the initial fast rate of
216 decay (t1recovery_at) gives way to the slower limiting decay rate
217 (t2recovery_at). The nomanclature assumes that propt2t1_at is a value
218 less than unity. Otherwise the the roles of the initial and limiting
219 decays are reversed.
220 .RE
221 .LP
222 .TP 13
223 frecovery_at
224 Recovery rate across frequency
225 .RS
226 Default value: 20.
227 .RE
228 .RS
229 .LP
230 This parameter specifies the rate at which a threshold value in one channel
231 propagates to influence threshold in neighbouring channels.
232 .RE
233 .LP
234 .TP 13
235 reclimit_at
236 Limitation on recovery level
237 .RS
238 Default units: mB. Default value: 500 mB. (mB=milliBells)
239 .RE
240 .RS
241 .LP
242 In order to prevent the mechanism from encountering system noise,
243 or alternately, to reduce sensitivity to stimulus noise, there is a
244 limit placed on the recovery that the adaptive threshold can achieve.
245 The limit, reclimit_at, is the limit of the sensitivity of the system.
246 .RE
247 .LP
248 .TP 13
249 gain_at
250 Output gain
251 .RS
252 Default units: scalar. Default value: 1.
253 .RE
254 .LP
255 .SS "Meddis haircell model: _med "
256 .PP
257 The purpose of the Meddis module is to simulate neural transduction of
258 BMM as performed by the inner haircells of the cochlea. There is one
259 haircell simulation unit for each output channel of the filterbank.
260 The haircell equations (Meddis et al., 1990) are solved using the wave
261 digital filter algorithm described in Giguere and Woodland (1994). The
262 characteristics of the haircell are controlled by options: fiber_med,
263 thresh_med, and gain_med.
264 .LP
265 .TP 13
266 fiber_med
267 The spontaneous-rate of the simulated fiber
268 .RS
269 Default value: medium. Choices: medium, high.
270 .RE
271 .RS
272 .LP
273 If medium is specified, a medium spontaneous-rate haircell fiber is
274 simulated. If high is specified, a high spontaneous-rate
275 fiber is simulated. The properties of these two types of fibers
276 are listed in Table II in Meddis et al. (1990).
277 The default value is medium.
278 .RE
279 .LP
280 .TP 13
281 thresh_med
282 The threshold shift of the fiber
283 .RS
284 Default Units: dB. Default value: 0.
285 .RE
286 .RS
287 .LP
288 This option shifts the entire rate-intensity function of the haircell
289 fiber horizontally to a higher or lower level, to accomodate changes
290 in the scaling of the input wave. A positive (negative) value
291 increases (decreases) the rate- and saturation-thresholds of the fiber
292 by that amount. This operation does not change the dynamic range, the
293 spontaneous and saturation rates, or the adaptation time constants or
294 synchronization index of the fiber.
295 .RE
296 .LP
297 .TP 13
298 gain_med
299 Output gain
300 .RS
301 Default units: scalar. Default value: 1.
302 .RE
303 .RS
304 .LP
305 Note: There is an internal gain of 20.0 within the software of
306 the Meddis haircell model itself. The total gain is therefore
307 20.0 times the value for gain_med.
308 .RE
309 .LP
310 .SH REFERENCES
311 .LP
312 .RE
313 .TP 4
314 Giguere, C. and Woodland, P.C. (1994). A computational model of
315 the auditory periphery for speech and hearing research. I. Ascending
316 path. J.Acoust. Soc. Am. 95: 331-342.
317 .RE
318 .LP
319 .TP 4
320 Holdsworth, J. (1990). Two-Dimensional adaptive thresholding.
321 Annex 4 of AAM-HAP Report 1, APU contract Report.
322 .RE
323 .LP
324 .TP 4
325 Holdsworth, J. and Patterson, R.D. (1993). "Analysis of waveforms,"
326 UK Patent GB 2234078B.
327 .LP
328 .TP 4
329 Meddis, R., Hewitt, M. and Shackleton, T. (1990). Implementation
330 details of a computational model of the inner-haircell/auditory-nerve
331 synapse. J.Acoust. Soc. Am. 87: 1813-1816.
332 .RE
333 .LP
334 .SH EXAMPLES
335 .LP
336 The following command generates the neural activity pattern using the
337 gammatone auditory filterbank (the default) and the adaptive
338 thresholding (the default) for an input file named cegc:
339 .RE
340 .LP
341 example% gennap cegc
342 .RE
343 .LP
344 The following command generates the neural activity pattern using the
345 gammatone filterbank (the default) and Meddis haircell
346 transduction for input cegc:
347 .RE
348 .LP
349 example% gennap compress=off transduction=meddis cegc
350 .RE
351 .LP
352 The following command generates the neural activity pattern using the
353 transmission line filterbank and Meddis haircell transduction for cegc:
354 .RE
355 .LP
356 example% gennap filter=tlf compress=off transduction=meddis cegc
357 .LP
358 .SH FILES
359 .LP
360 .TP 13
361 .gennaprc
362 The options file for gennap.
363 .LP
364 .SH SEE ALSO
365 .LP
366 genepn, gencgm, genbmm
367 .LP
368 .SH COPYRIGHT
369 .LP
370 Copyright (c) Applied Psychology Unit, Medical Research Council, 1995
371 .LP
372 Permission to use, copy, modify, and distribute this software without fee
373 is hereby granted for research purposes, provided that this copyright
374 notice appears in all copies and in all supporting documentation, and that
375 the software is not redistributed for any fee (except for a nominal
376 shipping charge). Anyone wanting to incorporate all or part of this
377 software in a commercial product must obtain a license from the Medical
378 Research Council.
379 .LP
380 The MRC makes no representations about the suitability of this
381 software for any purpose. It is provided "as is" without express or
382 implied warranty.
383 .LP
384 THE MRC DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING
385 ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS, IN NO EVENT SHALL
386 THE A.P.U. BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES
387 OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS,
388 WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION,
389 ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS
390 SOFTWARE.
391 .LP
392 .SH ACKNOWLEDGEMENTS
393 .LP
394 The AIM software was developed for Unix workstations by John
395 Holdsworth and Mike Allerhand of the MRC APU, under the direction of
396 Roy Patterson. The physiological version of AIM was developed by
397 Christian Giguere. The options handler is by Paul Manson. The revised
398 SAI module is by Jay Datta. Michael Akeroyd extended the postscript
399 facilites and developed the xreview routine for auditory image
400 cartoons.
401 .LP
402 The project was supported by the MRC and grants from the U.K. Defense
403 Research Agency, Farnborough (Research Contract 2239); the EEC Esprit
404 BR Porgramme, Project ACTS (3207); and the U.K. Hearing Research Trust.
405