Mercurial > hg > aim92
comparison man/man1/gennap.1 @ 0:5242703e91d3 tip
Initial checkin for AIM92 aimR8.2 (last updated May 1997).
author | tomwalters |
---|---|
date | Fri, 20 May 2011 15:19:45 +0100 |
parents | |
children |
comparison
equal
deleted
inserted
replaced
-1:000000000000 | 0:5242703e91d3 |
---|---|
1 .TH GENNAP 1 "8 April 1994" | |
2 .LP | |
3 .SH NAME | |
4 .LP | |
5 gennap \- generate neural activity pattern | |
6 .LP | |
7 .SH SYNOPSIS | |
8 .LP | |
9 gennap [ option=value | -option ] [ filename ] | |
10 .LP | |
11 .SH DESCRIPTION | |
12 .LP | |
13 The gennap module of the AIM software converts an input wave into a | |
14 simulated neural activity pattern (NAP), which is AIM's representation | |
15 of the pattern of information in the auditory nerve at about the level | |
16 of the cochlear nucleus. Gennap begins by calculating the basilar | |
17 membrane motion (BMM) associated with the input wave using the genbmm | |
18 module, and then it applies several additional transforms that we know | |
19 occur in some form during the neural transduction process. AIM | |
20 provides two alternative methods for generating the NAP, a | |
21 two-dimensional adaptive thresholding mechanism (Holdsworth and | |
22 Patterson, 1993), and an array of inner haircell simulators based | |
23 (Meddis et al., 1990; Giguere and Woodland, 1994). The adaptive | |
24 thresholding mechanism applies rectification, log compression, | |
25 adaptation in time, and suppression across frequency; its purpose is | |
26 to stabilise the level of the membrane activity with compression and | |
27 then sharpen the features that appear in the compressed membrane | |
28 motion. Together, the gammatone filterbank and adaptive thresholding | |
29 form a 'functional' cochlea simulation. The Meddis module applies | |
30 level-dependant compression and adaptation that simulate the response | |
31 of inner haircells to membrane motion. The cells are not coupled and | |
32 so there is no frequency sharpening in this module. Together, the | |
33 transmission-line filterbank and the Meddis module form a | |
34 'physiological' cochlea simulation. | |
35 .LP | |
36 .SH OPTIONS | |
37 .LP | |
38 The options for gennap are grouped according to the functions they | |
39 control. The adaptive thresholding options are identified by the | |
40 common suffix _at; the Meddis module options are identified by the | |
41 common suffix _med. These two groups of options are the subject of | |
42 this manual entry, together with two additional options that specify | |
43 whether rectification and compression operations are required before | |
44 the transduction stage. There is also an option to specify the choice | |
45 of the transduction function. | |
46 .LP | |
47 .SH RECTIFICATION AND COMPRESSION | |
48 .LP | |
49 The adaptive thresholding process begins with rectification and log | |
50 compression of the BMM. It is occasionally useful to have these | |
51 functions available separately and so the options 'rectify' and | |
52 'compress' are presented separately in the options list before the | |
53 neural transduction options. | |
54 .RE | |
55 .LP | |
56 .TP 13 | |
57 rectify | |
58 Rectification switch | |
59 .RS | |
60 Switch. Default value: off. | |
61 .RE | |
62 .RS | |
63 .LP | |
64 If rectify is on, the BMM is half-wave rectified. | |
65 The compression operation also performs half-wave rectification (to | |
66 avoid taking logs of negative numbers). So the rectify option is | |
67 really here just to provide for rectified BMM in the absence of | |
68 compression. As a result, the default for option rectify is | |
69 off. (Note: Full wave rectification is produced if rectify is set to | |
70 2. This is useful when calculating envelopes with genasa.) | |
71 .RE | |
72 .LP | |
73 .TP 13 | |
74 compress | |
75 Compression switch | |
76 .RS | |
77 Switch. Default value: on. | |
78 .RE | |
79 .RS | |
80 .LP | |
81 The compressor is strictly logarithmic and so to this point, the | |
82 functional cochlea simulation is level independent. In the auditory | |
83 system, the compressor is logarithmic over the lower part of its range | |
84 and then it asymptotes to a soft limit. The default for option | |
85 compress is on (note that the compressor also performs half-wave | |
86 rectification). | |
87 .RE | |
88 .LP | |
89 Important: The default value for option compress is 'on' which assumes | |
90 that the transduction module is adaptive thresholding (the default for | |
91 the transduction option described below). If the Meddis transduction | |
92 module is selected (transduction=med), compress should be set to 'off' | |
93 to obtain the operation described in Giguerre and Woodland | |
94 (1994). This can be done on the command line (see EXAMPLES) or in the | |
95 appropriate .gen???rc files. | |
96 .RE | |
97 .LP | |
98 .SH NEURAL TRANSDUCTION | |
99 .LP | |
100 The neural transduction is performed either by two-dimensional | |
101 adaptive thresholding or an array of Meddis haircells. The choice is | |
102 controlled by the option 'transduction'. | |
103 .LP | |
104 .TP 13 | |
105 transduction | |
106 The transduction function | |
107 .RS | |
108 Switch. Default value: at. Choices: at, med, off. | |
109 .RE | |
110 .LP | |
111 If adaptive thresholding is specified (at), the options with suffix | |
112 _at below apply; if the Meddis module is specified (med), the options | |
113 with suffix _med below apply. If off is specified, no transduction | |
114 function is applied. The default is at. | |
115 .RE | |
116 .LP | |
117 .SS "Two-dimensional adaptive thresholding: _at " | |
118 .PP | |
119 The adaptive thresholding mechanism is a functional model of neural | |
120 encoding. Its purpose is to enhance the contrast of the larger | |
121 features that appear in the surface of the BMM and reduce those | |
122 aspects of the representation which are just a direct consequence of | |
123 the filtering and compression processes (Holdsworth and Patterson, | |
124 1993). The process begins with rectification and compression of the | |
125 BMM. The tail of the envelope of the impulse response of the | |
126 gammatone filter is exponential. As a result, logarithmic compression | |
127 is used, since this makes the filter decay function linear in NAP | |
128 coordinates. Following compression, adaptation is applied in time and | |
129 suppression is applied across frequency. | |
130 .LP | |
131 Briefly, an adaptive threshold value is maintained for each channel | |
132 and updated at the sampling rate. The new value is the largest of a) | |
133 the previous value reduced by a fast-acting temporal decay factor | |
134 (t1recovery_at), b) the previous value reduced by a longer-term | |
135 temporal decay factor (t2recovery_at), c) the adapted level in the | |
136 channel immediately above, reduced by a frequency spread factor | |
137 (frecovery_at), d) the adapted level in the channel immediately below, | |
138 reduced by the same frequency spread factor, or e) a floor level that | |
139 precludes the mechanism listening to its own internal noise | |
140 (reclimit_at). The mechanism produces output whenever the input | |
141 exceeds the adaptive threshold, and the output level is the difference | |
142 between the input and the adaptive threshold. The adaptation and | |
143 suppression are coupled, and they jointly sharpen features like vowel | |
144 formants which appear smeared in compressed BMM. | |
145 .LP | |
146 .TP 13 | |
147 trise_at | |
148 Threshold rise rate | |
149 .RS | |
150 Default value: 1000. | |
151 .RE | |
152 .RS | |
153 .LP | |
154 Upward Adaptation: This option specifies the rate at which the | |
155 adaptive threshold will rise in response to a rise in signal | |
156 level. The default value, 1000, means that the adaptive threshold | |
157 responds very quickly to increases in the input wave; essentially, it | |
158 follows the envelope of any rise in signal amplitude. | |
159 .RE | |
160 .LP | |
161 Downward Adaptation: Following the cessation of sound, or a rapid drop | |
162 in input level, temporal adaptation occurs in two stages as determined | |
163 by t1recovery_at, t2recovery_at and propt2t1_at: If the default values | |
164 are used, the mechanism initially adapts at a rate slightly slower | |
165 than the decay rate of the gammatone filter in the given channel, and | |
166 this represses much of the ringing of the impulse response of the | |
167 filter. Later the adaptation switches to a slower rate more in line | |
168 with data on auditory forward masking. The option propt2t1_at | |
169 determines the point at which the initial fast rate of decay gives way | |
170 to the slower limiting decay rate. | |
171 .RE | |
172 .LP | |
173 .TP 13 | |
174 t1recovery_at | |
175 The initial rate of threshold recovery relative to filter decay rate | |
176 .RS | |
177 Default value: 0.6. | |
178 .RE | |
179 .RS | |
180 .LP | |
181 This option determines the initial rate of decay of the adaptive | |
182 threshold relative to the rate of decay of the auditory filter, | |
183 provided propt2t1_at is less than unity. Values of t1recovery_at less | |
184 than unity cause the adaptive threshold to decay more slowly than the | |
185 auditory filter and thereby to remove the filter response from the | |
186 representation when it is the sole reason for BMM activity. The rate | |
187 of decay is linear with respect to the log-compressed BMM, so it is | |
188 like an exponential decay with respect to the BMM. | |
189 .RE | |
190 .LP | |
191 .TP 13 | |
192 t2recovery_at | |
193 The secondary threshold recovery rate | |
194 .RS | |
195 Default value: 0.2. | |
196 .RE | |
197 .RS | |
198 .LP | |
199 This option determines the limiting rate of decay of the adaptive | |
200 threshold when the sound ceases provided propt2t1_at is less than | |
201 unity. The default value causes the adaptive threshold to decay more | |
202 slowly than the initial rate as observed in auditory forward masking. | |
203 Note, however, that the system to this point is level independent, | |
204 whereas auditory forward masking is level dependent. | |
205 .RE | |
206 .LP | |
207 .TP 13 | |
208 propt2t1_at | |
209 The point at which t1recovery_at gives way to t2_recovery_at | |
210 .RS | |
211 Default value: 0.5. | |
212 .RE | |
213 .RS | |
214 .LP | |
215 This option determines the point at which the initial fast rate of | |
216 decay (t1recovery_at) gives way to the slower limiting decay rate | |
217 (t2recovery_at). The nomanclature assumes that propt2t1_at is a value | |
218 less than unity. Otherwise the the roles of the initial and limiting | |
219 decays are reversed. | |
220 .RE | |
221 .LP | |
222 .TP 13 | |
223 frecovery_at | |
224 Recovery rate across frequency | |
225 .RS | |
226 Default value: 20. | |
227 .RE | |
228 .RS | |
229 .LP | |
230 This parameter specifies the rate at which a threshold value in one channel | |
231 propagates to influence threshold in neighbouring channels. | |
232 .RE | |
233 .LP | |
234 .TP 13 | |
235 reclimit_at | |
236 Limitation on recovery level | |
237 .RS | |
238 Default units: mB. Default value: 500 mB. (mB=milliBells) | |
239 .RE | |
240 .RS | |
241 .LP | |
242 In order to prevent the mechanism from encountering system noise, | |
243 or alternately, to reduce sensitivity to stimulus noise, there is a | |
244 limit placed on the recovery that the adaptive threshold can achieve. | |
245 The limit, reclimit_at, is the limit of the sensitivity of the system. | |
246 .RE | |
247 .LP | |
248 .TP 13 | |
249 gain_at | |
250 Output gain | |
251 .RS | |
252 Default units: scalar. Default value: 1. | |
253 .RE | |
254 .LP | |
255 .SS "Meddis haircell model: _med " | |
256 .PP | |
257 The purpose of the Meddis module is to simulate neural transduction of | |
258 BMM as performed by the inner haircells of the cochlea. There is one | |
259 haircell simulation unit for each output channel of the filterbank. | |
260 The haircell equations (Meddis et al., 1990) are solved using the wave | |
261 digital filter algorithm described in Giguere and Woodland (1994). The | |
262 characteristics of the haircell are controlled by options: fiber_med, | |
263 thresh_med, and gain_med. | |
264 .LP | |
265 .TP 13 | |
266 fiber_med | |
267 The spontaneous-rate of the simulated fiber | |
268 .RS | |
269 Default value: medium. Choices: medium, high. | |
270 .RE | |
271 .RS | |
272 .LP | |
273 If medium is specified, a medium spontaneous-rate haircell fiber is | |
274 simulated. If high is specified, a high spontaneous-rate | |
275 fiber is simulated. The properties of these two types of fibers | |
276 are listed in Table II in Meddis et al. (1990). | |
277 The default value is medium. | |
278 .RE | |
279 .LP | |
280 .TP 13 | |
281 thresh_med | |
282 The threshold shift of the fiber | |
283 .RS | |
284 Default Units: dB. Default value: 0. | |
285 .RE | |
286 .RS | |
287 .LP | |
288 This option shifts the entire rate-intensity function of the haircell | |
289 fiber horizontally to a higher or lower level, to accomodate changes | |
290 in the scaling of the input wave. A positive (negative) value | |
291 increases (decreases) the rate- and saturation-thresholds of the fiber | |
292 by that amount. This operation does not change the dynamic range, the | |
293 spontaneous and saturation rates, or the adaptation time constants or | |
294 synchronization index of the fiber. | |
295 .RE | |
296 .LP | |
297 .TP 13 | |
298 gain_med | |
299 Output gain | |
300 .RS | |
301 Default units: scalar. Default value: 1. | |
302 .RE | |
303 .RS | |
304 .LP | |
305 Note: There is an internal gain of 20.0 within the software of | |
306 the Meddis haircell model itself. The total gain is therefore | |
307 20.0 times the value for gain_med. | |
308 .RE | |
309 .LP | |
310 .SH REFERENCES | |
311 .LP | |
312 .RE | |
313 .TP 4 | |
314 Giguere, C. and Woodland, P.C. (1994). A computational model of | |
315 the auditory periphery for speech and hearing research. I. Ascending | |
316 path. J.Acoust. Soc. Am. 95: 331-342. | |
317 .RE | |
318 .LP | |
319 .TP 4 | |
320 Holdsworth, J. (1990). Two-Dimensional adaptive thresholding. | |
321 Annex 4 of AAM-HAP Report 1, APU contract Report. | |
322 .RE | |
323 .LP | |
324 .TP 4 | |
325 Holdsworth, J. and Patterson, R.D. (1993). "Analysis of waveforms," | |
326 UK Patent GB 2234078B. | |
327 .LP | |
328 .TP 4 | |
329 Meddis, R., Hewitt, M. and Shackleton, T. (1990). Implementation | |
330 details of a computational model of the inner-haircell/auditory-nerve | |
331 synapse. J.Acoust. Soc. Am. 87: 1813-1816. | |
332 .RE | |
333 .LP | |
334 .SH EXAMPLES | |
335 .LP | |
336 The following command generates the neural activity pattern using the | |
337 gammatone auditory filterbank (the default) and the adaptive | |
338 thresholding (the default) for an input file named cegc: | |
339 .RE | |
340 .LP | |
341 example% gennap cegc | |
342 .RE | |
343 .LP | |
344 The following command generates the neural activity pattern using the | |
345 gammatone filterbank (the default) and Meddis haircell | |
346 transduction for input cegc: | |
347 .RE | |
348 .LP | |
349 example% gennap compress=off transduction=meddis cegc | |
350 .RE | |
351 .LP | |
352 The following command generates the neural activity pattern using the | |
353 transmission line filterbank and Meddis haircell transduction for cegc: | |
354 .RE | |
355 .LP | |
356 example% gennap filter=tlf compress=off transduction=meddis cegc | |
357 .LP | |
358 .SH FILES | |
359 .LP | |
360 .TP 13 | |
361 .gennaprc | |
362 The options file for gennap. | |
363 .LP | |
364 .SH SEE ALSO | |
365 .LP | |
366 genepn, gencgm, genbmm | |
367 .LP | |
368 .SH COPYRIGHT | |
369 .LP | |
370 Copyright (c) Applied Psychology Unit, Medical Research Council, 1995 | |
371 .LP | |
372 Permission to use, copy, modify, and distribute this software without fee | |
373 is hereby granted for research purposes, provided that this copyright | |
374 notice appears in all copies and in all supporting documentation, and that | |
375 the software is not redistributed for any fee (except for a nominal | |
376 shipping charge). Anyone wanting to incorporate all or part of this | |
377 software in a commercial product must obtain a license from the Medical | |
378 Research Council. | |
379 .LP | |
380 The MRC makes no representations about the suitability of this | |
381 software for any purpose. It is provided "as is" without express or | |
382 implied warranty. | |
383 .LP | |
384 THE MRC DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING | |
385 ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS, IN NO EVENT SHALL | |
386 THE A.P.U. BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES | |
387 OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, | |
388 WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, | |
389 ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS | |
390 SOFTWARE. | |
391 .LP | |
392 .SH ACKNOWLEDGEMENTS | |
393 .LP | |
394 The AIM software was developed for Unix workstations by John | |
395 Holdsworth and Mike Allerhand of the MRC APU, under the direction of | |
396 Roy Patterson. The physiological version of AIM was developed by | |
397 Christian Giguere. The options handler is by Paul Manson. The revised | |
398 SAI module is by Jay Datta. Michael Akeroyd extended the postscript | |
399 facilites and developed the xreview routine for auditory image | |
400 cartoons. | |
401 .LP | |
402 The project was supported by the MRC and grants from the U.K. Defense | |
403 Research Agency, Farnborough (Research Contract 2239); the EEC Esprit | |
404 BR Porgramme, Project ACTS (3207); and the U.K. Hearing Research Trust. | |
405 |