Mercurial > hg > aim92
comparison man/man1/genbmm.1 @ 0:5242703e91d3 tip
Initial checkin for AIM92 aimR8.2 (last updated May 1997).
author | tomwalters |
---|---|
date | Fri, 20 May 2011 15:19:45 +0100 |
parents | |
children |
comparison
equal
deleted
inserted
replaced
-1:000000000000 | 0:5242703e91d3 |
---|---|
1 .TH GENBMM 1 "11 April 1994" | |
2 .LP | |
3 .SH NAME | |
4 .LP | |
5 genbmm \- generate basilar membrane motion | |
6 .LP | |
7 .SH SYNOPSIS | |
8 .LP | |
9 genbmm [ option=value | -option ] [ filename ] | |
10 .LP | |
11 .SH DESCRIPTION | |
12 .LP | |
13 The genbmm module of the AIM software simulates the spectral analysis | |
14 performed by the auditory system using a bank of auditory filters. | |
15 Specifically, genbmm converts an input wave into an array of filtered | |
16 waves, one for each channel of the filterbank. The surface of the | |
17 array of filtered waves is AIM's representation of basilar membrane | |
18 motion (BMM) as a function of time. AIM provides two alternative | |
19 methods for generating the BMM, linear, gammatone filterbank | |
20 (Patterson et al, 1988; Slaney 1993, Cooke, 1993), or a nonlinear, | |
21 transmission-line filterbank (Giguere and Woodland, 1994). For | |
22 convenience, they are referred to as the 'functional' filterbank and | |
23 the 'physiological' filterbank, respectively. | |
24 .LP | |
25 .SH OPTIONS | |
26 .LP | |
27 There are three sets of options for genbmm; they are grouped by | |
28 function and identified by the suffixes _afb, _gtf and _tlf. The first | |
29 set controls the distribution of the filtered waves across frequency | |
30 (suffix _afb); the second specifies the shape of the gammatone filter | |
31 (suffix _gtf); and the third specifies the shape of the transmission | |
32 line filter (suffix _tlf). These three groups of options are the | |
33 subject of this manual entry, together with an option that specifies | |
34 the filter choice (gtf or tlf), and an option that specifies whether a | |
35 middle ear function should be used with the gtf filterbank. | |
36 .LP | |
37 .SS "The Outer/Middle Ear function: middle_ear " | |
38 .PP | |
39 In the auditory system the middle ear causes a progressive attenuation | |
40 of sound energy in the region below about 500 Hz and a progressive | |
41 attenuation in the region above about 4000 Hz. There is also a | |
42 primary auditory canal resonance around 2700 Hz that provides a boost | |
43 in sound transmission. The resulting transfer function is a normal | |
44 aspect of auditory processing and preceeds spectral analysis. If the | |
45 functional filterbank is chosen (gtf), the outer/middle ear filter | |
46 acts directly on the input wave, and the stapes velocity wave it | |
47 generates is the input to the spectral filtering stage. If the | |
48 physiological filterbank is chosen (tlf), the outer/middle ear and | |
49 cochlear filter are performed simultaneously as in the auditory | |
50 system. The only parameter associated with this function is the | |
51 middle_ear switch which makes it possible to turn the outer/middle ear | |
52 filtering off when the functional filterbank is chosen. | |
53 .LP | |
54 .TP 13 | |
55 middle_ear | |
56 Outer/middle ear switch | |
57 .RS | |
58 Switch. Default: on. | |
59 .RE | |
60 .RS | |
61 .LP | |
62 It is also possible to specify a floating point number, in which | |
63 case the middle ear output is multiplied by that value. | |
64 .RE | |
65 .LP | |
66 Note: The middle_ear option is ignored if option filter (see below) | |
67 is set to tlf. This is because the outer/middle stage and the | |
68 cochlear stage are bidirectionally coupled in the transmission | |
69 line filter implementation, and cannot be separated. | |
70 .RE | |
71 .LP | |
72 .SS "The Auditory FilterBank options: _afb " | |
73 .PP | |
74 The distribution of the filters across frequency and the total | |
75 number of output filters in the bank are determined by four parameters: | |
76 channels_afb, mincf_afb, maxcf_afb, and dencf_afb. | |
77 .LP | |
78 .TP 13 | |
79 channels_afb | |
80 The number of channels in the filterbank. | |
81 .RS | |
82 Default unit: filters. Default value: 75 | |
83 .RE | |
84 .TP 13 | |
85 mincf_afb | |
86 The minimum centre frequency | |
87 .RS | |
88 Default unit: Hz. Default value: 100 Hz. | |
89 .RE | |
90 .TP 13 | |
91 maxcf_afb | |
92 The maximum centre frequency | |
93 .RS | |
94 Default unit: Hz. Default value: 6000 Hz. | |
95 .RE | |
96 .TP 13 | |
97 dencf_afb | |
98 The density of the filters in the filterbank. | |
99 .RS | |
100 Units: filters/critical band. Default: off | |
101 .RE | |
102 .RS | |
103 .LP | |
104 dencf_afb provides an alternative method of specifying the number of channels | |
105 in terms of the density of filters along the frequency scale. | |
106 .RE | |
107 .LP | |
108 Note: channels_afb overrides dencf_afb whenever it has a non-zero | |
109 value. The values of dencf_afb and channels_afb may conflict at | |
110 this point, in which case dencf_afb is ignored. | |
111 .RE | |
112 .LP | |
113 WARNING: When using the transmission line filter (filter=tlf), the | |
114 channel density should be 3 or more filters/erb. Using a lower | |
115 density may lead to excessive spatial discretization errors (see | |
116 Giguere and Woodland (1994) for a discussion). To view a small number | |
117 of channels, use a reasonable density and reduce the number of | |
118 displayed channels using option downchannel. | |
119 .LP | |
120 .TP 14 | |
121 audiogram_afb | |
122 The audiogram | |
123 .RS | |
124 Units: none. Default: off. Status: obsolete. | |
125 .RE | |
126 .LP | |
127 Note: In the versions up to and including AIM R6.15, this parameter | |
128 was used as a means of approximating equal loudness contours, as well | |
129 as middle ear attenuation. It applies a spectral weighting function at | |
130 the output of the filterbank. With the addition of the outer/middle | |
131 ear transfer function, this parameter is obsolete, and so the default | |
132 value is off. Users who wish to use the audiogram parameter instead of | |
133 the new outer/middle filter as a loudness equilisation function can | |
134 still do so by setting audiogram_afb=on and middle_ear=off. As before, | |
135 audiogram_afb is applied as a power function and so as the value of | |
136 audiogram_afb decreases from 1 to 0, the degree of attenuation | |
137 decreases. Values greater than unity are allowed but their | |
138 interpretation is unclear. | |
139 .RE | |
140 .LP | |
141 The ERB scale for the gammatone auditory filterbank | |
142 is specificed with three options: bwmin_afb, quality_afb, | |
143 and mmerb_afb. | |
144 .LP | |
145 .TP 13 | |
146 bwmin_afb | |
147 The minimum bandwidth for an auditory filter. | |
148 .RS | |
149 Default unit: Hz. Default value: 24.7 | |
150 .RE | |
151 .TP 13 | |
152 quality_afb | |
153 The limiting quality factor for high frequency auditory filters. | |
154 .RS | |
155 Units: scalar. Default: 9.265 | |
156 .RE | |
157 .TP 13 | |
158 mmerb_afb | |
159 The length of one erb-rate unit along the basilar membrane. | |
160 .RS | |
161 Units: mm. Default: 0.89 | |
162 .RE | |
163 .LP 13 | |
164 A listing of the parameters for the filter in the bank can be directed | |
165 to the terminal at run time by setting info_afb=on. | |
166 .RE | |
167 .TP | |
168 info_afb | |
169 Print filterbank information to stderr. | |
170 Switch. Default: off. | |
171 .RE | |
172 .LP | |
173 The physiological data on human cochlear frequency-position | |
174 function (Greenwood, 1990) and the psychoacoustic data on auditory | |
175 filter bandwidth (Patterson and Moore, 1986) indicate that the | |
176 spectral analysis performed in the cochlea is like a 'constant Q' | |
177 system (quality_afb) that asymptotes to a minimum filter bandwidth | |
178 (bwmin_afb) at low centre frequencies. That is, | |
179 .PD 0 | |
180 .LP | |
181 .PD 4 | |
182 .LP | |
183 erb = bwmin_afb + centre-frequency/quality_afb. | |
184 .PD 0 | |
185 .LP | |
186 .PD 4 | |
187 .LP | |
188 If we assume, as Greenwood suggests, that each filter bandwidth | |
189 corresponds to a constant distance (mmerb_afb) along the basilar | |
190 membrane, it is possible to scale frequency in terms of erb units (or | |
191 position along the basilar membrane) by integrating the inverse of the | |
192 erb function above. | |
193 .RE | |
194 .LP | |
195 Glasberg and Moore (1990) have reviewed the available human filter | |
196 shape data and concluded that the optimum values for bwmin_afb and | |
197 quality_afb are 24.7 and 9.265, respectively, together with mmerb_afb | |
198 of 0.89. (As a rule of thumb for rapid estimation, erb = 25 + 10% of | |
199 cf ). The auditory scale used by Greenwood (1990) can be specified by | |
200 setting bwmin_afb=22.85, quality_afb=7.238 and mmerb_afb=1.0. A | |
201 reasonable approximation to the Bark scale (Zwicker, 1961) is obtained | |
202 by setting bwmin_afb=80, quality_afb=6.5 and mmerb_afb=0.89. | |
203 .RE | |
204 .LP | |
205 .SS "Auditory filter design: filter " | |
206 .PP | |
207 The choice of filterbank -- linear gammatone or nonlinear transmission | |
208 line -- is determined by option filter. | |
209 .LP | |
210 .TP 13 | |
211 filter | |
212 The auditory filter design | |
213 .RS | |
214 Default: gtf. Choices: gtf, tlf, off. | |
215 .RE | |
216 .LP | |
217 When gtf is specified, the options below with suffix _gtf apply, and | |
218 when tlf is specified, the options below with suffix _tlf apply. When | |
219 off is specified, the input wave (or the stapes velocity) is passed on | |
220 directly to the next stage. This provides for non-auditory use of the | |
221 modules following the filterbank with their associated displays. For | |
222 example, the envelope of the input wave (or stapes velocity) can be | |
223 extracted using the rectification and integration modules that follow | |
224 genbmm. The entry point genasa has the most convenient default | |
225 settings for this purpose. The default value for the filter option is | |
226 gtf. | |
227 .RE | |
228 .LP | |
229 .SS "The GammaTone Filter options: _gtf " | |
230 .LP | |
231 .TP 13 | |
232 order_gtf | |
233 The order of the gammatone filter | |
234 .RS | |
235 Units: none. Default: 4 | |
236 .RE | |
237 .RS | |
238 .LP | |
239 The order of the filter, order_gtf, determines the number of filtering | |
240 stages and so it determines the slope of the skirts of the attenuation | |
241 function and their extent. The default value is 4 and the range of | |
242 useful values is from about 2 to 8. The processing time increases | |
243 linearly with order above about order 2. | |
244 .RE | |
245 .LP | |
246 Note that the bandwidth calculation takes account of the fact that | |
247 changes in order_gtf affect bandwidth. Thus, as long as bwmin_afb is | |
248 fixed, changing the order will not affect the bandwidths of the | |
249 resulting filters. Increasing the order of the filter increases the | |
250 delay of the onset of the impulse response but it has little effect on | |
251 the shape of the envelope of the impulse response for orders greater | |
252 than three. The human auditory system is not sensitive to small phase | |
253 changes between filter channels (Patterson, 1987) and so filter order | |
254 is not well constrained by human experimental data. The default value | |
255 (4) is used because this value provides the best match between the | |
256 amplitude characteristics of the gammatone and roex filters for humans | |
257 (Patterson et al., 1988). | |
258 .LP | |
259 .TP 13 | |
260 gain_gtf | |
261 Filter output amplification | |
262 .RS | |
263 Units: scalar. Default: 4. | |
264 .RE | |
265 .RS | |
266 .LP | |
267 The ratio of input to output level across the auditory filter | |
268 when the input is a sinusoid at the cf of the filter. | |
269 .RE | |
270 .TP 13 | |
271 phase_gtf | |
272 The phase of the impulse response (obsolete) | |
273 .RS | |
274 Units: none. Default: 0. | |
275 .RE | |
276 .LP | |
277 In the absence of phase compensation, the surface of basilar membrane | |
278 motion has a strong rightward skew in the low-frequency channels | |
279 because the filters get progressively narrower as centre frequency | |
280 decreases, and this narrowing is accompanied by a slower filter | |
281 response. There are occassionally non-auditory reasons for wanting to | |
282 align the channels across frequency in one way or another. The | |
283 software provides four alignment systems which are discussed at the | |
284 end of this entry just before the references under the title Phase | |
285 Alignment. | |
286 .RE | |
287 .LP | |
288 .SS "The Transmission Line Filter options: _tlf " | |
289 .LP | |
290 .TP 13 | |
291 motion_tlf | |
292 The basilar membrane output motion variable | |
293 .RS | |
294 Default: vel. Choices: vel, disp. | |
295 .RE | |
296 .RS | |
297 .LP | |
298 If vel (velocity) is specified, the output of genbmm | |
299 is the basilar membrane velocity. If disp (displacement) | |
300 is specified, the output of genbmm is the basilar membrane | |
301 displacement. The default value is vel. | |
302 .RE | |
303 .TP 13 | |
304 outdencf_tlf | |
305 The density of the filters outside the display | |
306 range. | |
307 .RS | |
308 Units: filters/critical band. Default: 4. | |
309 .RE | |
310 .RS | |
311 .LP | |
312 In the transmission line filter implementation, it is necessary to | |
313 simulate the basilar membrane over its entire length. The option | |
314 outdencf_tlf provides a means of specifying the number of additional | |
315 channels that must be computed at the basal and apical ends of the | |
316 cochlea, ie. outside the range specified by mincf_afb and maxcf_afb | |
317 (see above). These additional channels are only computed for internal | |
318 use and are not passed to the next stage of processing. | |
319 .RE | |
320 .TP 13 | |
321 qref_tlf | |
322 The local quality factor of each basilar membrane channel | |
323 .RS | |
324 Units: scalar. Default: 2. | |
325 .RE | |
326 .RS | |
327 .LP | |
328 Note: With the transmission line filter, the bandwidth is not | |
329 determined by options bwmin_afb and quality_afb at high levels but | |
330 rather by option qref_tlf (see above). | |
331 .RE | |
332 .TP 13 | |
333 feedback_tlf | |
334 The feedback gain of the outer hair cell circuit | |
335 .RS | |
336 Units: scalar. Default: 0.99 | |
337 .RE | |
338 .RS | |
339 .LP | |
340 WARNING: A value for feedback_afb greater than or equal to 1.0 can | |
341 lead to unstable behaviour at low-levels (ie. oscillation). However, | |
342 the model output will not grow unbound. The growth of the oscillations | |
343 will be limited by the saturating nonlinearity of the outer hair cell | |
344 circuit, and the model output will go into a kind of limit-cycle. | |
345 These model oscillations have not yet been studied in detail and are | |
346 likely to deviate substantially from real cochlear emissions. | |
347 .RE | |
348 .TP 13 | |
349 dsat_tlf | |
350 The basilar membrane displacement at the half-saturation point | |
351 of the outer hair cell circuit | |
352 .RS | |
353 Units: cm. Default: 5.75e-6 | |
354 .RE | |
355 .TP 13 | |
356 gain_tlf | |
357 Filter output amplification | |
358 .RS | |
359 Units: scalar. Default: 4. | |
360 .RE | |
361 .RS | |
362 .LP | |
363 Note: There is an internal gain of 4.0 within the software of | |
364 the transmission line model itself. The total gain is therefore | |
365 4.0 times the value for gain_tlf. | |
366 .RE | |
367 .LP | |
368 Note: A linearized version of the transmission line filter with | |
369 roughly the same bandwidth as the gammatone filter can be obtained by | |
370 setting feedback_tlf=0 and qref_tlf to about 10. The main difference | |
371 is that the low-frequency skirt of the transmission line filter is | |
372 less steep than that of the gammatone. | |
373 .LP | |
374 .SH FURTHER DESCRIPTION | |
375 .LP | |
376 .SS "The distribution of filter centres along the ERB scale. " | |
377 .LP | |
378 Given values for mincf_afb maxcf_afb and channels_afb (or | |
379 dencf_afb), the program creates an array of centre frequencies | |
380 in three steps: | |
381 .LP | |
382 1. It centres a filter at 1.0 kHz. | |
383 .RE | |
384 .LP | |
385 2. Then it centres filters below 1.0 kHz, one after another, | |
386 until it encounters mincf_afb. (Thus, mincf_afb is actually the | |
387 frequency below which no filters are centred). The step size, | |
388 that is the distance between centre frequencies, is determined | |
389 by dencf_afb. When dencf_afb is equal to one, the centre | |
390 frequencies are 1 ERB apart. The ERB is the Equivalent | |
391 Rectangular Bandwidth of the filter (about 14% larger than the 3 | |
392 dB bandwidth of the filter). The function relating the ERB to the | |
393 centre frequency of the filter is taken from a `critical band' | |
394 equation introduced by Greenwood (1961) and adapted to human | |
395 auditory masking by Glasberg and Moore (1990). | |
396 .RE | |
397 .LP | |
398 3. Finally, the program centres filters one after another in | |
399 the region above 1 kHz until it encounters maxcf_afb (which is, | |
400 actually, the frequency above which no filters are centred). When | |
401 dencf_afb is increased, say to two, the program allocates two | |
402 filters per critical band and spaces them at half ERB steps. | |
403 .RE | |
404 .LP | |
405 Note: It is not the bandwidths of the filters that are | |
406 controlled by dencf_afb but rather the density of filters along | |
407 the frequency axis. Thus, doubling dencf_afb does not cause the | |
408 bandwidth of the filters to be halved; rather it results in more | |
409 overlap between adjacent filters. With regard to the images | |
410 produced by genbmm, dencf_afb determines the density of lines on | |
411 the surface rather than the shape of the features that appear on | |
412 the surface. | |
413 .LP | |
414 .SH MOTIVATION | |
415 .LP | |
416 The motivation for adopting the gammatone filter shape is | |
417 threefold: | |
418 .LP | |
419 1. It provides an excellent summary of physiological data | |
420 concerning the impulse response of primary auditory neurons in | |
421 small mammals such as cats (de Boer and de Jongh, 1978; Carney and | |
422 Yin, 1989) | |
423 .RE | |
424 .LP | |
425 2. The amplitude characteristic of the gammatone filter is very | |
426 similar to that of the Roex filter commonly used to represent the | |
427 human auditory filter (Patterson, et al, 1982; Schofield, 1985; | |
428 Patterson et al, 1988) . | |
429 .RE | |
430 .LP | |
431 3. There are recursive gammatone filters that make the calculation | |
432 particularly fast both on general purpose computers and special | |
433 purpose DSP chips (Holdsworth et al, 1988; Cooke, 1993; Slaney, 1993). | |
434 .RE | |
435 .LP | |
436 In summary, the gammatone filter is designed to provide a reasonable | |
437 trade-off between accuracy in simulating basilar membrane motion, and | |
438 computational load. | |
439 .RE | |
440 .LP | |
441 The motivation for adopting the transmission line filter is | |
442 as follows: | |
443 .LP | |
444 1. The outer hair cell circuit of the transmission line filter is | |
445 level dependent and so this design produces level-dependent basilar | |
446 membrane tuning curves (Giguere and Woodland, 1994). There is now | |
447 ample evidence that the basilar membrane motion is indeed highly | |
448 nonlinear and a major source of level compression (eg. Johnstone et | |
449 al., 1986). | |
450 .LP | |
451 2. The internal structure of the transmission line filter model is | |
452 based on the physics of the auditory periphery and therefore provides | |
453 a more realistic cochlear simulation than parallel filterbanks. It | |
454 generates combination tones of the form 2f1-f2 as observed in the | |
455 auditory system and it has the potential to generate cochlear echoes. | |
456 .LP | |
457 3. The wave-digital-filter implementation of the transmission line | |
458 filterbank is only about twice as slow as the gammatone filterbank | |
459 for an equivalent number of channels. | |
460 .RE | |
461 .LP | |
462 .SH "Phase Alignment" | |
463 .LP | |
464 There is no question that the output of the cochlea has a phase lag | |
465 corresponding to the strong rightward skew. However, perceptual | |
466 evidence indicates that this phase lag has to be enormous (> 4ms) to | |
467 affect what we hear; indeed, reversing the phase lag with synthetic | |
468 stimuli does not change what we hear (Patterson, 1987). Phase | |
469 information that appears in the basilar membrane motion but which we | |
470 do not hear, is removed in the third module by the strobe mechanism of | |
471 the temporal integration process. As a result, the stabilised auditory | |
472 images are always phase aligned even though the basilar membrane | |
473 motion and the neural activity patterns are not. | |
474 .RE | |
475 .LP | |
476 Prior to discovering the integration mechanism, we wanted to find | |
477 a way of reducing the skew from the basilar membrane image, in | |
478 order to provide a visual representation that was more like what | |
479 we hear. The genbmm program provides the following options for | |
480 phase aligning the responses of successive filters, determined | |
481 by the value of the option phase_gtf: | |
482 .RE | |
483 .LP | |
484 Value Effect | |
485 .PP | |
486 .TP 7 | |
487 -1 | |
488 Envelope alignment. | |
489 .RS | |
490 Shift the channels of output horizontally so that the points of | |
491 maximum response to an impulse (ie the envelope maxima) will be aligned. | |
492 .RE | |
493 .TP 7 | |
494 -2 | |
495 Envelope plus fine structure alignment. | |
496 .RS | |
497 Perform envelope-peak alignment as in option -1 and then shift the | |
498 fine structure phase in each channel so that a fine- structure peak | |
499 coincides with the envelope peak. | |
500 .RE | |
501 .TP 7 | |
502 -4 | |
503 Envelope plus peak alignment, `left justified'. | |
504 .RS | |
505 Align the envelopes and fine structure of all of the impulse responses | |
506 along the left edge of the image. | |
507 .RE | |
508 .TP 6 | |
509 0 | |
510 No phase compensation. | |
511 .TP 7 | |
512 +n | |
513 Advance each channel by n cycles of the centre frequency of the channel. | |
514 Approximate envelope alignment is achieved using phase_gtf = 3 | |
515 or 4. | |
516 .RE | |
517 .LP | |
518 We experimented with a number of phase compensation schemes | |
519 (Patterson et al., 1989) and concluded that the best option was | |
520 envelope plus peak alignment which corresponds to a value of | |
521 phase_gtf = -4. Accordingly, we recommend the use of phase_gtf | |
522 values of 0 (ie no phase compensation) or -4 (envelope plus peak | |
523 alignment). The remaining options are occasionally useful and so | |
524 they have been left in the software. | |
525 Note that for any phase compensation option other than 0 the time | |
526 scale is strictly correct only for the lowest channel. For any | |
527 other channel, the origin of the abscissa is offset to the right | |
528 by an amount equal to the difference between `the envelope peak | |
529 time of the lowest-frequency channel' and `the envelope peak time | |
530 of the given channel'. | |
531 .RE | |
532 .LP | |
533 .SH REFERENCES | |
534 .LP | |
535 .RE | |
536 .TP 4 | |
537 de Boer, E., and de Jongh, H.R. (1978). On cochlear encoding: | |
538 potentialities and limitations of the reverse-correlation | |
539 technique, J. Acoust. Soc. Am., 63, 115-135. | |
540 .RE | |
541 .LP | |
542 .TP 4 | |
543 Carney, L.H. and Yin, C.T. (1988) 'Temporal coding of resonances | |
544 by low-frequency auditory nerve fibers: Single fibre responses | |
545 and a population model', J.Neurophysiology, 60, 1653-1677. | |
546 .RE | |
547 .LP | |
548 .TP 4 | |
549 Cooke, M.P. (1993). Modelling Auditory Processing and | |
550 Organisation, Cambridge University Press. | |
551 .RE | |
552 .LP | |
553 .TP 4 | |
554 Giguere, C. and Woodland, P.C. (1994). A computational model of | |
555 the auditory periphery for speech and hearing research. I. Ascending | |
556 path. J.Acoust. Soc. Am. 95: 331-342. | |
557 .RE | |
558 .LP | |
559 .TP 4 | |
560 Glasberg, B.R. and B.C.J. Moore (1990). Derivation of auditory | |
561 filter shapes from notched-noise data. Hearing Research, 47, | |
562 103-138. | |
563 .RE | |
564 .LP | |
565 .TP 4 | |
566 Greenwood, D.D. (1961) 'Critical bandwidth and the frequency | |
567 coordinates of the basilar membrane', J. Acoust. Soc. Am. 33, | |
568 1344-1356. | |
569 .RE | |
570 .LP | |
571 .TP 4 | |
572 Greenwood, D.D. (1990). A cochlear frequency-position function | |
573 for several species - 29 years later. J. Acoust. Soc. Am., 87, | |
574 2592-2605. | |
575 .RE | |
576 .LP | |
577 .TP 4 | |
578 Holdsworth, J., Nimmo-Smith, I., Patterson, R.D. and Rice, P. | |
579 (1988) Annex C of 'Spiral Vos Final Report, Part A: The | |
580 Auditory Filterbank,' APU report 2341. | |
581 .RE | |
582 .LP | |
583 .TP 4 | |
584 Johnstone, B.M. et al. (1986). Hear Res. 22: 147-153. | |
585 .RE | |
586 .LP | |
587 .TP 4 | |
588 Moore, B.C.J and Glasberg, B.R. (1983), "Suggested formulae for | |
589 calculating auditory filter bandwidths and excitiation patterns," | |
590 J. Acoust. Soc. Am. 74, pp 750-753. | |
591 .RE | |
592 .LP | |
593 .TP 4 | |
594 Patuzzi, R., and Robertson, D. (1988) "Tuning in the mammalian | |
595 cochlea," Physiological Reviews 68, 1009-1082. | |
596 .RE | |
597 .LP | |
598 .TP 4 | |
599 Patterson, R.D. (1976). Auditory filter shapes derived with | |
600 noise stimuli. J. Acoust. Soc. Am., 59, 640-654. | |
601 .RE | |
602 .LP | |
603 .TP 4 | |
604 Patterson, R.D. (1987). A pulse ribbon model of monaural phase | |
605 perception. J. Acoust. Soc. Am., 82, 1560-1586. | |
606 .RE | |
607 .LP | |
608 .TP 4 | |
609 Patterson, R.D., Nimmo-Smith, I., Weber, D.L., and Milroy, R. | |
610 (1982). The deterioration of hearing with age: Frequency | |
611 selectivity, the critical ratio, the audiogram, and speech | |
612 threshold. J. Acoust. Soc. Am., 72, 1788-1803. | |
613 .RE | |
614 .LP | |
615 .TP 4 | |
616 Patterson, R.D., Allerhand, M.H. and Holdsworth, J. (1992) | |
617 'Auditory representations of speech sounds', In Visual | |
618 representations of speech signals, Eds. Martin Cooke and Steve | |
619 Beet, John Wiley & Sons. 307-314. | |
620 .RE | |
621 .LP | |
622 .TP 4 | |
623 Patterson, R. D., Holdsworth, J., Nimmo-Smith, I., and Rice, P. | |
624 (1988). SVOS Final Report: The Auditory Filterbank. APU report 2341. | |
625 .RE | |
626 .LP | |
627 .TP 4 | |
628 Patterson, R.D. and B.C.J. Moore (1986). Auditory filters and | |
629 excitation patterns as representations of frequency | |
630 resolution. In: Frequency Selectivity in Hearing (B. C. J. | |
631 Moore, ed.), pp. 123-177. Academic Press, London. | |
632 .RE | |
633 .LP | |
634 .TP 4 | |
635 Schofield, D. (1985) 'Visualisations of speech based on a model | |
636 of the peripheral auditory system', NPL Report DITC 62/85. | |
637 .RE | |
638 .LP | |
639 .TP 4 | |
640 Slaney, M. (1993) An efficient implementation of the Patterson- | |
641 Holdsworth auditory filter bank. Apple Computer Technical | |
642 Report #35. | |
643 .RE | |
644 .LP | |
645 .TP 4 | |
646 Zwicker, E. (1961) Subdivision of the audible frequency range into | |
647 critical bands (frequenzgruppen). J. Acoust. Soc. Am. 33, 248. | |
648 .LP | |
649 .SH EXAMPLES | |
650 .LP | |
651 The following command generates basilar membrane motion using the | |
652 gammatone filter design (the default) for an input filename cegc: | |
653 .RE | |
654 .LP | |
655 example% genbmm cegc | |
656 .RE | |
657 .LP | |
658 The following command generates basilar membrane motion using the | |
659 gammatone filter design (the default) for a filterbank with cf from | |
660 200 Hz to 5000 Hz at a density of 4 filters/critical band for the same | |
661 input filename: | |
662 .RE | |
663 .LP | |
664 example% genbmm channels=0 mincf=200 maxcf=5000 dencf=4. cegc | |
665 .RE | |
666 .LP | |
667 The following command generates basilar membrane motion using the | |
668 gammatone filter design (the default) and the audiogram function | |
669 instead of the outer/middle ear filter: | |
670 .RE | |
671 .LP | |
672 example% genbmm middle_ear=off audiogram=on cegc | |
673 .RE | |
674 .LP | |
675 The following command generates the basilar membrane motion using the | |
676 transmission line filter design instead of the default gammatone | |
677 filter: | |
678 .RE | |
679 .LP | |
680 example% genbmm filter=tlf cegc | |
681 .RE | |
682 .LP | |
683 The following command generates the basilar membrane motion using the | |
684 transmission line filter design and the auditory scale of Greenwood | |
685 (1990): | |
686 .RE | |
687 .LP | |
688 example% genbmm filter=tlf bwmin=22.85 quality=7.238 mmerb=1.0 cegc | |
689 .RE | |
690 .LP | |
691 The following command generates the basilar membrane motion using the | |
692 transmission line filter design, but with the nonlinear outer hair | |
693 cell feedback mechanism turned off: | |
694 .RE | |
695 .LP | |
696 example% genbmm filter=tlf feedback=off cegc | |
697 .LP | |
698 .SH FILES | |
699 .LP | |
700 .TP 13 | |
701 .genbmmrc | |
702 The options file for genbmm. | |
703 .LP | |
704 .SH SEE ALSO | |
705 .LP | |
706 genasa, gensgm | |
707 .LP | |
708 .SH BUGS | |
709 .LP | |
710 There is a bug in the hiddenline plotting of genbmm. It shows up when | |
711 the surface has deep valleys and there is a large phase delay. The | |
712 negative peaks show through on surfaces where they should be hidden. | |
713 .SH COPYRIGHT | |
714 .LP | |
715 Copyright (c) Applied Psychology Unit, Medical Research Council, 1995 | |
716 .LP | |
717 Permission to use, copy, modify, and distribute this software without fee | |
718 is hereby granted for research purposes, provided that this copyright | |
719 notice appears in all copies and in all supporting documentation, and that | |
720 the software is not redistributed for any fee (except for a nominal | |
721 shipping charge). Anyone wanting to incorporate all or part of this | |
722 software in a commercial product must obtain a license from the Medical | |
723 Research Council. | |
724 .LP | |
725 The MRC makes no representations about the suitability of this | |
726 software for any purpose. It is provided "as is" without express or | |
727 implied warranty. | |
728 .LP | |
729 THE MRC DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING | |
730 ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS, IN NO EVENT SHALL | |
731 THE A.P.U. BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES | |
732 OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, | |
733 WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, | |
734 ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS | |
735 SOFTWARE. | |
736 .LP | |
737 .SH ACKNOWLEDGEMENTS | |
738 .LP | |
739 The AIM software was developed for Unix workstations by John | |
740 Holdsworth and Mike Allerhand of the MRC APU, under the direction of | |
741 Roy Patterson. The physiological version of AIM was developed by | |
742 Christian Giguere. The options handler is by Paul Manson. The revised | |
743 SAI module is by Jay Datta. Michael Akeroyd extended the postscript | |
744 facilites and developed the xreview routine for auditory image | |
745 cartoons. | |
746 .LP | |
747 The project was supported by the MRC and grants from the U.K. Defense | |
748 Research Agency, Farnborough (Research Contract 2239); the EEC Esprit | |
749 BR Porgramme, Project ACTS (3207); and the U.K. Hearing Research Trust. | |
750 |