annotate man/man1/genbmm.1 @ 0:5242703e91d3 tip

Initial checkin for AIM92 aimR8.2 (last updated May 1997).
author tomwalters
date Fri, 20 May 2011 15:19:45 +0100
parents
children
rev   line source
tomwalters@0 1 .TH GENBMM 1 "11 April 1994"
tomwalters@0 2 .LP
tomwalters@0 3 .SH NAME
tomwalters@0 4 .LP
tomwalters@0 5 genbmm \- generate basilar membrane motion
tomwalters@0 6 .LP
tomwalters@0 7 .SH SYNOPSIS
tomwalters@0 8 .LP
tomwalters@0 9 genbmm [ option=value | -option ] [ filename ]
tomwalters@0 10 .LP
tomwalters@0 11 .SH DESCRIPTION
tomwalters@0 12 .LP
tomwalters@0 13 The genbmm module of the AIM software simulates the spectral analysis
tomwalters@0 14 performed by the auditory system using a bank of auditory filters.
tomwalters@0 15 Specifically, genbmm converts an input wave into an array of filtered
tomwalters@0 16 waves, one for each channel of the filterbank. The surface of the
tomwalters@0 17 array of filtered waves is AIM's representation of basilar membrane
tomwalters@0 18 motion (BMM) as a function of time. AIM provides two alternative
tomwalters@0 19 methods for generating the BMM, linear, gammatone filterbank
tomwalters@0 20 (Patterson et al, 1988; Slaney 1993, Cooke, 1993), or a nonlinear,
tomwalters@0 21 transmission-line filterbank (Giguere and Woodland, 1994). For
tomwalters@0 22 convenience, they are referred to as the 'functional' filterbank and
tomwalters@0 23 the 'physiological' filterbank, respectively.
tomwalters@0 24 .LP
tomwalters@0 25 .SH OPTIONS
tomwalters@0 26 .LP
tomwalters@0 27 There are three sets of options for genbmm; they are grouped by
tomwalters@0 28 function and identified by the suffixes _afb, _gtf and _tlf. The first
tomwalters@0 29 set controls the distribution of the filtered waves across frequency
tomwalters@0 30 (suffix _afb); the second specifies the shape of the gammatone filter
tomwalters@0 31 (suffix _gtf); and the third specifies the shape of the transmission
tomwalters@0 32 line filter (suffix _tlf). These three groups of options are the
tomwalters@0 33 subject of this manual entry, together with an option that specifies
tomwalters@0 34 the filter choice (gtf or tlf), and an option that specifies whether a
tomwalters@0 35 middle ear function should be used with the gtf filterbank.
tomwalters@0 36 .LP
tomwalters@0 37 .SS "The Outer/Middle Ear function: middle_ear "
tomwalters@0 38 .PP
tomwalters@0 39 In the auditory system the middle ear causes a progressive attenuation
tomwalters@0 40 of sound energy in the region below about 500 Hz and a progressive
tomwalters@0 41 attenuation in the region above about 4000 Hz. There is also a
tomwalters@0 42 primary auditory canal resonance around 2700 Hz that provides a boost
tomwalters@0 43 in sound transmission. The resulting transfer function is a normal
tomwalters@0 44 aspect of auditory processing and preceeds spectral analysis. If the
tomwalters@0 45 functional filterbank is chosen (gtf), the outer/middle ear filter
tomwalters@0 46 acts directly on the input wave, and the stapes velocity wave it
tomwalters@0 47 generates is the input to the spectral filtering stage. If the
tomwalters@0 48 physiological filterbank is chosen (tlf), the outer/middle ear and
tomwalters@0 49 cochlear filter are performed simultaneously as in the auditory
tomwalters@0 50 system. The only parameter associated with this function is the
tomwalters@0 51 middle_ear switch which makes it possible to turn the outer/middle ear
tomwalters@0 52 filtering off when the functional filterbank is chosen.
tomwalters@0 53 .LP
tomwalters@0 54 .TP 13
tomwalters@0 55 middle_ear
tomwalters@0 56 Outer/middle ear switch
tomwalters@0 57 .RS
tomwalters@0 58 Switch. Default: on.
tomwalters@0 59 .RE
tomwalters@0 60 .RS
tomwalters@0 61 .LP
tomwalters@0 62 It is also possible to specify a floating point number, in which
tomwalters@0 63 case the middle ear output is multiplied by that value.
tomwalters@0 64 .RE
tomwalters@0 65 .LP
tomwalters@0 66 Note: The middle_ear option is ignored if option filter (see below)
tomwalters@0 67 is set to tlf. This is because the outer/middle stage and the
tomwalters@0 68 cochlear stage are bidirectionally coupled in the transmission
tomwalters@0 69 line filter implementation, and cannot be separated.
tomwalters@0 70 .RE
tomwalters@0 71 .LP
tomwalters@0 72 .SS "The Auditory FilterBank options: _afb "
tomwalters@0 73 .PP
tomwalters@0 74 The distribution of the filters across frequency and the total
tomwalters@0 75 number of output filters in the bank are determined by four parameters:
tomwalters@0 76 channels_afb, mincf_afb, maxcf_afb, and dencf_afb.
tomwalters@0 77 .LP
tomwalters@0 78 .TP 13
tomwalters@0 79 channels_afb
tomwalters@0 80 The number of channels in the filterbank.
tomwalters@0 81 .RS
tomwalters@0 82 Default unit: filters. Default value: 75
tomwalters@0 83 .RE
tomwalters@0 84 .TP 13
tomwalters@0 85 mincf_afb
tomwalters@0 86 The minimum centre frequency
tomwalters@0 87 .RS
tomwalters@0 88 Default unit: Hz. Default value: 100 Hz.
tomwalters@0 89 .RE
tomwalters@0 90 .TP 13
tomwalters@0 91 maxcf_afb
tomwalters@0 92 The maximum centre frequency
tomwalters@0 93 .RS
tomwalters@0 94 Default unit: Hz. Default value: 6000 Hz.
tomwalters@0 95 .RE
tomwalters@0 96 .TP 13
tomwalters@0 97 dencf_afb
tomwalters@0 98 The density of the filters in the filterbank.
tomwalters@0 99 .RS
tomwalters@0 100 Units: filters/critical band. Default: off
tomwalters@0 101 .RE
tomwalters@0 102 .RS
tomwalters@0 103 .LP
tomwalters@0 104 dencf_afb provides an alternative method of specifying the number of channels
tomwalters@0 105 in terms of the density of filters along the frequency scale.
tomwalters@0 106 .RE
tomwalters@0 107 .LP
tomwalters@0 108 Note: channels_afb overrides dencf_afb whenever it has a non-zero
tomwalters@0 109 value. The values of dencf_afb and channels_afb may conflict at
tomwalters@0 110 this point, in which case dencf_afb is ignored.
tomwalters@0 111 .RE
tomwalters@0 112 .LP
tomwalters@0 113 WARNING: When using the transmission line filter (filter=tlf), the
tomwalters@0 114 channel density should be 3 or more filters/erb. Using a lower
tomwalters@0 115 density may lead to excessive spatial discretization errors (see
tomwalters@0 116 Giguere and Woodland (1994) for a discussion). To view a small number
tomwalters@0 117 of channels, use a reasonable density and reduce the number of
tomwalters@0 118 displayed channels using option downchannel.
tomwalters@0 119 .LP
tomwalters@0 120 .TP 14
tomwalters@0 121 audiogram_afb
tomwalters@0 122 The audiogram
tomwalters@0 123 .RS
tomwalters@0 124 Units: none. Default: off. Status: obsolete.
tomwalters@0 125 .RE
tomwalters@0 126 .LP
tomwalters@0 127 Note: In the versions up to and including AIM R6.15, this parameter
tomwalters@0 128 was used as a means of approximating equal loudness contours, as well
tomwalters@0 129 as middle ear attenuation. It applies a spectral weighting function at
tomwalters@0 130 the output of the filterbank. With the addition of the outer/middle
tomwalters@0 131 ear transfer function, this parameter is obsolete, and so the default
tomwalters@0 132 value is off. Users who wish to use the audiogram parameter instead of
tomwalters@0 133 the new outer/middle filter as a loudness equilisation function can
tomwalters@0 134 still do so by setting audiogram_afb=on and middle_ear=off. As before,
tomwalters@0 135 audiogram_afb is applied as a power function and so as the value of
tomwalters@0 136 audiogram_afb decreases from 1 to 0, the degree of attenuation
tomwalters@0 137 decreases. Values greater than unity are allowed but their
tomwalters@0 138 interpretation is unclear.
tomwalters@0 139 .RE
tomwalters@0 140 .LP
tomwalters@0 141 The ERB scale for the gammatone auditory filterbank
tomwalters@0 142 is specificed with three options: bwmin_afb, quality_afb,
tomwalters@0 143 and mmerb_afb.
tomwalters@0 144 .LP
tomwalters@0 145 .TP 13
tomwalters@0 146 bwmin_afb
tomwalters@0 147 The minimum bandwidth for an auditory filter.
tomwalters@0 148 .RS
tomwalters@0 149 Default unit: Hz. Default value: 24.7
tomwalters@0 150 .RE
tomwalters@0 151 .TP 13
tomwalters@0 152 quality_afb
tomwalters@0 153 The limiting quality factor for high frequency auditory filters.
tomwalters@0 154 .RS
tomwalters@0 155 Units: scalar. Default: 9.265
tomwalters@0 156 .RE
tomwalters@0 157 .TP 13
tomwalters@0 158 mmerb_afb
tomwalters@0 159 The length of one erb-rate unit along the basilar membrane.
tomwalters@0 160 .RS
tomwalters@0 161 Units: mm. Default: 0.89
tomwalters@0 162 .RE
tomwalters@0 163 .LP 13
tomwalters@0 164 A listing of the parameters for the filter in the bank can be directed
tomwalters@0 165 to the terminal at run time by setting info_afb=on.
tomwalters@0 166 .RE
tomwalters@0 167 .TP
tomwalters@0 168 info_afb
tomwalters@0 169 Print filterbank information to stderr.
tomwalters@0 170 Switch. Default: off.
tomwalters@0 171 .RE
tomwalters@0 172 .LP
tomwalters@0 173 The physiological data on human cochlear frequency-position
tomwalters@0 174 function (Greenwood, 1990) and the psychoacoustic data on auditory
tomwalters@0 175 filter bandwidth (Patterson and Moore, 1986) indicate that the
tomwalters@0 176 spectral analysis performed in the cochlea is like a 'constant Q'
tomwalters@0 177 system (quality_afb) that asymptotes to a minimum filter bandwidth
tomwalters@0 178 (bwmin_afb) at low centre frequencies. That is,
tomwalters@0 179 .PD 0
tomwalters@0 180 .LP
tomwalters@0 181 .PD 4
tomwalters@0 182 .LP
tomwalters@0 183 erb = bwmin_afb + centre-frequency/quality_afb.
tomwalters@0 184 .PD 0
tomwalters@0 185 .LP
tomwalters@0 186 .PD 4
tomwalters@0 187 .LP
tomwalters@0 188 If we assume, as Greenwood suggests, that each filter bandwidth
tomwalters@0 189 corresponds to a constant distance (mmerb_afb) along the basilar
tomwalters@0 190 membrane, it is possible to scale frequency in terms of erb units (or
tomwalters@0 191 position along the basilar membrane) by integrating the inverse of the
tomwalters@0 192 erb function above.
tomwalters@0 193 .RE
tomwalters@0 194 .LP
tomwalters@0 195 Glasberg and Moore (1990) have reviewed the available human filter
tomwalters@0 196 shape data and concluded that the optimum values for bwmin_afb and
tomwalters@0 197 quality_afb are 24.7 and 9.265, respectively, together with mmerb_afb
tomwalters@0 198 of 0.89. (As a rule of thumb for rapid estimation, erb = 25 + 10% of
tomwalters@0 199 cf ). The auditory scale used by Greenwood (1990) can be specified by
tomwalters@0 200 setting bwmin_afb=22.85, quality_afb=7.238 and mmerb_afb=1.0. A
tomwalters@0 201 reasonable approximation to the Bark scale (Zwicker, 1961) is obtained
tomwalters@0 202 by setting bwmin_afb=80, quality_afb=6.5 and mmerb_afb=0.89.
tomwalters@0 203 .RE
tomwalters@0 204 .LP
tomwalters@0 205 .SS "Auditory filter design: filter "
tomwalters@0 206 .PP
tomwalters@0 207 The choice of filterbank -- linear gammatone or nonlinear transmission
tomwalters@0 208 line -- is determined by option filter.
tomwalters@0 209 .LP
tomwalters@0 210 .TP 13
tomwalters@0 211 filter
tomwalters@0 212 The auditory filter design
tomwalters@0 213 .RS
tomwalters@0 214 Default: gtf. Choices: gtf, tlf, off.
tomwalters@0 215 .RE
tomwalters@0 216 .LP
tomwalters@0 217 When gtf is specified, the options below with suffix _gtf apply, and
tomwalters@0 218 when tlf is specified, the options below with suffix _tlf apply. When
tomwalters@0 219 off is specified, the input wave (or the stapes velocity) is passed on
tomwalters@0 220 directly to the next stage. This provides for non-auditory use of the
tomwalters@0 221 modules following the filterbank with their associated displays. For
tomwalters@0 222 example, the envelope of the input wave (or stapes velocity) can be
tomwalters@0 223 extracted using the rectification and integration modules that follow
tomwalters@0 224 genbmm. The entry point genasa has the most convenient default
tomwalters@0 225 settings for this purpose. The default value for the filter option is
tomwalters@0 226 gtf.
tomwalters@0 227 .RE
tomwalters@0 228 .LP
tomwalters@0 229 .SS "The GammaTone Filter options: _gtf "
tomwalters@0 230 .LP
tomwalters@0 231 .TP 13
tomwalters@0 232 order_gtf
tomwalters@0 233 The order of the gammatone filter
tomwalters@0 234 .RS
tomwalters@0 235 Units: none. Default: 4
tomwalters@0 236 .RE
tomwalters@0 237 .RS
tomwalters@0 238 .LP
tomwalters@0 239 The order of the filter, order_gtf, determines the number of filtering
tomwalters@0 240 stages and so it determines the slope of the skirts of the attenuation
tomwalters@0 241 function and their extent. The default value is 4 and the range of
tomwalters@0 242 useful values is from about 2 to 8. The processing time increases
tomwalters@0 243 linearly with order above about order 2.
tomwalters@0 244 .RE
tomwalters@0 245 .LP
tomwalters@0 246 Note that the bandwidth calculation takes account of the fact that
tomwalters@0 247 changes in order_gtf affect bandwidth. Thus, as long as bwmin_afb is
tomwalters@0 248 fixed, changing the order will not affect the bandwidths of the
tomwalters@0 249 resulting filters. Increasing the order of the filter increases the
tomwalters@0 250 delay of the onset of the impulse response but it has little effect on
tomwalters@0 251 the shape of the envelope of the impulse response for orders greater
tomwalters@0 252 than three. The human auditory system is not sensitive to small phase
tomwalters@0 253 changes between filter channels (Patterson, 1987) and so filter order
tomwalters@0 254 is not well constrained by human experimental data. The default value
tomwalters@0 255 (4) is used because this value provides the best match between the
tomwalters@0 256 amplitude characteristics of the gammatone and roex filters for humans
tomwalters@0 257 (Patterson et al., 1988).
tomwalters@0 258 .LP
tomwalters@0 259 .TP 13
tomwalters@0 260 gain_gtf
tomwalters@0 261 Filter output amplification
tomwalters@0 262 .RS
tomwalters@0 263 Units: scalar. Default: 4.
tomwalters@0 264 .RE
tomwalters@0 265 .RS
tomwalters@0 266 .LP
tomwalters@0 267 The ratio of input to output level across the auditory filter
tomwalters@0 268 when the input is a sinusoid at the cf of the filter.
tomwalters@0 269 .RE
tomwalters@0 270 .TP 13
tomwalters@0 271 phase_gtf
tomwalters@0 272 The phase of the impulse response (obsolete)
tomwalters@0 273 .RS
tomwalters@0 274 Units: none. Default: 0.
tomwalters@0 275 .RE
tomwalters@0 276 .LP
tomwalters@0 277 In the absence of phase compensation, the surface of basilar membrane
tomwalters@0 278 motion has a strong rightward skew in the low-frequency channels
tomwalters@0 279 because the filters get progressively narrower as centre frequency
tomwalters@0 280 decreases, and this narrowing is accompanied by a slower filter
tomwalters@0 281 response. There are occassionally non-auditory reasons for wanting to
tomwalters@0 282 align the channels across frequency in one way or another. The
tomwalters@0 283 software provides four alignment systems which are discussed at the
tomwalters@0 284 end of this entry just before the references under the title Phase
tomwalters@0 285 Alignment.
tomwalters@0 286 .RE
tomwalters@0 287 .LP
tomwalters@0 288 .SS "The Transmission Line Filter options: _tlf "
tomwalters@0 289 .LP
tomwalters@0 290 .TP 13
tomwalters@0 291 motion_tlf
tomwalters@0 292 The basilar membrane output motion variable
tomwalters@0 293 .RS
tomwalters@0 294 Default: vel. Choices: vel, disp.
tomwalters@0 295 .RE
tomwalters@0 296 .RS
tomwalters@0 297 .LP
tomwalters@0 298 If vel (velocity) is specified, the output of genbmm
tomwalters@0 299 is the basilar membrane velocity. If disp (displacement)
tomwalters@0 300 is specified, the output of genbmm is the basilar membrane
tomwalters@0 301 displacement. The default value is vel.
tomwalters@0 302 .RE
tomwalters@0 303 .TP 13
tomwalters@0 304 outdencf_tlf
tomwalters@0 305 The density of the filters outside the display
tomwalters@0 306 range.
tomwalters@0 307 .RS
tomwalters@0 308 Units: filters/critical band. Default: 4.
tomwalters@0 309 .RE
tomwalters@0 310 .RS
tomwalters@0 311 .LP
tomwalters@0 312 In the transmission line filter implementation, it is necessary to
tomwalters@0 313 simulate the basilar membrane over its entire length. The option
tomwalters@0 314 outdencf_tlf provides a means of specifying the number of additional
tomwalters@0 315 channels that must be computed at the basal and apical ends of the
tomwalters@0 316 cochlea, ie. outside the range specified by mincf_afb and maxcf_afb
tomwalters@0 317 (see above). These additional channels are only computed for internal
tomwalters@0 318 use and are not passed to the next stage of processing.
tomwalters@0 319 .RE
tomwalters@0 320 .TP 13
tomwalters@0 321 qref_tlf
tomwalters@0 322 The local quality factor of each basilar membrane channel
tomwalters@0 323 .RS
tomwalters@0 324 Units: scalar. Default: 2.
tomwalters@0 325 .RE
tomwalters@0 326 .RS
tomwalters@0 327 .LP
tomwalters@0 328 Note: With the transmission line filter, the bandwidth is not
tomwalters@0 329 determined by options bwmin_afb and quality_afb at high levels but
tomwalters@0 330 rather by option qref_tlf (see above).
tomwalters@0 331 .RE
tomwalters@0 332 .TP 13
tomwalters@0 333 feedback_tlf
tomwalters@0 334 The feedback gain of the outer hair cell circuit
tomwalters@0 335 .RS
tomwalters@0 336 Units: scalar. Default: 0.99
tomwalters@0 337 .RE
tomwalters@0 338 .RS
tomwalters@0 339 .LP
tomwalters@0 340 WARNING: A value for feedback_afb greater than or equal to 1.0 can
tomwalters@0 341 lead to unstable behaviour at low-levels (ie. oscillation). However,
tomwalters@0 342 the model output will not grow unbound. The growth of the oscillations
tomwalters@0 343 will be limited by the saturating nonlinearity of the outer hair cell
tomwalters@0 344 circuit, and the model output will go into a kind of limit-cycle.
tomwalters@0 345 These model oscillations have not yet been studied in detail and are
tomwalters@0 346 likely to deviate substantially from real cochlear emissions.
tomwalters@0 347 .RE
tomwalters@0 348 .TP 13
tomwalters@0 349 dsat_tlf
tomwalters@0 350 The basilar membrane displacement at the half-saturation point
tomwalters@0 351 of the outer hair cell circuit
tomwalters@0 352 .RS
tomwalters@0 353 Units: cm. Default: 5.75e-6
tomwalters@0 354 .RE
tomwalters@0 355 .TP 13
tomwalters@0 356 gain_tlf
tomwalters@0 357 Filter output amplification
tomwalters@0 358 .RS
tomwalters@0 359 Units: scalar. Default: 4.
tomwalters@0 360 .RE
tomwalters@0 361 .RS
tomwalters@0 362 .LP
tomwalters@0 363 Note: There is an internal gain of 4.0 within the software of
tomwalters@0 364 the transmission line model itself. The total gain is therefore
tomwalters@0 365 4.0 times the value for gain_tlf.
tomwalters@0 366 .RE
tomwalters@0 367 .LP
tomwalters@0 368 Note: A linearized version of the transmission line filter with
tomwalters@0 369 roughly the same bandwidth as the gammatone filter can be obtained by
tomwalters@0 370 setting feedback_tlf=0 and qref_tlf to about 10. The main difference
tomwalters@0 371 is that the low-frequency skirt of the transmission line filter is
tomwalters@0 372 less steep than that of the gammatone.
tomwalters@0 373 .LP
tomwalters@0 374 .SH FURTHER DESCRIPTION
tomwalters@0 375 .LP
tomwalters@0 376 .SS "The distribution of filter centres along the ERB scale. "
tomwalters@0 377 .LP
tomwalters@0 378 Given values for mincf_afb maxcf_afb and channels_afb (or
tomwalters@0 379 dencf_afb), the program creates an array of centre frequencies
tomwalters@0 380 in three steps:
tomwalters@0 381 .LP
tomwalters@0 382 1. It centres a filter at 1.0 kHz.
tomwalters@0 383 .RE
tomwalters@0 384 .LP
tomwalters@0 385 2. Then it centres filters below 1.0 kHz, one after another,
tomwalters@0 386 until it encounters mincf_afb. (Thus, mincf_afb is actually the
tomwalters@0 387 frequency below which no filters are centred). The step size,
tomwalters@0 388 that is the distance between centre frequencies, is determined
tomwalters@0 389 by dencf_afb. When dencf_afb is equal to one, the centre
tomwalters@0 390 frequencies are 1 ERB apart. The ERB is the Equivalent
tomwalters@0 391 Rectangular Bandwidth of the filter (about 14% larger than the 3
tomwalters@0 392 dB bandwidth of the filter). The function relating the ERB to the
tomwalters@0 393 centre frequency of the filter is taken from a `critical band'
tomwalters@0 394 equation introduced by Greenwood (1961) and adapted to human
tomwalters@0 395 auditory masking by Glasberg and Moore (1990).
tomwalters@0 396 .RE
tomwalters@0 397 .LP
tomwalters@0 398 3. Finally, the program centres filters one after another in
tomwalters@0 399 the region above 1 kHz until it encounters maxcf_afb (which is,
tomwalters@0 400 actually, the frequency above which no filters are centred). When
tomwalters@0 401 dencf_afb is increased, say to two, the program allocates two
tomwalters@0 402 filters per critical band and spaces them at half ERB steps.
tomwalters@0 403 .RE
tomwalters@0 404 .LP
tomwalters@0 405 Note: It is not the bandwidths of the filters that are
tomwalters@0 406 controlled by dencf_afb but rather the density of filters along
tomwalters@0 407 the frequency axis. Thus, doubling dencf_afb does not cause the
tomwalters@0 408 bandwidth of the filters to be halved; rather it results in more
tomwalters@0 409 overlap between adjacent filters. With regard to the images
tomwalters@0 410 produced by genbmm, dencf_afb determines the density of lines on
tomwalters@0 411 the surface rather than the shape of the features that appear on
tomwalters@0 412 the surface.
tomwalters@0 413 .LP
tomwalters@0 414 .SH MOTIVATION
tomwalters@0 415 .LP
tomwalters@0 416 The motivation for adopting the gammatone filter shape is
tomwalters@0 417 threefold:
tomwalters@0 418 .LP
tomwalters@0 419 1. It provides an excellent summary of physiological data
tomwalters@0 420 concerning the impulse response of primary auditory neurons in
tomwalters@0 421 small mammals such as cats (de Boer and de Jongh, 1978; Carney and
tomwalters@0 422 Yin, 1989)
tomwalters@0 423 .RE
tomwalters@0 424 .LP
tomwalters@0 425 2. The amplitude characteristic of the gammatone filter is very
tomwalters@0 426 similar to that of the Roex filter commonly used to represent the
tomwalters@0 427 human auditory filter (Patterson, et al, 1982; Schofield, 1985;
tomwalters@0 428 Patterson et al, 1988) .
tomwalters@0 429 .RE
tomwalters@0 430 .LP
tomwalters@0 431 3. There are recursive gammatone filters that make the calculation
tomwalters@0 432 particularly fast both on general purpose computers and special
tomwalters@0 433 purpose DSP chips (Holdsworth et al, 1988; Cooke, 1993; Slaney, 1993).
tomwalters@0 434 .RE
tomwalters@0 435 .LP
tomwalters@0 436 In summary, the gammatone filter is designed to provide a reasonable
tomwalters@0 437 trade-off between accuracy in simulating basilar membrane motion, and
tomwalters@0 438 computational load.
tomwalters@0 439 .RE
tomwalters@0 440 .LP
tomwalters@0 441 The motivation for adopting the transmission line filter is
tomwalters@0 442 as follows:
tomwalters@0 443 .LP
tomwalters@0 444 1. The outer hair cell circuit of the transmission line filter is
tomwalters@0 445 level dependent and so this design produces level-dependent basilar
tomwalters@0 446 membrane tuning curves (Giguere and Woodland, 1994). There is now
tomwalters@0 447 ample evidence that the basilar membrane motion is indeed highly
tomwalters@0 448 nonlinear and a major source of level compression (eg. Johnstone et
tomwalters@0 449 al., 1986).
tomwalters@0 450 .LP
tomwalters@0 451 2. The internal structure of the transmission line filter model is
tomwalters@0 452 based on the physics of the auditory periphery and therefore provides
tomwalters@0 453 a more realistic cochlear simulation than parallel filterbanks. It
tomwalters@0 454 generates combination tones of the form 2f1-f2 as observed in the
tomwalters@0 455 auditory system and it has the potential to generate cochlear echoes.
tomwalters@0 456 .LP
tomwalters@0 457 3. The wave-digital-filter implementation of the transmission line
tomwalters@0 458 filterbank is only about twice as slow as the gammatone filterbank
tomwalters@0 459 for an equivalent number of channels.
tomwalters@0 460 .RE
tomwalters@0 461 .LP
tomwalters@0 462 .SH "Phase Alignment"
tomwalters@0 463 .LP
tomwalters@0 464 There is no question that the output of the cochlea has a phase lag
tomwalters@0 465 corresponding to the strong rightward skew. However, perceptual
tomwalters@0 466 evidence indicates that this phase lag has to be enormous (> 4ms) to
tomwalters@0 467 affect what we hear; indeed, reversing the phase lag with synthetic
tomwalters@0 468 stimuli does not change what we hear (Patterson, 1987). Phase
tomwalters@0 469 information that appears in the basilar membrane motion but which we
tomwalters@0 470 do not hear, is removed in the third module by the strobe mechanism of
tomwalters@0 471 the temporal integration process. As a result, the stabilised auditory
tomwalters@0 472 images are always phase aligned even though the basilar membrane
tomwalters@0 473 motion and the neural activity patterns are not.
tomwalters@0 474 .RE
tomwalters@0 475 .LP
tomwalters@0 476 Prior to discovering the integration mechanism, we wanted to find
tomwalters@0 477 a way of reducing the skew from the basilar membrane image, in
tomwalters@0 478 order to provide a visual representation that was more like what
tomwalters@0 479 we hear. The genbmm program provides the following options for
tomwalters@0 480 phase aligning the responses of successive filters, determined
tomwalters@0 481 by the value of the option phase_gtf:
tomwalters@0 482 .RE
tomwalters@0 483 .LP
tomwalters@0 484 Value Effect
tomwalters@0 485 .PP
tomwalters@0 486 .TP 7
tomwalters@0 487 -1
tomwalters@0 488 Envelope alignment.
tomwalters@0 489 .RS
tomwalters@0 490 Shift the channels of output horizontally so that the points of
tomwalters@0 491 maximum response to an impulse (ie the envelope maxima) will be aligned.
tomwalters@0 492 .RE
tomwalters@0 493 .TP 7
tomwalters@0 494 -2
tomwalters@0 495 Envelope plus fine structure alignment.
tomwalters@0 496 .RS
tomwalters@0 497 Perform envelope-peak alignment as in option -1 and then shift the
tomwalters@0 498 fine structure phase in each channel so that a fine- structure peak
tomwalters@0 499 coincides with the envelope peak.
tomwalters@0 500 .RE
tomwalters@0 501 .TP 7
tomwalters@0 502 -4
tomwalters@0 503 Envelope plus peak alignment, `left justified'.
tomwalters@0 504 .RS
tomwalters@0 505 Align the envelopes and fine structure of all of the impulse responses
tomwalters@0 506 along the left edge of the image.
tomwalters@0 507 .RE
tomwalters@0 508 .TP 6
tomwalters@0 509 0
tomwalters@0 510 No phase compensation.
tomwalters@0 511 .TP 7
tomwalters@0 512 +n
tomwalters@0 513 Advance each channel by n cycles of the centre frequency of the channel.
tomwalters@0 514 Approximate envelope alignment is achieved using phase_gtf = 3
tomwalters@0 515 or 4.
tomwalters@0 516 .RE
tomwalters@0 517 .LP
tomwalters@0 518 We experimented with a number of phase compensation schemes
tomwalters@0 519 (Patterson et al., 1989) and concluded that the best option was
tomwalters@0 520 envelope plus peak alignment which corresponds to a value of
tomwalters@0 521 phase_gtf = -4. Accordingly, we recommend the use of phase_gtf
tomwalters@0 522 values of 0 (ie no phase compensation) or -4 (envelope plus peak
tomwalters@0 523 alignment). The remaining options are occasionally useful and so
tomwalters@0 524 they have been left in the software.
tomwalters@0 525 Note that for any phase compensation option other than 0 the time
tomwalters@0 526 scale is strictly correct only for the lowest channel. For any
tomwalters@0 527 other channel, the origin of the abscissa is offset to the right
tomwalters@0 528 by an amount equal to the difference between `the envelope peak
tomwalters@0 529 time of the lowest-frequency channel' and `the envelope peak time
tomwalters@0 530 of the given channel'.
tomwalters@0 531 .RE
tomwalters@0 532 .LP
tomwalters@0 533 .SH REFERENCES
tomwalters@0 534 .LP
tomwalters@0 535 .RE
tomwalters@0 536 .TP 4
tomwalters@0 537 de Boer, E., and de Jongh, H.R. (1978). On cochlear encoding:
tomwalters@0 538 potentialities and limitations of the reverse-correlation
tomwalters@0 539 technique, J. Acoust. Soc. Am., 63, 115-135.
tomwalters@0 540 .RE
tomwalters@0 541 .LP
tomwalters@0 542 .TP 4
tomwalters@0 543 Carney, L.H. and Yin, C.T. (1988) 'Temporal coding of resonances
tomwalters@0 544 by low-frequency auditory nerve fibers: Single fibre responses
tomwalters@0 545 and a population model', J.Neurophysiology, 60, 1653-1677.
tomwalters@0 546 .RE
tomwalters@0 547 .LP
tomwalters@0 548 .TP 4
tomwalters@0 549 Cooke, M.P. (1993). Modelling Auditory Processing and
tomwalters@0 550 Organisation, Cambridge University Press.
tomwalters@0 551 .RE
tomwalters@0 552 .LP
tomwalters@0 553 .TP 4
tomwalters@0 554 Giguere, C. and Woodland, P.C. (1994). A computational model of
tomwalters@0 555 the auditory periphery for speech and hearing research. I. Ascending
tomwalters@0 556 path. J.Acoust. Soc. Am. 95: 331-342.
tomwalters@0 557 .RE
tomwalters@0 558 .LP
tomwalters@0 559 .TP 4
tomwalters@0 560 Glasberg, B.R. and B.C.J. Moore (1990). Derivation of auditory
tomwalters@0 561 filter shapes from notched-noise data. Hearing Research, 47,
tomwalters@0 562 103-138.
tomwalters@0 563 .RE
tomwalters@0 564 .LP
tomwalters@0 565 .TP 4
tomwalters@0 566 Greenwood, D.D. (1961) 'Critical bandwidth and the frequency
tomwalters@0 567 coordinates of the basilar membrane', J. Acoust. Soc. Am. 33,
tomwalters@0 568 1344-1356.
tomwalters@0 569 .RE
tomwalters@0 570 .LP
tomwalters@0 571 .TP 4
tomwalters@0 572 Greenwood, D.D. (1990). A cochlear frequency-position function
tomwalters@0 573 for several species - 29 years later. J. Acoust. Soc. Am., 87,
tomwalters@0 574 2592-2605.
tomwalters@0 575 .RE
tomwalters@0 576 .LP
tomwalters@0 577 .TP 4
tomwalters@0 578 Holdsworth, J., Nimmo-Smith, I., Patterson, R.D. and Rice, P.
tomwalters@0 579 (1988) Annex C of 'Spiral Vos Final Report, Part A: The
tomwalters@0 580 Auditory Filterbank,' APU report 2341.
tomwalters@0 581 .RE
tomwalters@0 582 .LP
tomwalters@0 583 .TP 4
tomwalters@0 584 Johnstone, B.M. et al. (1986). Hear Res. 22: 147-153.
tomwalters@0 585 .RE
tomwalters@0 586 .LP
tomwalters@0 587 .TP 4
tomwalters@0 588 Moore, B.C.J and Glasberg, B.R. (1983), "Suggested formulae for
tomwalters@0 589 calculating auditory filter bandwidths and excitiation patterns,"
tomwalters@0 590 J. Acoust. Soc. Am. 74, pp 750-753.
tomwalters@0 591 .RE
tomwalters@0 592 .LP
tomwalters@0 593 .TP 4
tomwalters@0 594 Patuzzi, R., and Robertson, D. (1988) "Tuning in the mammalian
tomwalters@0 595 cochlea," Physiological Reviews 68, 1009-1082.
tomwalters@0 596 .RE
tomwalters@0 597 .LP
tomwalters@0 598 .TP 4
tomwalters@0 599 Patterson, R.D. (1976). Auditory filter shapes derived with
tomwalters@0 600 noise stimuli. J. Acoust. Soc. Am., 59, 640-654.
tomwalters@0 601 .RE
tomwalters@0 602 .LP
tomwalters@0 603 .TP 4
tomwalters@0 604 Patterson, R.D. (1987). A pulse ribbon model of monaural phase
tomwalters@0 605 perception. J. Acoust. Soc. Am., 82, 1560-1586.
tomwalters@0 606 .RE
tomwalters@0 607 .LP
tomwalters@0 608 .TP 4
tomwalters@0 609 Patterson, R.D., Nimmo-Smith, I., Weber, D.L., and Milroy, R.
tomwalters@0 610 (1982). The deterioration of hearing with age: Frequency
tomwalters@0 611 selectivity, the critical ratio, the audiogram, and speech
tomwalters@0 612 threshold. J. Acoust. Soc. Am., 72, 1788-1803.
tomwalters@0 613 .RE
tomwalters@0 614 .LP
tomwalters@0 615 .TP 4
tomwalters@0 616 Patterson, R.D., Allerhand, M.H. and Holdsworth, J. (1992)
tomwalters@0 617 'Auditory representations of speech sounds', In Visual
tomwalters@0 618 representations of speech signals, Eds. Martin Cooke and Steve
tomwalters@0 619 Beet, John Wiley & Sons. 307-314.
tomwalters@0 620 .RE
tomwalters@0 621 .LP
tomwalters@0 622 .TP 4
tomwalters@0 623 Patterson, R. D., Holdsworth, J., Nimmo-Smith, I., and Rice, P.
tomwalters@0 624 (1988). SVOS Final Report: The Auditory Filterbank. APU report 2341.
tomwalters@0 625 .RE
tomwalters@0 626 .LP
tomwalters@0 627 .TP 4
tomwalters@0 628 Patterson, R.D. and B.C.J. Moore (1986). Auditory filters and
tomwalters@0 629 excitation patterns as representations of frequency
tomwalters@0 630 resolution. In: Frequency Selectivity in Hearing (B. C. J.
tomwalters@0 631 Moore, ed.), pp. 123-177. Academic Press, London.
tomwalters@0 632 .RE
tomwalters@0 633 .LP
tomwalters@0 634 .TP 4
tomwalters@0 635 Schofield, D. (1985) 'Visualisations of speech based on a model
tomwalters@0 636 of the peripheral auditory system', NPL Report DITC 62/85.
tomwalters@0 637 .RE
tomwalters@0 638 .LP
tomwalters@0 639 .TP 4
tomwalters@0 640 Slaney, M. (1993) An efficient implementation of the Patterson-
tomwalters@0 641 Holdsworth auditory filter bank. Apple Computer Technical
tomwalters@0 642 Report #35.
tomwalters@0 643 .RE
tomwalters@0 644 .LP
tomwalters@0 645 .TP 4
tomwalters@0 646 Zwicker, E. (1961) Subdivision of the audible frequency range into
tomwalters@0 647 critical bands (frequenzgruppen). J. Acoust. Soc. Am. 33, 248.
tomwalters@0 648 .LP
tomwalters@0 649 .SH EXAMPLES
tomwalters@0 650 .LP
tomwalters@0 651 The following command generates basilar membrane motion using the
tomwalters@0 652 gammatone filter design (the default) for an input filename cegc:
tomwalters@0 653 .RE
tomwalters@0 654 .LP
tomwalters@0 655 example% genbmm cegc
tomwalters@0 656 .RE
tomwalters@0 657 .LP
tomwalters@0 658 The following command generates basilar membrane motion using the
tomwalters@0 659 gammatone filter design (the default) for a filterbank with cf from
tomwalters@0 660 200 Hz to 5000 Hz at a density of 4 filters/critical band for the same
tomwalters@0 661 input filename:
tomwalters@0 662 .RE
tomwalters@0 663 .LP
tomwalters@0 664 example% genbmm channels=0 mincf=200 maxcf=5000 dencf=4. cegc
tomwalters@0 665 .RE
tomwalters@0 666 .LP
tomwalters@0 667 The following command generates basilar membrane motion using the
tomwalters@0 668 gammatone filter design (the default) and the audiogram function
tomwalters@0 669 instead of the outer/middle ear filter:
tomwalters@0 670 .RE
tomwalters@0 671 .LP
tomwalters@0 672 example% genbmm middle_ear=off audiogram=on cegc
tomwalters@0 673 .RE
tomwalters@0 674 .LP
tomwalters@0 675 The following command generates the basilar membrane motion using the
tomwalters@0 676 transmission line filter design instead of the default gammatone
tomwalters@0 677 filter:
tomwalters@0 678 .RE
tomwalters@0 679 .LP
tomwalters@0 680 example% genbmm filter=tlf cegc
tomwalters@0 681 .RE
tomwalters@0 682 .LP
tomwalters@0 683 The following command generates the basilar membrane motion using the
tomwalters@0 684 transmission line filter design and the auditory scale of Greenwood
tomwalters@0 685 (1990):
tomwalters@0 686 .RE
tomwalters@0 687 .LP
tomwalters@0 688 example% genbmm filter=tlf bwmin=22.85 quality=7.238 mmerb=1.0 cegc
tomwalters@0 689 .RE
tomwalters@0 690 .LP
tomwalters@0 691 The following command generates the basilar membrane motion using the
tomwalters@0 692 transmission line filter design, but with the nonlinear outer hair
tomwalters@0 693 cell feedback mechanism turned off:
tomwalters@0 694 .RE
tomwalters@0 695 .LP
tomwalters@0 696 example% genbmm filter=tlf feedback=off cegc
tomwalters@0 697 .LP
tomwalters@0 698 .SH FILES
tomwalters@0 699 .LP
tomwalters@0 700 .TP 13
tomwalters@0 701 .genbmmrc
tomwalters@0 702 The options file for genbmm.
tomwalters@0 703 .LP
tomwalters@0 704 .SH SEE ALSO
tomwalters@0 705 .LP
tomwalters@0 706 genasa, gensgm
tomwalters@0 707 .LP
tomwalters@0 708 .SH BUGS
tomwalters@0 709 .LP
tomwalters@0 710 There is a bug in the hiddenline plotting of genbmm. It shows up when
tomwalters@0 711 the surface has deep valleys and there is a large phase delay. The
tomwalters@0 712 negative peaks show through on surfaces where they should be hidden.
tomwalters@0 713 .SH COPYRIGHT
tomwalters@0 714 .LP
tomwalters@0 715 Copyright (c) Applied Psychology Unit, Medical Research Council, 1995
tomwalters@0 716 .LP
tomwalters@0 717 Permission to use, copy, modify, and distribute this software without fee
tomwalters@0 718 is hereby granted for research purposes, provided that this copyright
tomwalters@0 719 notice appears in all copies and in all supporting documentation, and that
tomwalters@0 720 the software is not redistributed for any fee (except for a nominal
tomwalters@0 721 shipping charge). Anyone wanting to incorporate all or part of this
tomwalters@0 722 software in a commercial product must obtain a license from the Medical
tomwalters@0 723 Research Council.
tomwalters@0 724 .LP
tomwalters@0 725 The MRC makes no representations about the suitability of this
tomwalters@0 726 software for any purpose. It is provided "as is" without express or
tomwalters@0 727 implied warranty.
tomwalters@0 728 .LP
tomwalters@0 729 THE MRC DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING
tomwalters@0 730 ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS, IN NO EVENT SHALL
tomwalters@0 731 THE A.P.U. BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES
tomwalters@0 732 OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS,
tomwalters@0 733 WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION,
tomwalters@0 734 ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS
tomwalters@0 735 SOFTWARE.
tomwalters@0 736 .LP
tomwalters@0 737 .SH ACKNOWLEDGEMENTS
tomwalters@0 738 .LP
tomwalters@0 739 The AIM software was developed for Unix workstations by John
tomwalters@0 740 Holdsworth and Mike Allerhand of the MRC APU, under the direction of
tomwalters@0 741 Roy Patterson. The physiological version of AIM was developed by
tomwalters@0 742 Christian Giguere. The options handler is by Paul Manson. The revised
tomwalters@0 743 SAI module is by Jay Datta. Michael Akeroyd extended the postscript
tomwalters@0 744 facilites and developed the xreview routine for auditory image
tomwalters@0 745 cartoons.
tomwalters@0 746 .LP
tomwalters@0 747 The project was supported by the MRC and grants from the U.K. Defense
tomwalters@0 748 Research Agency, Farnborough (Research Contract 2239); the EEC Esprit
tomwalters@0 749 BR Porgramme, Project ACTS (3207); and the U.K. Hearing Research Trust.
tomwalters@0 750