Mercurial > hg > aim92

diff man/man1/gennap.1 @ 0:5242703e91d3 tip
Initial checkin for AIM92 aimR8.2 (last updated May 1997).
author: tomwalters
date: Fri, 20 May 2011 15:19:45 +0100
--- /dev/null	Thu Jan 01 00:00:00 1970 +0000
+++ b/man/man1/gennap.1	Fri May 20 15:19:45 2011 +0100
@@ -0,0 +1,405 @@
+.TH GENNAP 1 "8 April 1994"
+.LP
+.SH NAME
+.LP
+gennap \- generate neural activity pattern
+.LP
+.SH SYNOPSIS
+.LP
+gennap [ option=value | -option ] [ filename ]
+.LP
+.SH DESCRIPTION
+.LP
+The gennap module of the AIM software converts an input wave into a
+simulated neural activity pattern (NAP), which is AIM's representation
+of the pattern of information in the auditory nerve at about the level
+of the cochlear nucleus.  Gennap begins by calculating the basilar
+membrane motion (BMM) associated with the input wave using the genbmm
+module, and then it applies several additional transforms that we know
+occur in some form during the neural transduction process.  AIM
+provides two alternative methods for generating the NAP, a
+two-dimensional adaptive thresholding mechanism (Holdsworth and
+Patterson, 1993), and an array of inner haircell simulators based
+(Meddis et al., 1990; Giguere and Woodland, 1994).  The adaptive
+thresholding mechanism applies rectification, log compression,
+adaptation in time, and suppression across frequency; its purpose is
+to stabilise the level of the membrane activity with compression and
+then sharpen the features that appear in the compressed membrane
+motion.  Together, the gammatone filterbank and adaptive thresholding
+form a 'functional' cochlea simulation.  The Meddis module applies
+level-dependant compression and adaptation that simulate the response
+of inner haircells to membrane motion.  The cells are not coupled and
+so there is no frequency sharpening in this module.  Together, the
+transmission-line filterbank and the Meddis module form a
+'physiological' cochlea simulation.
+.LP
+.SH OPTIONS
+.LP
+The options for gennap are grouped according to the functions they
+control. The adaptive thresholding options are identified by the
+common suffix _at; the Meddis module options are identified by the
+common suffix _med.  These two groups of options are the subject of
+this manual entry, together with two additional options that specify
+whether rectification and compression operations are required before
+the transduction stage.  There is also an option to specify the choice
+of the transduction function.
+.LP
+.SH  RECTIFICATION AND COMPRESSION
+.LP
+The adaptive thresholding process begins with rectification and log
+compression of the BMM.  It is occasionally useful to have these
+functions available separately and so the options 'rectify' and
+'compress' are presented separately in the options list before the
+neural transduction options.
+.RE
+.LP
+.TP 13
+rectify
+Rectification switch
+.RS
+Switch. Default value: off. 
+.RE
+.RS
+.LP
+If rectify is on, the BMM is half-wave rectified.
+The compression operation also performs half-wave rectification (to
+avoid taking logs of negative numbers).  So the rectify option is
+really here just to provide for rectified BMM in the absence of
+compression.  As a result, the default for option rectify is
+off. (Note: Full wave rectification is produced if rectify is set to
+2.  This is useful when calculating envelopes with genasa.)
+.RE
+.LP
+.TP 13
+compress
+Compression switch
+.RS
+Switch. Default value: on. 
+.RE
+.RS
+.LP
+The compressor is strictly logarithmic and so to this point, the
+functional cochlea simulation is level independent.  In the auditory
+system, the compressor is logarithmic over the lower part of its range
+and then it asymptotes to a soft limit. The default for option
+compress is on (note that the compressor also performs half-wave
+rectification).
+.RE
+.LP
+Important: The default value for option compress is 'on' which assumes
+that the transduction module is adaptive thresholding (the default for
+the transduction option described below).  If the Meddis transduction
+module is selected (transduction=med), compress should be set to 'off'
+to obtain the operation described in Giguerre and Woodland
+(1994). This can be done on the command line (see EXAMPLES) or in the
+appropriate .gen???rc files.
+.RE
+.LP
+.SH NEURAL TRANSDUCTION
+.LP
+The neural transduction is performed either by two-dimensional
+adaptive thresholding or an array of Meddis haircells. The choice is
+controlled by the option 'transduction'.
+.LP
+.TP 13
+transduction
+The transduction function
+.RS
+Switch. Default value: at. Choices: at, med, off.
+.RE
+.LP
+If adaptive thresholding is specified (at), the options with suffix
+_at below apply; if the Meddis module is specified (med), the options
+with suffix _med below apply. If off is specified, no transduction
+function is applied.  The default is at.
+.RE
+.LP
+.SS "Two-dimensional adaptive thresholding: _at "
+.PP
+The adaptive thresholding mechanism is a functional model of neural
+encoding. Its purpose is to enhance the contrast of the larger
+features that appear in the surface of the BMM and reduce those
+aspects of the representation which are just a direct consequence of
+the filtering and compression processes (Holdsworth and Patterson,
+1993).  The process begins with rectification and compression of the
+BMM.  The tail of the envelope of the impulse response of the
+gammatone filter is exponential. As a result, logarithmic compression
+is used, since this makes the filter decay function linear in NAP
+coordinates. Following compression, adaptation is applied in time and
+suppression is applied across frequency.
+.LP
+Briefly, an adaptive threshold value is maintained for each channel
+and updated at the sampling rate. The new value is the largest of a)
+the previous value reduced by a fast-acting temporal decay factor
+(t1recovery_at), b) the previous value reduced by a longer-term
+temporal decay factor (t2recovery_at), c) the adapted level in the
+channel immediately above, reduced by a frequency spread factor
+(frecovery_at), d) the adapted level in the channel immediately below,
+reduced by the same frequency spread factor, or e) a floor level that
+precludes the mechanism listening to its own internal noise
+(reclimit_at).  The mechanism produces output whenever the input
+exceeds the adaptive threshold, and the output level is the difference
+between the input and the adaptive threshold. The adaptation and
+suppression are coupled, and they jointly sharpen features like vowel
+formants which appear smeared in compressed BMM.
+.LP
+.TP 13
+trise_at
+Threshold rise rate 
+.RS
+Default value: 1000. 
+.RE
+.RS
+.LP
+Upward Adaptation: This option specifies the rate at which the
+adaptive threshold will rise in response to a rise in signal
+level. The default value, 1000, means that the adaptive threshold
+responds very quickly to increases in the input wave; essentially, it
+follows the envelope of any rise in signal amplitude.
+.RE
+.LP
+Downward Adaptation: Following the cessation of sound, or a rapid drop
+in input level, temporal adaptation occurs in two stages as determined
+by t1recovery_at, t2recovery_at and propt2t1_at: If the default values
+are used, the mechanism initially adapts at a rate slightly slower
+than the decay rate of the gammatone filter in the given channel, and
+this represses much of the ringing of the impulse response of the
+filter.  Later the adaptation switches to a slower rate more in line
+with data on auditory forward masking.  The option propt2t1_at
+determines the point at which the initial fast rate of decay gives way
+to the slower limiting decay rate.
+.RE
+.LP
+.TP 13
+t1recovery_at
+The initial rate of threshold recovery relative to filter decay rate 
+.RS
+Default value: 0.6. 
+.RE
+.RS
+.LP
+This option determines the initial rate of decay of the adaptive
+threshold relative to the rate of decay of the auditory filter,
+provided propt2t1_at is less than unity.  Values of t1recovery_at less
+than unity cause the adaptive threshold to decay more slowly than the
+auditory filter and thereby to remove the filter response from the
+representation when it is the sole reason for BMM activity.  The rate
+of decay is linear with respect to the log-compressed BMM, so it is
+like an exponential decay with respect to the BMM.
+.RE
+.LP
+.TP 13
+t2recovery_at
+The secondary threshold recovery rate
+.RS
+Default value: 0.2. 
+.RE
+.RS
+.LP
+This option determines the limiting rate of decay of the adaptive
+threshold when the sound ceases provided propt2t1_at is less than
+unity.  The default value causes the adaptive threshold to decay more
+slowly than the initial rate as observed in auditory forward masking.
+Note, however, that the system to this point is level independent,
+whereas auditory forward masking is level dependent.
+.RE
+.LP
+.TP 13
+propt2t1_at
+The point at which t1recovery_at gives way to t2_recovery_at
+.RS
+Default value: 0.5. 
+.RE
+.RS
+.LP
+This option determines the point at which the initial fast rate of
+decay (t1recovery_at) gives way to the slower limiting decay rate
+(t2recovery_at).  The nomanclature assumes that propt2t1_at is a value
+less than unity.  Otherwise the the roles of the initial and limiting
+decays are reversed.
+.RE
+.LP
+.TP 13
+frecovery_at
+Recovery rate across frequency 
+.RS
+Default value: 20. 
+.RE
+.RS
+.LP
+This parameter specifies the rate at which a threshold value in one channel 
+propagates to influence threshold in neighbouring channels. 
+.RE
+.LP
+.TP 13
+reclimit_at
+Limitation on recovery level 
+.RS
+Default units: mB. Default value: 500 mB. (mB=milliBells)
+.RE
+.RS
+.LP
+In order to prevent the mechanism from encountering system noise, 
+or alternately, to reduce sensitivity to stimulus noise, there is a 
+limit placed on the recovery that the adaptive threshold can achieve. 
+The limit, reclimit_at, is the limit of the sensitivity of the system. 
+.RE
+.LP
+.TP 13
+gain_at
+Output gain 
+.RS
+Default units: scalar. Default value: 1. 
+.RE
+.LP
+.SS "Meddis haircell model: _med "
+.PP
+The purpose of the Meddis module is to simulate neural transduction of
+BMM as performed by the inner haircells of the cochlea.  There is one
+haircell simulation unit for each output channel of the filterbank.
+The haircell equations (Meddis et al., 1990) are solved using the wave
+digital filter algorithm described in Giguere and Woodland (1994). The
+characteristics of the haircell are controlled by options: fiber_med,
+thresh_med, and gain_med.
+.LP
+.TP 13
+fiber_med
+The spontaneous-rate of the simulated fiber
+.RS
+Default value: medium. Choices: medium, high.
+.RE
+.RS
+.LP
+If medium is specified, a medium spontaneous-rate haircell fiber is
+simulated. If high is specified, a high spontaneous-rate 
+fiber is simulated. The properties of these two types of fibers 
+are listed in Table II in Meddis et al. (1990).
+The default value is medium.
+.RE
+.LP
+.TP 13
+thresh_med
+The threshold shift of the fiber
+.RS
+Default Units: dB. Default value: 0.   
+.RE
+.RS
+.LP
+This option shifts the entire rate-intensity function of the haircell
+fiber horizontally to a higher or lower level, to accomodate changes
+in the scaling of the input wave.  A positive (negative) value
+increases (decreases) the rate- and saturation-thresholds of the fiber
+by that amount.  This operation does not change the dynamic range, the
+spontaneous and saturation rates, or the adaptation time constants or
+synchronization index of the fiber.
+.RE
+.LP
+.TP 13
+gain_med
+Output gain 
+.RS
+Default units: scalar. Default value: 1. 
+.RE
+.RS
+.LP
+Note: There is an internal gain of 20.0 within the software of
+the Meddis haircell model itself. The total gain is therefore
+20.0 times the value for gain_med.
+.RE
+.LP
+.SH REFERENCES
+.LP
+.RE
+.TP 4
+Giguere, C. and Woodland, P.C. (1994).  A computational model of
+the auditory periphery for speech and hearing research. I. Ascending
+path. J.Acoust. Soc. Am. 95: 331-342.
+.RE
+.LP
+.TP 4
+Holdsworth, J. (1990). Two-Dimensional adaptive thresholding.
+Annex 4 of AAM-HAP Report 1, APU contract Report.
+.RE
+.LP
+.TP 4
+Holdsworth, J. and Patterson, R.D. (1993). "Analysis of waveforms,"
+UK Patent GB 2234078B.
+.LP
+.TP 4
+Meddis, R., Hewitt, M. and Shackleton, T. (1990). Implementation
+details of a computational model of the inner-haircell/auditory-nerve
+synapse. J.Acoust. Soc. Am. 87: 1813-1816.
+.RE
+.LP
+.SH EXAMPLES
+.LP
+The following command generates the neural activity pattern using the
+gammatone auditory filterbank (the default) and the adaptive
+thresholding (the default) for an input file named cegc:
+.RE
+.LP
+example% gennap cegc
+.RE
+.LP
+The following command generates the neural activity pattern using the
+gammatone filterbank (the default) and Meddis haircell
+transduction for input cegc:
+.RE
+.LP
+example% gennap compress=off transduction=meddis cegc
+.RE
+.LP
+The following command generates the neural activity pattern using the
+transmission line filterbank and Meddis haircell transduction for cegc:
+.RE
+.LP
+example% gennap filter=tlf compress=off transduction=meddis cegc
+.LP
+.SH FILES
+.LP
+.TP 13
+ .gennaprc 
+The options file for gennap.
+.LP
+.SH SEE ALSO
+.LP
+genepn, gencgm, genbmm
+.LP
+.SH COPYRIGHT
+.LP
+Copyright (c) Applied Psychology Unit, Medical Research Council, 1995
+.LP
+Permission to use, copy, modify, and distribute this software without fee 
+is hereby granted for research purposes, provided that this copyright 
+notice appears in all copies and in all supporting documentation, and that 
+the software is not redistributed for any fee (except for a nominal 
+shipping charge). Anyone wanting to incorporate all or part of this 
+software in a commercial product must obtain a license from the Medical 
+Research Council.
+.LP
+The MRC makes no representations about the suitability of this 
+software for any purpose.  It is provided "as is" without express or 
+implied warranty.
+.LP
+THE MRC DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING 
+ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS, IN NO EVENT SHALL 
+THE A.P.U. BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES 
+OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, 
+WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, 
+ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS 
+SOFTWARE.
+.LP
+.SH ACKNOWLEDGEMENTS
+.LP
+The AIM software was developed for Unix workstations by John
+Holdsworth and Mike Allerhand of the MRC APU, under the direction of
+Roy Patterson. The physiological version of AIM was developed by
+Christian Giguere. The options handler is by Paul Manson. The revised
+SAI module is by Jay Datta. Michael Akeroyd extended the postscript
+facilites and developed the xreview routine for auditory image
+cartoons.
+.LP
+The project was supported by the MRC and grants from the U.K. Defense
+Research Agency, Farnborough (Research Contract 2239); the EEC Esprit
+BR Porgramme, Project ACTS (3207); and the U.K. Hearing Research Trust.
+
author	tomwalters
date	Fri, 20 May 2011 15:19:45 +0100
parents
children