comparison man/man1/gencgm.1 @ 0:5242703e91d3 tip

Initial checkin for AIM92 aimR8.2 (last updated May 1997).
author tomwalters
date Fri, 20 May 2011 15:19:45 +0100
parents
children
comparison
equal deleted inserted replaced
-1:000000000000 0:5242703e91d3
1 .TH GENCGM 1 "11 May 1995"
2 .LP
3 .SH NAME
4 .LP
5 gencgm \- generate a cochleogram
6 .LP
7 .SH SYNOPSIS
8 .LP
9 gencgm [ option=value | -option ] [ filename ]
10 .LP
11 .SH DESCRIPTION
12 .LP
13
14 Gencgm converts the input wave into a simulated neural activity
15 pattern (NAP) and summarises the NAP as a sequence of excitation
16 patterns (EPNs) that collectively form a 'cochleogram' (CGM). The
17 operation takes place in three stages: spectral analysis, neural
18 encoding, and temporal integration. In the spectral analysis stage,
19 the input wave is converted into an array of filtered waves, one for
20 each channel of a gammatone auditory filterbank. The surface of the
21 array of filtered waves is AIM's representation of basilar membrane
22 motion (BMM) as a function of time (manaim genbmm). In the neural
23 encoding stage, compression, adaptation and suppression, are used to
24 convert each wave from the filterbank into a simulation of the
25 aggregate neural response to that wave. The array of responses is
26 AIM's simulation of the neural activity pattern (NAP) in the auditory
27 nerve at about the level of the cochlear nucleus (manaim gennap).
28 Finally, the NAP is converted into a sequence of excitation patterns
29 (EPNs) by calculating the envelope of the NAP and extracting spectral
30 slices from the envelope every 'frstep_epn' ms. The envelope is
31 calculated continuously, by lowpass filtering the individual channels
32 of the NAP as they flow from the cochlea simulation.
33 .LP
34 When the sequence of excitation patterns is presented in spectrogram
35 format, it is referred to as a 'cochleogram' (CGM). The spectrogram
36 format has time on the abscissa (x-axis), filter centre-frequency on
37 the ordinate (y-axis), and activity level as the degree of black in
38 the display. In AIM, the suffix 'cgm' is used to distinguish this
39 spectral representation from the other spectral representations
40 provided by the software ('asa' auditory spectral analysis, 'sgm'
41 auditory spectrogram, and 'epn' excitation pattern).
42 .LP
43 The NAP generated by gencgm is the same as that produced by gennap
44 (manaim gennap). The primary differences are in the display defaults
45 and the inclusion of the Leaky Integration used to construct the
46 excitation patterns that form the cochleogram. As a result, this
47 manual entry is restricted to describing the option values that differ
48 from those in gennap and the additional options required to control
49 the Leaky Integration.
50 .LP
51 .SH DISPLAY DEFAULTS
52 .LP
53 The default values for three of the display options are reset to
54 produce a spectrographic format rather than a landscape. Specifically,
55 display=greyscale, bottom=0 and top=2500. The number of channels is
56 set to 128 for compatibility with the auditory spectrum modules,
57 genasa and genepn. When using AIM as a preprocessor for speech
58 recognition the number of channels would typically be reduced to
59 between 24 and 32. Use option 'downsample' if it is necessary to
60 reduce the output to less than 24 channels across the speech range.
61 .LP
62 .SH COMPRESSION AND LEAKY INTEGRATION
63 .LP
64 Compression and lowpass filtering are activated after the neural
65 encoding stage:
66 .LP
67 .SS "Compression"
68 .PP
69 Cochleograms are usually produced via the functional route in AIM. In
70 this case, compress is set on
71 .LP
72 .TP 13
73 compress
74 Logarithmic compressor switch
75 .RS
76 Switch. Default: on.
77 .RE
78 .RS
79 .LP
80 Note: The compressor in the functional route of AIM is logarithmic and
81 it screens out negative BMM values before compression. This rectifies
82 the wave during the compression process and so the separate rectify
83 option is left off.
84 .RE
85 .LP
86 .RS
87 .LP
88 Note: The compressor in the physiological route of AIM is an integral
89 part of the tlf module, so when using this route to produce a
90 cochleogram, turn off the logarithmic compressor
91 (i.e. compress=off). The compressor in tlf does not screen out
92 negative values so it is also important to set rectify=on.
93 .RE
94 .RS
95 .LP
96 Full wave rectification is produced if rectify is set to 2. This will
97 lead to a smoother cochleogram from both the physiological and the
98 functional versions of AIM.
99 .RE
100 .LP
101 .SS "Transduction"
102 .PP
103 .LP
104 .TP 13
105 transduction
106 Neural transduction switch (at, meddis, off)
107 .RS
108 Switch. Default: at.
109
110 .RE
111 .LP
112 .SS "Leaky Integration"
113 .PP
114 .LP
115 .TP 13
116 stages_idt
117 Number of stages of lowpass filtering
118 .RS
119 Default unit: scalar. Default value: 2
120 .RE
121 .TP 13
122 tup_idt
123 The time constant for each filter stage
124 .RS
125 Default unit: ms. Default value: 8 ms.
126 .RE
127 .LP
128 The Equivalent Rectandular Duration (ERD) of a two stage lowpass
129 filter is about 1.6 times the time constant of each stage, or
130 12.8 ms in the current case.
131 .TP 13
132 frstep_epn
133 The time between successive spectral frames
134 .RS
135 Default unit: ms. Default value: 10 ms.
136 .RE
137 .LP
138 With a frstep_epn of 10 ms, gencgm will produce spectral frames at a
139 rate of 100 per second.
140 .LP
141 .TP 13
142 downsample
143 The time between successive spectral frames.
144 .RS
145 Default unit: ms. Default value: 10 ms.
146 .RE
147 .LP
148 Downsample is simply another name for frstep_epn, provided to
149 facilitate a different mode of thinking about time-series data.
150 .LP
151 .SH FILES
152 .LP
153 .TP 13
154 .gencgmrc
155 The options file for gencgm.
156 .LP
157 .SH SEE ALSO
158 .LP
159 gensgm, genasa, genepn, gennap, genbmm
160 .LP
161 .SH BUGS
162 .LP
163 None currently known.
164 .SH COPYRIGHT
165 .LP
166 Copyright (c) Applied Psychology Unit, Medical Research Council, 1995
167 .LP
168 Permission to use, copy, modify, and distribute this software without fee
169 is hereby granted for research purposes, provided that this copyright
170 notice appears in all copies and in all supporting documentation, and that
171 the software is not redistributed for any fee (except for a nominal
172 shipping charge). Anyone wanting to incorporate all or part of this
173 software in a commercial product must obtain a license from the Medical
174 Research Council.
175 .LP
176 The MRC makes no representations about the suitability of this
177 software for any purpose. It is provided "as is" without express or
178 implied warranty.
179 .LP
180 THE MRC DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING
181 ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS, IN NO EVENT SHALL
182 THE A.P.U. BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES
183 OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS,
184 WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION,
185 ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS
186 SOFTWARE.
187 .LP
188 .SH ACKNOWLEDGEMENTS
189 .LP
190 The AIM software was developed for Unix workstations by John
191 Holdsworth and Mike Allerhand of the MRC APU, under the direction of
192 Roy Patterson. The physiological version of AIM was developed by
193 Christian Giguere. The options handler is by Paul Manson. The revised
194 SAI module is by Jay Datta. Michael Akeroyd extended the postscript
195 facilites and developed the xreview routine for auditory image
196 cartoons.
197 .LP
198 The project was supported by the MRC and grants from the U.K. Defense
199 Research Agency, Farnborough (Research Contract 2239); the EEC Esprit
200 BR Porgramme, Project ACTS (3207); and the U.K. Hearing Research Trust.
201