Mercurial > hg > aim92
comparison man/man1/gencgm.1 @ 0:5242703e91d3 tip
Initial checkin for AIM92 aimR8.2 (last updated May 1997).
author | tomwalters |
---|---|
date | Fri, 20 May 2011 15:19:45 +0100 |
parents | |
children |
comparison
equal
deleted
inserted
replaced
-1:000000000000 | 0:5242703e91d3 |
---|---|
1 .TH GENCGM 1 "11 May 1995" | |
2 .LP | |
3 .SH NAME | |
4 .LP | |
5 gencgm \- generate a cochleogram | |
6 .LP | |
7 .SH SYNOPSIS | |
8 .LP | |
9 gencgm [ option=value | -option ] [ filename ] | |
10 .LP | |
11 .SH DESCRIPTION | |
12 .LP | |
13 | |
14 Gencgm converts the input wave into a simulated neural activity | |
15 pattern (NAP) and summarises the NAP as a sequence of excitation | |
16 patterns (EPNs) that collectively form a 'cochleogram' (CGM). The | |
17 operation takes place in three stages: spectral analysis, neural | |
18 encoding, and temporal integration. In the spectral analysis stage, | |
19 the input wave is converted into an array of filtered waves, one for | |
20 each channel of a gammatone auditory filterbank. The surface of the | |
21 array of filtered waves is AIM's representation of basilar membrane | |
22 motion (BMM) as a function of time (manaim genbmm). In the neural | |
23 encoding stage, compression, adaptation and suppression, are used to | |
24 convert each wave from the filterbank into a simulation of the | |
25 aggregate neural response to that wave. The array of responses is | |
26 AIM's simulation of the neural activity pattern (NAP) in the auditory | |
27 nerve at about the level of the cochlear nucleus (manaim gennap). | |
28 Finally, the NAP is converted into a sequence of excitation patterns | |
29 (EPNs) by calculating the envelope of the NAP and extracting spectral | |
30 slices from the envelope every 'frstep_epn' ms. The envelope is | |
31 calculated continuously, by lowpass filtering the individual channels | |
32 of the NAP as they flow from the cochlea simulation. | |
33 .LP | |
34 When the sequence of excitation patterns is presented in spectrogram | |
35 format, it is referred to as a 'cochleogram' (CGM). The spectrogram | |
36 format has time on the abscissa (x-axis), filter centre-frequency on | |
37 the ordinate (y-axis), and activity level as the degree of black in | |
38 the display. In AIM, the suffix 'cgm' is used to distinguish this | |
39 spectral representation from the other spectral representations | |
40 provided by the software ('asa' auditory spectral analysis, 'sgm' | |
41 auditory spectrogram, and 'epn' excitation pattern). | |
42 .LP | |
43 The NAP generated by gencgm is the same as that produced by gennap | |
44 (manaim gennap). The primary differences are in the display defaults | |
45 and the inclusion of the Leaky Integration used to construct the | |
46 excitation patterns that form the cochleogram. As a result, this | |
47 manual entry is restricted to describing the option values that differ | |
48 from those in gennap and the additional options required to control | |
49 the Leaky Integration. | |
50 .LP | |
51 .SH DISPLAY DEFAULTS | |
52 .LP | |
53 The default values for three of the display options are reset to | |
54 produce a spectrographic format rather than a landscape. Specifically, | |
55 display=greyscale, bottom=0 and top=2500. The number of channels is | |
56 set to 128 for compatibility with the auditory spectrum modules, | |
57 genasa and genepn. When using AIM as a preprocessor for speech | |
58 recognition the number of channels would typically be reduced to | |
59 between 24 and 32. Use option 'downsample' if it is necessary to | |
60 reduce the output to less than 24 channels across the speech range. | |
61 .LP | |
62 .SH COMPRESSION AND LEAKY INTEGRATION | |
63 .LP | |
64 Compression and lowpass filtering are activated after the neural | |
65 encoding stage: | |
66 .LP | |
67 .SS "Compression" | |
68 .PP | |
69 Cochleograms are usually produced via the functional route in AIM. In | |
70 this case, compress is set on | |
71 .LP | |
72 .TP 13 | |
73 compress | |
74 Logarithmic compressor switch | |
75 .RS | |
76 Switch. Default: on. | |
77 .RE | |
78 .RS | |
79 .LP | |
80 Note: The compressor in the functional route of AIM is logarithmic and | |
81 it screens out negative BMM values before compression. This rectifies | |
82 the wave during the compression process and so the separate rectify | |
83 option is left off. | |
84 .RE | |
85 .LP | |
86 .RS | |
87 .LP | |
88 Note: The compressor in the physiological route of AIM is an integral | |
89 part of the tlf module, so when using this route to produce a | |
90 cochleogram, turn off the logarithmic compressor | |
91 (i.e. compress=off). The compressor in tlf does not screen out | |
92 negative values so it is also important to set rectify=on. | |
93 .RE | |
94 .RS | |
95 .LP | |
96 Full wave rectification is produced if rectify is set to 2. This will | |
97 lead to a smoother cochleogram from both the physiological and the | |
98 functional versions of AIM. | |
99 .RE | |
100 .LP | |
101 .SS "Transduction" | |
102 .PP | |
103 .LP | |
104 .TP 13 | |
105 transduction | |
106 Neural transduction switch (at, meddis, off) | |
107 .RS | |
108 Switch. Default: at. | |
109 | |
110 .RE | |
111 .LP | |
112 .SS "Leaky Integration" | |
113 .PP | |
114 .LP | |
115 .TP 13 | |
116 stages_idt | |
117 Number of stages of lowpass filtering | |
118 .RS | |
119 Default unit: scalar. Default value: 2 | |
120 .RE | |
121 .TP 13 | |
122 tup_idt | |
123 The time constant for each filter stage | |
124 .RS | |
125 Default unit: ms. Default value: 8 ms. | |
126 .RE | |
127 .LP | |
128 The Equivalent Rectandular Duration (ERD) of a two stage lowpass | |
129 filter is about 1.6 times the time constant of each stage, or | |
130 12.8 ms in the current case. | |
131 .TP 13 | |
132 frstep_epn | |
133 The time between successive spectral frames | |
134 .RS | |
135 Default unit: ms. Default value: 10 ms. | |
136 .RE | |
137 .LP | |
138 With a frstep_epn of 10 ms, gencgm will produce spectral frames at a | |
139 rate of 100 per second. | |
140 .LP | |
141 .TP 13 | |
142 downsample | |
143 The time between successive spectral frames. | |
144 .RS | |
145 Default unit: ms. Default value: 10 ms. | |
146 .RE | |
147 .LP | |
148 Downsample is simply another name for frstep_epn, provided to | |
149 facilitate a different mode of thinking about time-series data. | |
150 .LP | |
151 .SH FILES | |
152 .LP | |
153 .TP 13 | |
154 .gencgmrc | |
155 The options file for gencgm. | |
156 .LP | |
157 .SH SEE ALSO | |
158 .LP | |
159 gensgm, genasa, genepn, gennap, genbmm | |
160 .LP | |
161 .SH BUGS | |
162 .LP | |
163 None currently known. | |
164 .SH COPYRIGHT | |
165 .LP | |
166 Copyright (c) Applied Psychology Unit, Medical Research Council, 1995 | |
167 .LP | |
168 Permission to use, copy, modify, and distribute this software without fee | |
169 is hereby granted for research purposes, provided that this copyright | |
170 notice appears in all copies and in all supporting documentation, and that | |
171 the software is not redistributed for any fee (except for a nominal | |
172 shipping charge). Anyone wanting to incorporate all or part of this | |
173 software in a commercial product must obtain a license from the Medical | |
174 Research Council. | |
175 .LP | |
176 The MRC makes no representations about the suitability of this | |
177 software for any purpose. It is provided "as is" without express or | |
178 implied warranty. | |
179 .LP | |
180 THE MRC DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING | |
181 ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS, IN NO EVENT SHALL | |
182 THE A.P.U. BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES | |
183 OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, | |
184 WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, | |
185 ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS | |
186 SOFTWARE. | |
187 .LP | |
188 .SH ACKNOWLEDGEMENTS | |
189 .LP | |
190 The AIM software was developed for Unix workstations by John | |
191 Holdsworth and Mike Allerhand of the MRC APU, under the direction of | |
192 Roy Patterson. The physiological version of AIM was developed by | |
193 Christian Giguere. The options handler is by Paul Manson. The revised | |
194 SAI module is by Jay Datta. Michael Akeroyd extended the postscript | |
195 facilites and developed the xreview routine for auditory image | |
196 cartoons. | |
197 .LP | |
198 The project was supported by the MRC and grants from the U.K. Defense | |
199 Research Agency, Farnborough (Research Contract 2239); the EEC Esprit | |
200 BR Porgramme, Project ACTS (3207); and the U.K. Hearing Research Trust. | |
201 |