tomwalters@0
|
1 .TH GENWAV 1 "11 May 1995"
|
tomwalters@0
|
2 .LP
|
tomwalters@0
|
3 .SH NAME
|
tomwalters@0
|
4 .LP
|
tomwalters@0
|
5 genwav \- display the wave in filename.
|
tomwalters@0
|
6 .LP
|
tomwalters@0
|
7 .SH SYNOPSIS
|
tomwalters@0
|
8 .LP
|
tomwalters@0
|
9 genwav [ option=value | -option ] [ filename ]
|
tomwalters@0
|
10 .LP
|
tomwalters@0
|
11 .SH DESCRIPTION
|
tomwalters@0
|
12 .LP
|
tomwalters@0
|
13
|
tomwalters@0
|
14 Genwav sets up and Xwindow and displays a segment of the input wave
|
tomwalters@0
|
15 in the window. The size of the window and the size of the wave are
|
tomwalters@0
|
16 determined by options, as are a number of other input/output
|
tomwalters@0
|
17 functions. These options have no direct bearing on the auditory
|
tomwalters@0
|
18 processing performed by AIM. For convenience, these Non-Auditory
|
tomwalters@0
|
19 options are associated with the instruction genwav (the one
|
tomwalters@0
|
20 non-auditory instruction), and they are listed at the top of the
|
tomwalters@0
|
21 options tables prior to the auditory options.
|
tomwalters@0
|
22
|
tomwalters@0
|
23 .LP
|
tomwalters@0
|
24 There are three classes of Non-Auditory options:
|
tomwalters@0
|
25 .LP
|
tomwalters@0
|
26 I) DISPLAY OPTIONS that determine the format of the auditory representations
|
tomwalters@0
|
27 of sound on the screen, or on paper when printed.
|
tomwalters@0
|
28 .LP
|
tomwalters@0
|
29 II) OUTPUT OPTIONS that determine the format and content of files used
|
tomwalters@0
|
30 to store the auditory representations of sounds.
|
tomwalters@0
|
31 .LP
|
tomwalters@0
|
32 III) INPUT OPTIONS that determine how the wave in the input file should
|
tomwalters@0
|
33 be interpreted.
|
tomwalters@0
|
34 .LP
|
tomwalters@0
|
35 The output options are presented before the input options so that the
|
tomwalters@0
|
36 input options will be adjacent to the filterbank options in the
|
tomwalters@0
|
37 options tables produced by genbmm and subsequent instructions.
|
tomwalters@0
|
38
|
tomwalters@0
|
39 .SS
|
tomwalters@0
|
40 I. DISPLAY OPTIONS
|
tomwalters@0
|
41 .LP
|
tomwalters@0
|
42
|
tomwalters@0
|
43 The AIM modules produce output in the form of a set of functions, one
|
tomwalters@0
|
44 for each channel of the auditory filterbank. For example, the output
|
tomwalters@0
|
45 of genbmm is a set of functions that simulate basilar membrane motion
|
tomwalters@0
|
46 produced in response to the input wave. By default, the AIM software
|
tomwalters@0
|
47 puts an Xwindow up on the computer screen and displays the output in
|
tomwalters@0
|
48 the window. This section describes the options that control these
|
tomwalters@0
|
49 displays.
|
tomwalters@0
|
50
|
tomwalters@0
|
51 .LP
|
tomwalters@0
|
52 The display options are: title, display, x0-win, y0-win, width_win,
|
tomwalters@0
|
53 height_win, display, view, top, bottom, overlap, headroom,
|
tomwalters@0
|
54 magnification, pensize, hiddenline.
|
tomwalters@0
|
55 .LP
|
tomwalters@0
|
56 A. The Display Window Title, Position, and Size
|
tomwalters@0
|
57 .RS 3
|
tomwalters@0
|
58
|
tomwalters@0
|
59 .LP
|
tomwalters@0
|
60 title Title of output display.
|
tomwalters@0
|
61 .RS 5
|
tomwalters@0
|
62 Character string. Default: input file name.
|
tomwalters@0
|
63 .RE
|
tomwalters@0
|
64 .LP
|
tomwalters@0
|
65 The title of the output being displayed. If no title is given, the
|
tomwalters@0
|
66 display bears the name of the file of the input wave.
|
tomwalters@0
|
67
|
tomwalters@0
|
68 .LP
|
tomwalters@0
|
69 display Display output on screen
|
tomwalters@0
|
70 .RS 5
|
tomwalters@0
|
71 Switch. Default: on.
|
tomwalters@0
|
72 .RE
|
tomwalters@0
|
73 .LP
|
tomwalters@0
|
74
|
tomwalters@0
|
75 Normally this switch is on and a bitmap of the output is displayed in
|
tomwalters@0
|
76 a graphical window on the computer screen. The switch is provided
|
tomwalters@0
|
77 because the time taken to create the displays is considerable, and it
|
tomwalters@0
|
78 is useful to turn it dsiplay off using AIM as a preprocessor for
|
tomwalters@0
|
79 speech recognition.
|
tomwalters@0
|
80
|
tomwalters@0
|
81 .LP
|
tomwalters@0
|
82 x0_win Left edge of window
|
tomwalters@0
|
83 .RS 5
|
tomwalters@0
|
84 Unit: pixels. Default: centre.
|
tomwalters@0
|
85 .RE
|
tomwalters@0
|
86 .LP
|
tomwalters@0
|
87 The left edge of the window into which the display will be drawn,
|
tomwalters@0
|
88 relative to the left edge of the screen (i.e. the x-coordinate of the
|
tomwalters@0
|
89 window within the screen). A value of centre will cause centring in
|
tomwalters@0
|
90 the horizontal dimension (provided the window manager does not
|
tomwalters@0
|
91 override).
|
tomwalters@0
|
92 .LP
|
tomwalters@0
|
93 y0-win Lower edge of window
|
tomwalters@0
|
94 .RS 5
|
tomwalters@0
|
95 Unit: pixels. Default: centre.
|
tomwalters@0
|
96 .RE
|
tomwalters@0
|
97 .LP
|
tomwalters@0
|
98 The lower edge of the window into which the display will be drawn,
|
tomwalters@0
|
99 relative to the lower edge of the screen (i.e. the y-coordinate of the
|
tomwalters@0
|
100 window within the screen). A value of centre will cause centring in
|
tomwalters@0
|
101 the vertical dimension (provided the window manager does not
|
tomwalters@0
|
102 override).
|
tomwalters@0
|
103 .LP
|
tomwalters@0
|
104 Taken as a pair x0_win and y0-win determine the origin of the window,
|
tomwalters@0
|
105 relative to the screen origin which is assumed to be the lower left
|
tomwalters@0
|
106 corner of the screen.
|
tomwalters@0
|
107 .LP
|
tomwalters@0
|
108 width_win Window width
|
tomwalters@0
|
109 .RS 5
|
tomwalters@0
|
110 Unit: pixels. Default: 640.
|
tomwalters@0
|
111 .RE
|
tomwalters@0
|
112 .LP
|
tomwalters@0
|
113 The width of the window into which the display will be drawn.
|
tomwalters@0
|
114 .LP
|
tomwalters@0
|
115 height_win Window height
|
tomwalters@0
|
116 .RS 6
|
tomwalters@0
|
117 Unit: pixels. Default: 480.
|
tomwalters@0
|
118 .RE
|
tomwalters@0
|
119 .LP
|
tomwalters@0
|
120 The height of the window into which the display will be drawn.
|
tomwalters@0
|
121 .RE
|
tomwalters@0
|
122
|
tomwalters@0
|
123
|
tomwalters@0
|
124 .LP
|
tomwalters@0
|
125 B. Display Controls
|
tomwalters@0
|
126 .RS 3
|
tomwalters@0
|
127 .LP
|
tomwalters@0
|
128 top The largest postive value visible in the display
|
tomwalters@0
|
129 .LP
|
tomwalters@0
|
130 Scalar. Default value: 1024 (for genwav)
|
tomwalters@0
|
131 .LP
|
tomwalters@0
|
132 Each of the functions in the multi-channel output of a module is
|
tomwalters@0
|
133 displayed in a transparent window. Provided the channel density is not
|
tomwalters@0
|
134 too low, the functions are related and the set of functions produces a
|
tomwalters@0
|
135 display that looks like a complex landscape. Top determines the
|
tomwalters@0
|
136 largest positive value that will appear in the transparent windows of
|
tomwalters@0
|
137 the individual functions, so top must be as large as the largest value
|
tomwalters@0
|
138 in the full set of functions. Increasing top has the effect of moving
|
tomwalters@0
|
139 the viewer farther up above the landscape.
|
tomwalters@0
|
140 .LP
|
tomwalters@0
|
141 bottom The largest negative value visible in the
|
tomwalters@0
|
142 .RS 5
|
tomwalters@0
|
143 display
|
tomwalters@0
|
144 .RE
|
tomwalters@0
|
145 .RS 5
|
tomwalters@0
|
146 Scalar. Default value: -1024 (for genwav)
|
tomwalters@0
|
147 .RE
|
tomwalters@0
|
148 .LP
|
tomwalters@0
|
149 Bottom determines the largest negative value that will appear in the
|
tomwalters@0
|
150 transparent windows of the individual functions, so bottom must be as
|
tomwalters@0
|
151 large in the negative direction as the largest negative value in the
|
tomwalters@0
|
152 full set of functions. Increasing bottom in the negative direction has
|
tomwalters@0
|
153 the effect of depeening the valleys in the landscape.
|
tomwalters@0
|
154 .LP
|
tomwalters@0
|
155 overlap The overlap of transparent windows of the
|
tomwalters@0
|
156 .RS 5
|
tomwalters@0
|
157 individual functions
|
tomwalters@0
|
158 .RE
|
tomwalters@0
|
159 .RS 5
|
tomwalters@0
|
160 Scalar: percentage. Default value: 50%
|
tomwalters@0
|
161 .RE
|
tomwalters@0
|
162 .LP
|
tomwalters@0
|
163 The fact that the output functions are related means that they
|
tomwalters@0
|
164 fit up under each other in the display in a way that concentrates the
|
tomwalters@0
|
165 lines on the landscape and improves the display.
|
tomwalters@0
|
166 .LP
|
tomwalters@0
|
167 headroom Display with headroom for the uppermost channel
|
tomwalters@0
|
168 .RS 5
|
tomwalters@0
|
169 Scalar: percentage. Default value: 0%
|
tomwalters@0
|
170 .RE
|
tomwalters@0
|
171 .LP
|
tomwalters@0
|
172 Because of the overlap of the transparent windows, part of the
|
tomwalters@0
|
173 uppermost transparent window is hidden by the upper edge of the
|
tomwalters@0
|
174 display window. This can cause truncation of the waves in the upper
|
tomwalters@0
|
175 channels. To avoid truncation, headroom enables the user to specify
|
tomwalters@0
|
176 that the highest channel ought to be centred below the upper edge of
|
tomwalters@0
|
177 the window. The value specified is taken to be the percentage of the
|
tomwalters@0
|
178 window between the zero line of the upper channel and the upper edge
|
tomwalters@0
|
179 of the window.
|
tomwalters@0
|
180 .LP
|
tomwalters@0
|
181 magnification Display magnification
|
tomwalters@0
|
182 .RS 9
|
tomwalters@0
|
183 Scalar. Default: 1.0.
|
tomwalters@0
|
184 .RE
|
tomwalters@0
|
185 .LP
|
tomwalters@0
|
186 The degree to which the amplitude of the functions in the display
|
tomwalters@0
|
187 should be magnified before being displayed. This parameter is merely
|
tomwalters@0
|
188 for adjusting the visual contrast of the display. The magnification
|
tomwalters@0
|
189 option is a multiplier, so a value of 1 implies drawing to scale,
|
tomwalters@0
|
190 while a value of 10 implies ten times (10x) the size of values in the
|
tomwalters@0
|
191 module output and 0.1 implies one tenth of the output size.
|
tomwalters@0
|
192 Magnification is related to, but separate from, the gain options which
|
tomwalters@0
|
193 affect the values of the output functions and the values stored in any
|
tomwalters@0
|
194 output files. Magnification is an alternative means of controlling the
|
tomwalters@0
|
195 size of the functions in the display -- alternative to top and bottom.
|
tomwalters@0
|
196 .LP
|
tomwalters@0
|
197 pensize The size of the lines in the displays and the
|
tomwalters@0
|
198 .RS 5
|
tomwalters@0
|
199 dots on the spiral
|
tomwalters@0
|
200 .RE
|
tomwalters@0
|
201 .RS 5
|
tomwalters@0
|
202 Unit: pixels. Default: 1.
|
tomwalters@0
|
203 .RE
|
tomwalters@0
|
204 .LP
|
tomwalters@0
|
205 This option allows the user to specify the thickness of the lines in
|
tomwalters@0
|
206 the display and the size of the dots on spiral auditory images. It
|
tomwalters@0
|
207 also affects the lines and dots in postscript plots. It is provided
|
tomwalters@0
|
208 primarily for use with printers which have much more resolution than
|
tomwalters@0
|
209 computer screens. On laser printers a value of 3-5 gives reasonable
|
tomwalters@0
|
210 line thickness. On the screen, a linewidth greater than 1 produces
|
tomwalters@0
|
211 slow drawing, and a gagged, blurred display.
|
tomwalters@0
|
212 .LP
|
tomwalters@0
|
213 hiddenline Draw with overlapping parts of functions
|
tomwalters@0
|
214 .RS 5
|
tomwalters@0
|
215 hidden
|
tomwalters@0
|
216 .RE
|
tomwalters@0
|
217 .RS 5
|
tomwalters@0
|
218 Switch. Default: on.
|
tomwalters@0
|
219 .RE
|
tomwalters@0
|
220 .LP
|
tomwalters@0
|
221 This switch specifies whether or not a 'hidden line' algorithm should
|
tomwalters@0
|
222 be used when drawing the display. It also affects printed displays.
|
tomwalters@0
|
223 In almost all cases, hiddenline results in more attractive displays of
|
tomwalters@0
|
224 waveforms, and it often makes complex displays easier to understand,
|
tomwalters@0
|
225 so the default is 'on'. Note: hiddenline almost doubles the drawing
|
tomwalters@0
|
226 time so it is sometimes useful to switch it off on slower machines.
|
tomwalters@0
|
227 .LP
|
tomwalters@0
|
228
|
tomwalters@0
|
229 .SS
|
tomwalters@0
|
230 II. OUTPUT OPTIONS
|
tomwalters@0
|
231 .RS 3
|
tomwalters@0
|
232 .LP
|
tomwalters@0
|
233 The output options are listed and described before the input options
|
tomwalters@0
|
234 so that the input options will be adjacent to the filterbank options
|
tomwalters@0
|
235 in the listings produced by genbmm and subsequent modules. The output
|
tomwalters@0
|
236 options are downchannel, erase_ctn, animate_ctn, bitmap_ctn,
|
tomwalters@0
|
237 postscript, output, and header.
|
tomwalters@0
|
238 .LP
|
tomwalters@0
|
239 downchannel Average adjacent channels of multichannel
|
tomwalters@0
|
240 .RS 7
|
tomwalters@0
|
241 representations
|
tomwalters@0
|
242 .RE
|
tomwalters@0
|
243 .RS 7
|
tomwalters@0
|
244 Units: Number of averagings.
|
tomwalters@0
|
245 .RE
|
tomwalters@0
|
246 .RS 7
|
tomwalters@0
|
247 Default value: 0.
|
tomwalters@0
|
248 .RE
|
tomwalters@0
|
249 .LP
|
tomwalters@0
|
250
|
tomwalters@0
|
251 There is interaction between channels in the transmission-line
|
tomwalters@0
|
252 filterbank of the physiological version of AIM, and in the neural
|
tomwalters@0
|
253 encoding of the functional version of AIM. The minimum channel
|
tomwalters@0
|
254 density for these processes to operate properly is four channels per
|
tomwalters@0
|
255 ERB and 2 channels per ERB, respectively. For broadband signals like
|
tomwalters@0
|
256 speech this means that the minimum number of channels is on the order
|
tomwalters@0
|
257 of 128 and 64, respectively. This channel density can produce
|
tomwalters@0
|
258 cluttered displays, and more importantly, it is far too many channels
|
tomwalters@0
|
259 for current speech recognition systems which typically use 12-24
|
tomwalters@0
|
260 channels. This is not just a computer power problem; the recognition
|
tomwalters@0
|
261 systems actually perform less well with extra channels. Accordingly,
|
tomwalters@0
|
262 the option 'downchannel' provides the option of reducing the channel
|
tomwalters@0
|
263 density at output, so that AIM can operate with the appropriate
|
tomwalters@0
|
264 channel density and still provide output that is compatible with
|
tomwalters@0
|
265 displays and speech recognition systems.
|
tomwalters@0
|
266
|
tomwalters@0
|
267 .LP
|
tomwalters@0
|
268 Downchannel averages pairs of adjacent channels and the option value
|
tomwalters@0
|
269 specifies how many times it should execute the averaging process. Each
|
tomwalters@0
|
270 averaging reduces the number of channels by a factor of 2, so for
|
tomwalters@0
|
271 proper transmission-line filtering and an output file with 16
|
tomwalters@0
|
272 channels, set channels_afb=128 and downchannel=3 (three successive
|
tomwalters@0
|
273 halvings of the number of channels).
|
tomwalters@0
|
274
|
tomwalters@0
|
275
|
tomwalters@0
|
276 .LP
|
tomwalters@0
|
277 A. Animated Cartoons
|
tomwalters@0
|
278 .LP
|
tomwalters@0
|
279 .RS 3
|
tomwalters@0
|
280 Four of the AIM instructions produce output in the form of sequences
|
tomwalters@0
|
281 of spectral frames (gensgm, gencgm, genasa and genepn). Bitmap
|
tomwalters@0
|
282 versions of the displays of the frames can be stored by AIM and
|
tomwalters@0
|
283 replayed by review and xreview. When the sequence of frames is played
|
tomwalters@0
|
284 rapidly, it appears as an animated cartoon that shows the dynamic
|
tomwalters@0
|
285 behaviour of the spectrum of the sound.
|
tomwalters@0
|
286 .LP
|
tomwalters@0
|
287 Similarly, the AIM instructions for auditory images (gensai and
|
tomwalters@0
|
288 genspl) produce sequences of landscape frames, and bitmap versions of
|
tomwalters@0
|
289 the landscape displays can also be stored by AIM and replayed by
|
tomwalters@0
|
290 review and xreview. Indeed, it was the desire to produce auditory
|
tomwalters@0
|
291 image cartoons that led to the development of much of the AIM software
|
tomwalters@0
|
292 package. The animated cartoons or auditory images show the dynamic
|
tomwalters@0
|
293 behaviour of features in the images, like the motion of formants in
|
tomwalters@0
|
294 diphthongs and the motion of notes in a melody.
|
tomwalters@0
|
295 .LP
|
tomwalters@0
|
296 This section describes the options that control the construction and
|
tomwalters@0
|
297 storage of sequences of bitmaps; there is a separate manual entries for
|
tomwalters@0
|
298 the xreview routine that replays the bitmaps ( 'manaim xreview').
|
tomwalters@0
|
299
|
tomwalters@0
|
300
|
tomwalters@0
|
301 .LP
|
tomwalters@0
|
302 erase_ctn Erase the current frame before presenting
|
tomwalters@0
|
303 .RS 7
|
tomwalters@0
|
304 the next frame
|
tomwalters@0
|
305 .RE
|
tomwalters@0
|
306 .RS 7
|
tomwalters@0
|
307 Switch. Default value: on.
|
tomwalters@0
|
308 .RE
|
tomwalters@0
|
309 .LP
|
tomwalters@0
|
310
|
tomwalters@0
|
311 Normally, when presenting a sequence of frames as an animated cartoon,
|
tomwalters@0
|
312 one wants to erase the current frame before presenting the next. When
|
tomwalters@0
|
313 the frames are spectra, however, the set of frames can together form a
|
tomwalters@0
|
314 meaningful display; for example, the set of rising spectra produced at
|
tomwalters@0
|
315 the onset of a sound produces a contour map of the onset. The option
|
tomwalters@0
|
316 erase_ctn enables the user to observe the full set of spectra
|
tomwalters@0
|
317 simultaneously. (See aimdemo_gtf_spectra or aimdemo_tlf_spectra ).
|
tomwalters@0
|
318
|
tomwalters@0
|
319 .LP
|
tomwalters@0
|
320 animate_ctn Store frames in memory and replay all of
|
tomwalters@0
|
321 .RS 7
|
tomwalters@0
|
322 them as a cartoon
|
tomwalters@0
|
323 .RE
|
tomwalters@0
|
324 .RS 7
|
tomwalters@0
|
325 Switch. Default value: off.
|
tomwalters@0
|
326 .RE
|
tomwalters@0
|
327 .LP
|
tomwalters@0
|
328 When this option is on, AIM stores the bitmaps of the frames it
|
tomwalters@0
|
329 produces in the memory of the machine and replays them rapidly when
|
tomwalters@0
|
330 the instruction is complete. Type RETURN to animate the cartoon again;
|
tomwalters@0
|
331 type 'q RETURN' to exit the instruction. (This option was important
|
tomwalters@0
|
332 when machines were slower and before the availability of review and
|
tomwalters@0
|
333 xreview. It is now largely obsolete.)
|
tomwalters@0
|
334 .LP
|
tomwalters@0
|
335 bitmap_ctn Store bitmaps of frames in a file for
|
tomwalters@0
|
336 .RS 7
|
tomwalters@0
|
337 replay as a cartoon
|
tomwalters@0
|
338 .RE
|
tomwalters@0
|
339 .RS 7
|
tomwalters@0
|
340 Switch. Default value: off.
|
tomwalters@0
|
341 .RE
|
tomwalters@0
|
342 .LP
|
tomwalters@0
|
343 When this option is on, bitmaps of the frames produced for the input
|
tomwalters@0
|
344 in file_name will be stored in file_name.ctn. The sequence of frames can later be replayed using either
|
tomwalters@0
|
345 .LP
|
tomwalters@0
|
346 > review file_name or
|
tomwalters@0
|
347 .LP
|
tomwalters@0
|
348 > xreview file_name
|
tomwalters@0
|
349 .LP
|
tomwalters@0
|
350 Both of these programs enable the user to vary the rate of animation,
|
tomwalters@0
|
351 the section of the sequence to be view, etc. The xreview version has a
|
tomwalters@0
|
352 window interface with useful information and is the preferred version
|
tomwalters@0
|
353 in most cases.
|
tomwalters@0
|
354 .RE
|
tomwalters@0
|
355
|
tomwalters@0
|
356 .RS 3
|
tomwalters@0
|
357 B. Output Files for Printing and Postprocessing
|
tomwalters@0
|
358
|
tomwalters@0
|
359 .LP
|
tomwalters@0
|
360 Postscript Produce printer-ready output
|
tomwalters@0
|
361 .RS 7
|
tomwalters@0
|
362 Switch. Default value: off.
|
tomwalters@0
|
363 .RE
|
tomwalters@0
|
364 .LP
|
tomwalters@0
|
365 This switch causes AIM to produce a printer-ready version of the
|
tomwalters@0
|
366 displays it presents on the computer screen. For example, the NAP of
|
tomwalters@0
|
367 a 32-ms section of cegc can be printed using
|
tomwalters@0
|
368 .LP
|
tomwalters@0
|
369 > gennap length=32 postscript=on cegc | lpr -Plw
|
tomwalters@0
|
370 .LP
|
tomwalters@0
|
371 where 'lpr' is the Unix printer-driver and the 'lw' of -Plw specifies
|
tomwalters@0
|
372 the destination printer. You may need to check the name of your
|
tomwalters@0
|
373 system's printer driver and laser printer.
|
tomwalters@0
|
374 .LP
|
tomwalters@0
|
375 Alternately the postscript version of the display may be directed to a
|
tomwalters@0
|
376 file using an instruction like
|
tomwalters@0
|
377 .LP
|
tomwalters@0
|
378 > gennap length=32 postscript=on cegc > cegc_nap.ps
|
tomwalters@0
|
379 .LP
|
tomwalters@0
|
380 and printed later at the users convenience. In this example, the file
|
tomwalters@0
|
381 name cegc_nap.ps is not generated by AIM; the '_nap.ps' suffix is
|
tomwalters@0
|
382 added by the user following standard conventions to indicate that the file
|
tomwalters@0
|
383 contains a NAP in postscript form.
|
tomwalters@0
|
384
|
tomwalters@0
|
385 .RS 3
|
tomwalters@0
|
386 .LP
|
tomwalters@0
|
387 THREE POSTSCRIPT CAUTIONS:
|
tomwalters@0
|
388 .LP
|
tomwalters@0
|
389 Postscript files of landscape displays from AIM are very large. As a
|
tomwalters@0
|
390 result, we recommend
|
tomwalters@0
|
391 .LP
|
tomwalters@0
|
392 a) that you NOT switch postscript on without redirecting the output to
|
tomwalters@0
|
393 a file, as it will cause the output to be display on the screen in a
|
tomwalters@0
|
394 seemingly endless display,
|
tomwalters@0
|
395 .LP
|
tomwalters@0
|
396 b) that you be careful NOT to print postscript files on a printer
|
tomwalters@0
|
397 which does not understand the Postscript language, as it can cause the
|
tomwalters@0
|
398 printer to put out an extremely long file, one column per page!
|
tomwalters@0
|
399 .LP
|
tomwalters@0
|
400 c) that you NOT set postscript=on in an options file as it will
|
tomwalters@0
|
401 generate large files in the directory without your noticing.
|
tomwalters@0
|
402 .RE
|
tomwalters@0
|
403
|
tomwalters@0
|
404 .LP
|
tomwalters@0
|
405 output Generate an output file
|
tomwalters@0
|
406 .RS 3
|
tomwalters@0
|
407 Switch. Default value: off.
|
tomwalters@0
|
408 .RE
|
tomwalters@0
|
409 .LP
|
tomwalters@0
|
410 This switch causes the array of functions that defines AIM's
|
tomwalters@0
|
411 simulation of basilar membrane motion, or a neural activity pattern,
|
tomwalters@0
|
412 or an auditory image, to be stored in a file for subsequent processing
|
tomwalters@0
|
413 by the aimtools or other, user defined, operators. By convention, the
|
tomwalters@0
|
414 file is given the same name as the input file, but with a suffix
|
tomwalters@0
|
415 reflecting the entry point, to distinguish it from the input file on
|
tomwalters@0
|
416 the one hand and from other output files on the other hand. The naming
|
tomwalters@0
|
417 system enables the user to construct and store a set of output files
|
tomwalters@0
|
418 for one input file without the need to specify a sequence of file
|
tomwalters@0
|
419 names. The suffixes are those used to identify the modules in the
|
tomwalters@0
|
420 listing produced by 'gen -help'. So, for example, the following
|
tomwalters@0
|
421 command line:
|
tomwalters@0
|
422 .LP
|
tomwalters@0
|
423 > gennap output=on length=32 cegc
|
tomwalters@0
|
424 .LP
|
tomwalters@0
|
425 will produce an output file named cegc.nap containing a multiplexed
|
tomwalters@0
|
426 version of the functions that define the NAP of the first 32 ms of
|
tomwalters@0
|
427 cegc.
|
tomwalters@0
|
428 .LP
|
tomwalters@0
|
429 The spectrographic representations produced by gensgm and gencgm can
|
tomwalters@0
|
430 be stored in the same way, as can the sequences of spectra produced by
|
tomwalters@0
|
431 genasa and genepn. It is the output files of genasa and gencgm that
|
tomwalters@0
|
432 are used to interface AIM with speech recognition systems (Robinson et
|
tomwalters@0
|
433 al., 1990; Patterson et al., 1995; Giguere and Woodland, 1994a).
|
tomwalters@0
|
434 Details of the file formats are presented in docs/aimFileFormat.
|
tomwalters@0
|
435 .LP
|
tomwalters@0
|
436 Header Put a header on the output file
|
tomwalters@0
|
437 .RS 3
|
tomwalters@0
|
438 Flag. Default value: on.
|
tomwalters@0
|
439 .RE
|
tomwalters@0
|
440 .LP
|
tomwalters@0
|
441 By default, a header is prepended to each output file so that
|
tomwalters@0
|
442 subsequent processors have access to the history of the file. Details
|
tomwalters@0
|
443 of the header structure are presented in docs/aimFileFormat.
|
tomwalters@0
|
444 .LP
|
tomwalters@0
|
445 .RE
|
tomwalters@0
|
446
|
tomwalters@0
|
447 .SS
|
tomwalters@0
|
448 III. INPUT OPTIONS
|
tomwalters@0
|
449 .LP
|
tomwalters@0
|
450 The input options enable the user to process a subsection of the input
|
tomwalters@0
|
451 wave, and to specify characterisitcs of the wave.
|
tomwalters@0
|
452 .LP
|
tomwalters@0
|
453 The input options are: input_wave, start_wave, length_wave,
|
tomwalters@0
|
454 samplerate, swap_wave, bits_wave, dB_wave.
|
tomwalters@0
|
455 .LP
|
tomwalters@0
|
456 input_wave Default input wave name
|
tomwalters@0
|
457 .RS 13
|
tomwalters@0
|
458 Filename. Default value: none.
|
tomwalters@0
|
459 .RE
|
tomwalters@0
|
460 .LP
|
tomwalters@0
|
461 The name of the wave file to process. This option permits simple
|
tomwalters@0
|
462 repetitive processing of the same input file without repetitive typing. It
|
tomwalters@0
|
463 also enables one to circumvent the Unix convention of having the filename
|
tomwalters@0
|
464 last on the command line. This option is overridden if the user supplies a
|
tomwalters@0
|
465 wave file name at the end of the command line.
|
tomwalters@0
|
466 .LP
|
tomwalters@0
|
467 start_wave Start point in wave
|
tomwalters@0
|
468 .RS 13
|
tomwalters@0
|
469 Default unit: ms. Default value: 0.
|
tomwalters@0
|
470 .RE
|
tomwalters@0
|
471 .LP
|
tomwalters@0
|
472 The point in the input wave at which processing should begin. The
|
tomwalters@0
|
473 start_wave option is expressed in milliseconds and its default value is the
|
tomwalters@0
|
474 beginning of the file (i.e. 0 ms into the file).
|
tomwalters@0
|
475 .LP
|
tomwalters@0
|
476 length_wave Length of wave
|
tomwalters@0
|
477 .RS 13
|
tomwalters@0
|
478 Default unit: ms. Default value: remainder.
|
tomwalters@0
|
479 .RE
|
tomwalters@0
|
480 .LP
|
tomwalters@0
|
481 The number of milliseconds of the wave that ought to be processed,
|
tomwalters@0
|
482 beyond the start point. The special value 'remainder' indicates that
|
tomwalters@0
|
483 the entire length of the wave from the start point to the end of the
|
tomwalters@0
|
484 file should be processed.
|
tomwalters@0
|
485 .LP
|
tomwalters@0
|
486 samplerate Input wave sample rate
|
tomwalters@0
|
487 .RS 13
|
tomwalters@0
|
488 Default unit: Hertz. Default value: 20,000 Hz.
|
tomwalters@0
|
489 .RE
|
tomwalters@0
|
490 .LP
|
tomwalters@0
|
491 The rate at which the input wave was sampled.
|
tomwalters@0
|
492 .LP
|
tomwalters@0
|
493 swap_wave Swap the bytes in each binary pair of the
|
tomwalters@0
|
494 .RS 13
|
tomwalters@0
|
495 input file
|
tomwalters@0
|
496 .RE
|
tomwalters@0
|
497 .RS 13
|
tomwalters@0
|
498 Switch. Default: off.
|
tomwalters@0
|
499 .RE
|
tomwalters@0
|
500 .LP
|
tomwalters@0
|
501 The order of the bytes in short integers varies between manufacturers.
|
tomwalters@0
|
502 Specifically the order for Sun and HP is opposite that for DEC SGI and
|
tomwalters@0
|
503 IBM. The default setting (off) is for the latter byte order.
|
tomwalters@0
|
504 .LP
|
tomwalters@0
|
505 bits_wave Bits in the input wave
|
tomwalters@0
|
506 .RS 13
|
tomwalters@0
|
507 Unit: bits. Default: 12. (Only alternate: 16.)
|
tomwalters@0
|
508 .RE
|
tomwalters@0
|
509 .LP
|
tomwalters@0
|
510 The number of significant bits in each (16-bit) word of the input
|
tomwalters@0
|
511 wave. Note that gain_gtf or gaim_tlf should be changed to 0.0625 when
|
tomwalters@0
|
512 the number of bits is set to 16 to avoid overflow.
|
tomwalters@0
|
513 .LP
|
tomwalters@0
|
514 dB_wave Scaling of the input wave
|
tomwalters@0
|
515 .RS 13
|
tomwalters@0
|
516 (for physiological route only)
|
tomwalters@0
|
517 .RE
|
tomwalters@0
|
518 .RS 13
|
tomwalters@0
|
519 Units: dB. Default: 60 dB
|
tomwalters@0
|
520 .RE
|
tomwalters@0
|
521 .LP
|
tomwalters@0
|
522 This option enables the user to specify the relative level of
|
tomwalters@0
|
523 the input wave in decibels. It is particularly useful for
|
tomwalters@0
|
524 investigating the level-dependent properties of the
|
tomwalters@0
|
525 physiological version of AIM.
|
tomwalters@0
|
526 .LP
|
tomwalters@0
|
527 The functional route is level-independent and dB_wave is
|
tomwalters@0
|
528 ignored no matter what its value.
|
tomwalters@0
|
529 .LP
|
tomwalters@0
|
530 dB_wave can also be used to scale the input wave in absolute
|
tomwalters@0
|
531 units, i.e sound-pressure level (dB SPL), using the following
|
tomwalters@0
|
532 equation:
|
tomwalters@0
|
533 .LP
|
tomwalters@0
|
534 dB_wave = dBSPL - 20log(RMS/200)
|
tomwalters@0
|
535 .LP
|
tomwalters@0
|
536 where RMS is the root-mean-square amplitude of the input wave,
|
tomwalters@0
|
537 or the portion of the wave or interest, and dBSPL is the
|
tomwalters@0
|
538 desired sound-pressure level scaling (in dB). For
|
tomwalters@0
|
539 example, to scale to 60 dB SPL a wave with an RMS amplitude
|
tomwalters@0
|
540 of 467.3, dB_wave should be set to 52.6.
|
tomwalters@0
|
541 .LP
|
tomwalters@0
|
542 Note: The RMS value of a stored input wave can be calculated using
|
tomwalters@0
|
543 the tools provided with the AIM software.
|
tomwalters@0
|
544
|
tomwalters@0
|
545
|
tomwalters@0
|
546 .LP
|
tomwalters@0
|
547 .RE
|
tomwalters@0
|
548
|
tomwalters@0
|
549 .SH FILES
|
tomwalters@0
|
550 .LP
|
tomwalters@0
|
551 .genwavrc The options file for genwav.
|
tomwalters@0
|
552 .SH SEE ALSO
|
tomwalters@0
|
553 .LP
|
tomwalters@0
|
554 genbmm
|
tomwalters@0
|
555 .SH BUGS
|
tomwalters@0
|
556 .LP
|
tomwalters@0
|
557 .SH COPYRIGHT
|
tomwalters@0
|
558 .LP
|
tomwalters@0
|
559 Copyright (c) Applied Psychology Unit, Medical Research Council, 1995
|
tomwalters@0
|
560 .LP
|
tomwalters@0
|
561 Permission to use, copy, modify, and distribute this software without fee
|
tomwalters@0
|
562 is hereby granted for research purposes, provided that this copyright
|
tomwalters@0
|
563 notice appears in all copies and in all supporting documentation, and that
|
tomwalters@0
|
564 the software is not redistributed for any fee (except for a nominal
|
tomwalters@0
|
565 shipping charge). Anyone wanting to incorporate all or part of this
|
tomwalters@0
|
566 software in a commercial product must obtain a license from the Medical
|
tomwalters@0
|
567 Research Council.
|
tomwalters@0
|
568 .LP
|
tomwalters@0
|
569 The MRC makes no representations about the suitability of this
|
tomwalters@0
|
570 software for any purpose. It is provided "as is" without express or
|
tomwalters@0
|
571 implied warranty.
|
tomwalters@0
|
572 .LP
|
tomwalters@0
|
573 THE MRC DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING
|
tomwalters@0
|
574 ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS, IN NO EVENT SHALL
|
tomwalters@0
|
575 THE A.P.U. BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES
|
tomwalters@0
|
576 OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS,
|
tomwalters@0
|
577 WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION,
|
tomwalters@0
|
578 ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS
|
tomwalters@0
|
579 SOFTWARE.
|
tomwalters@0
|
580 .LP
|
tomwalters@0
|
581 .SH ACKNOWLEDGEMENTS
|
tomwalters@0
|
582 .LP
|
tomwalters@0
|
583 The AIM software was developed for Unix workstations by John
|
tomwalters@0
|
584 Holdsworth and Mike Allerhand of the MRC APU, under the direction of
|
tomwalters@0
|
585 Roy Patterson. The physiological version of AIM was developed by
|
tomwalters@0
|
586 Christian Giguere. The options handler is by Paul Manson. The revised
|
tomwalters@0
|
587 SAI module is by Jay Datta. Michael Akeroyd extended the postscript
|
tomwalters@0
|
588 facilites and developed the xreview routine for auditory image
|
tomwalters@0
|
589 cartoons.
|
tomwalters@0
|
590 .LP
|
tomwalters@0
|
591 The project was supported by the MRC and grants from the U.K. Defense
|
tomwalters@0
|
592 Research Agency, Farnborough (Research Contract 2239); the EEC Esprit
|
tomwalters@0
|
593 BR Porgramme, Project ACTS (3207); and the U.K. Hearing Research Trust.
|
tomwalters@0
|
594
|