auditok: quickstart.rst comparison

comparison quickstart.rst @ 23:2beb3fb562f3

doc update

author	Amine Sehili <amine.sehili@gmail.com>
date	Sun, 29 Nov 2015 11:52:56 +0100
parents	6b2cc3ca5b6a
children

-:aceb9bc3d74e
+:2beb3fb562f3
 .. code:: python
 [(['A', 'B', 'C', 'D', 'b', 'b', 'E', 'F', 'c', 'G', 'H', 'I', 'd', 'd'], 3, 16), (['J', 'K', 'e', 'e'], 18, 21)]
-Notice the tailing lower case letters "dd" and "ee" at the end of the two
+Notice the trailing lower case letters "dd" and "ee" at the end of the two
-tokens. The default behavior of `StreamTokenizer` is to keep the *tailing
+tokens. The default behavior of `StreamTokenizer` is to keep the *trailing
 silence* if it doesn't exceed `max_continuous_silence`. This can be changed
-using the `DROP_TAILING_SILENCE` mode (see next example).
+using the `DROP_TRAILING_SILENCE` mode (see next example).
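The keep-or-drop behavior described above can be illustrated without auditok. The following is a minimal pure-Python sketch, not the library's implementation: `tokenize_chars` and its `drop_trailing_silence` flag are hypothetical names, upper case characters stand for valid frames, and lower case characters stand for silence.

```python
def tokenize_chars(data, max_continuous_silence=2, drop_trailing_silence=False):
    """Group characters into (frames, start, end) tokens.

    Hypothetical helper: upper case = valid frame, lower case = silence.
    """
    tokens = []
    current = []   # frames of the token being built
    start = None   # index of the first frame of the current token
    silence = 0    # length of the silent run at the end of `current`

    def finish(frames, first, silent_tail):
        # Optionally cut the silent run that ends the token.
        keep = len(frames) - silent_tail if drop_trailing_silence else len(frames)
        return (frames[:keep], first, first + keep - 1)

    for i, frame in enumerate(data):
        if frame.isupper():
            if start is None:
                start = i
            current.append(frame)
            silence = 0
        elif start is not None:
            if silence < max_continuous_silence:
                # Tolerated silence stays inside the token.
                current.append(frame)
                silence += 1
            else:
                # Too much silence: close the token and wait for activity.
                tokens.append(finish(current, start, silence))
                current, start, silence = [], None, 0

    if start is not None:
        tokens.append(finish(current, start, silence))
    return tokens


kept = tokenize_chars("aaaABCDbbEFcGHIdddJKee")
dropped = tokenize_chars("aaaABCDbbEFcGHIdddJKee", drop_trailing_silence=True)
```

With the default settings the sketch keeps "dd" and "ee" at the end of the two tokens; with `drop_trailing_silence=True` the tokens end at "I" and "K" instead.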
-Remove tailing silence
+Remove trailing silence
 -----------------------
-Tailing silence can be useful for many sound recognition applications, including
+Trailing silence can be useful for many sound recognition applications, including
-speech recognition. Moreover, from the human auditory system point of view, tailing
+speech recognition. Moreover, from the human auditory system point of view, trailing
 low-energy signal helps avoid abrupt signal cuts.
-If you want to remove it anyway, you can do it by setting `mode` to `StreamTokenizer.DROP_TAILING_SILENCE`:
+If you want to remove it anyway, you can do it by setting `mode` to `StreamTokenizer.DROP_TRAILING_SILENCE`:
 .. code:: python
 from auditok import StreamTokenizer, StringDataSource, DataValidator
 return frame.isupper()
 dsource = StringDataSource("aaaABCDbbEFcGHIdddJKee")
 tokenizer = StreamTokenizer(validator=UpperCaseChecker(),
 min_length=1, max_length=9999, max_continuous_silence=2,
-mode=StreamTokenizer.DROP_TAILING_SILENCE)
+mode=StreamTokenizer.DROP_TRAILING_SILENCE)
 tokenizer.tokenize(dsource)
 output:
 player.play(data)
 assert len(tokens) == 6
-Trim leading and tailing silence
+Trim leading and trailing silence
 ---------------------------------
 The tokenizer in the following example is set up to remove the silence
 that precedes the first acoustic activity or follows the last activity
 in a recording. It preserves whatever it finds between the two activities.
-In other words, it removes the leading and tailing silence.
+In other words, it removes the leading and trailing silence.
 The sampling rate is 44100 samples per second, and we'll use an analysis window
 of 100 ms (i.e. block_size == 4410).
 The energy threshold is 50.
 The tokenizer will start accumulating windows from the moment it encounters
 the first analysis window with an energy >= 50. ALL the following windows will be
-kept regardless of their energy. At the end of the analysis, it will drop tailing
+kept regardless of their energy. At the end of the analysis, it will drop trailing
 windows with an energy below 50.
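As a rough illustration of the trimming rule just described, here is a minimal sketch that works on a plain list of per-window energies. It is not auditok code: `trim_silence` and the sample energies are made up for the example, and it ignores any extra start-up conditions the real tokenizer applies.

```python
def trim_silence(energies, threshold=50):
    """Return (first, last) indices of the span to keep, or None.

    Hypothetical helper: the span runs from the first window whose energy
    reaches the threshold to the last one that does; everything in between
    is kept regardless of its energy.
    """
    loud = [i for i, energy in enumerate(energies) if energy >= threshold]
    if not loud:
        return None
    return loud[0], loud[-1]


# Window size used in the text: 100 ms at 44100 samples per second.
SAMPLING_RATE = 44100
WINDOW_MS = 100
BLOCK_SIZE = SAMPLING_RATE * WINDOW_MS // 1000

# Made-up energies: two quiet leading windows, two quiet trailing windows.
energies = [10, 12, 55, 20, 70, 30, 61, 15, 9]
span = trim_silence(energies)
kept = energies[span[0]:span[1] + 1]
```

The windows with energies 20 and 30 stay in `kept` because they lie between two windows above the threshold; only the leading and trailing quiet windows are removed.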
 This is an interesting example because the audio file we're analyzing contains a very
 brief noise that occurs within the leading silence. We certainly do not want our tokenizer
 to stop at this point and consider whatever comes after it as useful signal.
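One way to picture how a brief noise can be rejected is a start-up rule in the spirit of the `init_min` and `init_max_silence` arguments used in the example. The sketch below is a simplification under that assumption, not auditok's actual state machine: `find_token_start` is a hypothetical helper operating on a list of booleans (True = window above the energy threshold).

```python
def find_token_start(valid_flags, init_min=3, init_max_silence=1):
    """Return the index where sustained activity begins, or None.

    Simplified, hypothetical rule: a candidate start is confirmed once
    `init_min` valid windows have been gathered, and abandoned if more than
    `init_max_silence` consecutive non-valid windows occur before that.
    """
    start = None   # index of the candidate first window
    count = 0      # valid windows gathered since `start`
    silence = 0    # consecutive non-valid windows since the last valid one

    for i, is_valid in enumerate(valid_flags):
        if is_valid:
            if start is None:
                start = i
            count += 1
            silence = 0
            if count >= init_min:
                return start
        elif start is not None:
            silence += 1
            if silence > init_max_silence:
                # The candidate was only a brief noise: forget it.
                start, count, silence = None, 0, 0
    return None


# A single valid window (brief noise) at index 2, real activity from index 6.
flags = [False, False, True, False, False, False, True, True, True, False]
first = find_token_start(flags)
```

Here the isolated window at index 2 is discarded and `first` points at index 6, where three valid windows follow one another.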
 .. code:: python
 from auditok import ADSFactory, AudioEnergyValidator, StreamTokenizer, player_for, dataset
 # record = True so that we'll be able to rewind the source.
-asource = ADSFactory.ads(filename=dataset.was_der_mensch_saet_mono_44100_lead_tail_silence,
+asource = ADSFactory.ads(filename=dataset.was_der_mensch_saet_mono_44100_lead_trail_silence,
 record=True, block_size=4410)
 asource.open()
 original_signal = []
 # Read the whole signal
 # Create a validator with an energy threshold of 50
 validator = AudioEnergyValidator(sample_width=asource.get_sample_width(), energy_threshold=50)
 # Create a tokenizer with an unlimited token length and continuous silence within a token
-# Note the DROP_TAILING_SILENCE mode that will ensure removing tailing silence
+# Note the DROP_TRAILING_SILENCE mode that will ensure removing trailing silence
-trimmer = StreamTokenizer(validator, min_length = 20, max_length=99999999, init_min=3, init_max_silence=1, max_continuous_silence=9999999, mode=StreamTokenizer.DROP_TAILING_SILENCE)
+trimmer = StreamTokenizer(validator, min_length = 20, max_length=99999999, init_min=3, init_max_silence=1, max_continuous_silence=9999999, mode=StreamTokenizer.DROP_TRAILING_SILENCE)
 tokens = trimmer.tokenize(asource)
 # Make sure we only have one token
 trimmed_signal = ''.join(tokens[0][0])
 player = player_for(asource)
-print("Playing original signal (with leading and tailing silence)...")
+print("Playing original signal (with leading and trailing silence)...")
 player.play(original_signal)
 print("Playing trimmed signal...")
 player.play(trimmed_signal)
