Chris Cannam, 2015-04-28 05:33 PM

h1. Preprocessing options

h2. Implemented

h4. Flatten dynamics

Integrating the "Flatten Dynamics plugin":/projects/flattendynamics code

h4. De-envelope

Using cepstral processing to factor out the spectral envelope, after "Simple Cepstrum":/projects/vamp-simple-cepstrum
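
A minimal sketch of the idea, not taken from the Simple Cepstrum plugin: take the log magnitude spectrum, transform it to the cepstral domain, keep only the low-quefrency coefficients as an estimate of the envelope, and subtract that estimate from the log spectrum. The function name @deEnvelope@, the @cutoff@ parameter and the naive quadratic-time transform are assumptions made for illustration; a real implementation would presumably use an FFT.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper, for illustration only.
// mag:    full-length magnitude spectrum of a real signal (N bins, so that
//         mag[k] == mag[N - k]); this symmetry is assumed, not checked.
// cutoff: number of low-quefrency cepstral bins treated as the envelope.
// Returns the magnitude spectrum with the estimated envelope divided out.
std::vector<double> deEnvelope(const std::vector<double> &mag, std::size_t cutoff)
{
    const std::size_t n = mag.size();
    assert(n > 0 && cutoff <= n / 2);
    const double pi = std::acos(-1.0);
    const double eps = 1e-10; // keeps log() finite for empty bins

    // Cosine term of the DFT kernel; the phase index is reduced modulo n.
    auto kernel = [&](std::size_t a, std::size_t b) {
        return std::cos(2.0 * pi * double((a * b) % n) / double(n));
    };

    std::vector<double> logMag(n);
    for (std::size_t k = 0; k < n; ++k) logMag[k] = std::log(mag[k] + eps);

    // Real cepstrum: inverse DFT of the log magnitude. Only cosine terms are
    // needed because the log spectrum is real and (by assumption) even.
    std::vector<double> cep(n, 0.0);
    for (std::size_t q = 0; q < n; ++q) {
        for (std::size_t k = 0; k < n; ++k) cep[q] += logMag[k] * kernel(k, q);
        cep[q] /= double(n);
    }

    // Lifter: keep low quefrencies (and their mirror images), zero the rest.
    for (std::size_t q = 0; q < n; ++q) {
        const bool low = (q < cutoff) || (q > n - cutoff);
        if (!low) cep[q] = 0.0;
    }

    // Forward DFT of the liftered cepstrum gives the log envelope; subtracting
    // it in the log domain divides the envelope out of the spectrum.
    std::vector<double> out(n);
    for (std::size_t k = 0; k < n; ++k) {
        double logEnv = 0.0;
        for (std::size_t q = 0; q < n; ++q) logEnv += cep[q] * kernel(k, q);
        out[k] = std::exp(logMag[k] - logEnv);
    }
    return out;
}
```

With @cutoff@ set to 2, a spectrum whose log magnitude is a single slow cosine comes out flat, because that shape lies entirely in the retained low-quefrency bins.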

h4. Remove broadband noise

Like the initial noise-removal phase of "Silvet":/projects/silvet, which runs noise removal on the constant-Q spectrum. I expect the method would work unchanged on an STFT as well.
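
As an illustration of the general idea only (this is not taken from the Silvet source), one common way to remove a broadband noise floor from a single spectral frame is to estimate the floor with a median filter along the frequency axis and subtract it, clamping at zero. The function name @removeNoiseFloor@ and the @halfWidth@ parameter are hypothetical. Nothing in the sketch depends on whether the bins come from a constant-Q transform or an STFT.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical helper, for illustration only.
// frame:     magnitudes of one spectral frame (constant-Q or STFT bins).
// halfWidth: number of neighbouring bins on each side used for the estimate.
// Returns the frame with a median-based noise floor subtracted.
std::vector<double> removeNoiseFloor(const std::vector<double> &frame,
                                     std::size_t halfWidth)
{
    const std::size_t n = frame.size();
    std::vector<double> out(n);
    for (std::size_t i = 0; i < n; ++i) {
        // Window of bins around i, truncated at the edges of the frame.
        const std::size_t lo = (i > halfWidth) ? i - halfWidth : 0;
        const std::size_t hi = std::min(n, i + halfWidth + 1);
        std::vector<double> window(
            frame.begin() + static_cast<std::ptrdiff_t>(lo),
            frame.begin() + static_cast<std::ptrdiff_t>(hi));

        // Median of the window (upper median when the window size is even).
        auto mid = window.begin() + static_cast<std::ptrdiff_t>(window.size() / 2);
        std::nth_element(window.begin(), mid, window.end());

        // Subtract the local noise estimate and clamp at zero.
        out[i] = std::max(0.0, frame[i] - *mid);
    }
    return out;
}
```

A narrow peak survives because it barely shifts the median of its neighbourhood, while a flat floor is reduced to zero.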

h4. Add broadband noise

Surprisingly, this turned out to have a potential application in MATCH: adding -30 dB white noise to the inputs gave a more stable alignment
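
A minimal sketch of the noise addition, assuming the -30 dB figure is an RMS level relative to full scale (amplitude 1.0); the text does not say what the level is measured against. The function name @addWhiteNoise@, the choice of Gaussian noise and the fixed seed are likewise assumptions.

```cpp
#include <cmath>
#include <cstdint>
#include <random>
#include <vector>

// Hypothetical helper, for illustration only.
// Adds Gaussian white noise to time-domain samples in place.
// levelDb is the noise RMS in dB relative to full scale (assumed to be 1.0),
// so -30 dB corresponds to an RMS of about 0.0316.
void addWhiteNoise(std::vector<float> &samples, double levelDb,
                   std::uint32_t seed = 1)
{
    const double rms = std::pow(10.0, levelDb / 20.0);
    std::mt19937 rng(seed);
    // For zero-mean Gaussian noise the standard deviation equals the RMS.
    std::normal_distribution<double> gauss(0.0, rms);
    for (float &s : samples) {
        s += static_cast<float>(gauss(rng));
    }
}
```

Calling @addWhiteNoise(buffer, -30.0)@ on each input before alignment would correspond to the experiment described above, under the stated assumption about the reference level.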

h2. Not (yet) implemented

h4. Remove pitched sounds

For symmetry with "Remove broadband noise", something that filters out stable pitched sinusoidal or harmonic content in the manner of the "Devuvuzelator":/projects/devuvuzelator (does this have any application?)

h4. Other spectral whitenings?

h4. Other aspects of harmonic/percussive separation?

h2. Processing order

We should have a single plugin / embeddable C++ class / application, with a list of on/off toggles for the available preprocessing options.

If more than one option is selected, the preprocessors should be applied in a fixed order:

# De-envelope
# Remove broadband noise
# (Remove pitched sounds)
# Flatten dynamics
# Add broadband noise

Steps 1-3 are carried out in the frequency domain, and the remaining two in the time domain.
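
The toggles and the fixed ordering above could be sketched as follows. All names here (@PreprocessOptions@, @Stage@, @plannedStages@) are hypothetical, and the sketch only works out which stages run and in which order; it does not implement any of the processing.

```cpp
#include <string>
#include <vector>

// Hypothetical option set: one on/off toggle per preprocessing option.
struct PreprocessOptions {
    bool deEnvelope = false;
    bool removeBroadbandNoise = false;
    bool removePitchedSounds = false; // listed as not (yet) implemented
    bool flattenDynamics = false;
    bool addBroadbandNoise = false;
};

enum class Domain { Frequency, Time };

struct Stage {
    std::string name;
    Domain domain;
};

// Returns the enabled stages in the fixed order given in the list above,
// regardless of the order in which the toggles were set.
std::vector<Stage> plannedStages(const PreprocessOptions &o)
{
    std::vector<Stage> stages;
    if (o.deEnvelope)           stages.push_back({"De-envelope", Domain::Frequency});
    if (o.removeBroadbandNoise) stages.push_back({"Remove broadband noise", Domain::Frequency});
    if (o.removePitchedSounds)  stages.push_back({"Remove pitched sounds", Domain::Frequency});
    if (o.flattenDynamics)      stages.push_back({"Flatten dynamics", Domain::Time});
    if (o.addBroadbandNoise)    stages.push_back({"Add broadband noise", Domain::Time});
    return stages;
}
```

Because every frequency-domain stage precedes every time-domain stage in this order, a host built this way would need at most one conversion from the frequency domain back to the time domain per block.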