pmhd: sonic-annotator-0.7-osx-x86

annotate sonic-annotator-0.7-osx-x86_64/README @ 13:844d341cf643 tip

Back up before ISMIR

author	Yading Song <yading.song@eecs.qmul.ac.uk>
date	Thu, 31 Oct 2013 13:17:06 +0000
parents	f445c3017523
children

rev	line source
yading@11	1
yading@11	2 Sonic Annotator
yading@11	3 ===============
yading@11	4
yading@11	5 Sonic Annotator is a utility program for batch feature extraction from
yading@11	6 audio files. It runs Vamp audio analysis plugins on audio files, and
yading@11	7 can write the result features in a selection of formats.
yading@11	8
yading@11	9 For more information, see
yading@11	10
yading@11	11 http://www.omras2.org/SonicAnnotator
yading@11	12
yading@11	13 More documentation follows further down this README file, after the
yading@11	14 credits.
yading@11	15
yading@11	16
yading@11	17 Credits
yading@11	18 -------
yading@11	19
yading@11	20 Sonic Annotator was developed at the Centre for Digital Music,
yading@11	21 Queen Mary, University of London.
yading@11	22
yading@11	23 http://www.elec.qmul.ac.uk/digitalmusic/
yading@11	24
yading@11	25 The main program is by Mark Levy, Chris Cannam, and Chris Sutton.
yading@11	26 Sonic Annotator incorporates library code from the Sonic Visualiser
yading@11	27 application by Chris Cannam. Code copyright 2005-2007 Chris Cannam,
yading@11	28 copyright 2006-2011 Queen Mary, University of London, except where
yading@11	29 indicated in the individual source files.
yading@11	30
yading@11	31 This work was funded by the Engineering and Physical Sciences Research
yading@11	32 Council through the OMRAS2 project EP/E017614/1.
yading@11	33
yading@11	34 Sonic Annotator is free software; you can redistribute it and/or
yading@11	35 modify it under the terms of the GNU General Public License as
yading@11	36 published by the Free Software Foundation; either version 2 of the
yading@11	37 License, or (at your option) any later version. See the file COPYING
yading@11	38 included with this distribution for more information.
yading@11	39
yading@11	40 Sonic Annotator may also make use of the following libraries:
yading@11	41
yading@11	42 * Qt4 -- Copyright Nokia Corporation, distributed under the GPL
yading@11	43 * Ogg decoder -- Copyright CSIRO Australia, BSD license
yading@11	44 * MAD mp3 decoder -- Copyright Underbit Technologies Inc, GPL
yading@11	45 * libsamplerate -- Copyright Erik de Castro Lopo, GPL
yading@11	46 * libsndfile -- Copyright Erik de Castro Lopo, LGPL
yading@11	47 * FFTW3 -- Copyright Matteo Frigo and MIT, GPL
yading@11	48 * Vamp plugin SDK -- Copyright Chris Cannam, BSD license
yading@11	49 * Redland RDF libraries -- Copyright Dave Beckett and the University of Bristol, LGPL/Apache license
yading@11	50
yading@11	51 (Some distributions of Sonic Annotator may have one or more of these
yading@11	52 libraries statically linked.) Many thanks to their authors.
yading@11	53
yading@11	54 Sonic Annotator can also use QuickTime for audio file import on OS/X.
yading@11	55 For licensing reasons, you may not distribute binaries of Sonic
yading@11	56 Annotator with QuickTime support included for any platform that does
yading@11	57 not include QuickTime as part of the platform itself (see section 3 of
yading@11	58 version 2 of the GNU General Public License).
yading@11	59
yading@11	60
yading@11	61 Compiling Sonic Annotator
yading@11	62 --------------------------
yading@11	63
yading@11	64 If you are planning to compile Sonic Annotator from source code,
yading@11	65 please read the file INSTALL.
yading@11	66
yading@11	67
yading@11	68 A Quick Tutorial
yading@11	69 ================
yading@11	70
yading@11	71 To use Sonic Annotator, you need to tell it three things: what audio
yading@11	72 files to extract features from; what features to extract; and how and
yading@11	73 where to write the results. You can also optionally tell it to
yading@11	74 summarise the features.
yading@11	75
yading@11	76
yading@11	77 1. What audio files to extract features from
yading@11	78
yading@11	79 Sonic Annotator accepts a list of audio files on the command line.
yading@11	80 Any argument that is not understood as a supported command-line option
yading@11	81 will be taken to be the name of an audio file. Any number of files
yading@11	82 may be listed.
yading@11	83
yading@11	84 Several common audio file formats are supported, including MP3, Ogg,
yading@11	85 and a number of PCM formats such as WAV and AIFF. AAC is supported on
yading@11	86 OS/X only, and only if not DRM protected. WMA is not supported.
yading@11	87
yading@11	88 File paths do not have to be local; you can also provide remote HTTP
yading@11	89 or FTP URLs for Sonic Annotator to retrieve.
yading@11	90
yading@11	91 Sonic Annotator also accepts the names of playlist files (.m3u
yading@11	92 extension) and will process every file found in the playlist.
yading@11	93
yading@11	94 Finally, you can provide a local directory path instead of a file,
yading@11	95 together with the -r (recursive) option, for Sonic Annotator to
yading@11	96 process every audio file found in that directory or any of its
yading@11	97 subdirectories.
yading@11	98
yading@11	99
yading@11	100 2. What features to extract
yading@11	101
yading@11	102 Sonic Annotator applies "transforms" to its input audio files, where a
yading@11	103 transform (in this terminology) consists of a Vamp plugin together
yading@11	104 with a certain set of parameters and a specified execution context:
yading@11	105 step and block size, sample rate, etc.
yading@11	106
yading@11	107 (See http://www.vamp-plugins.org/ for more information about Vamp
yading@11	108 plugins.)
yading@11	109
yading@11	110 To use a particular transform, specify its filename on the command
yading@11	111 line with the -t option.
yading@11	112
yading@11	113 Transforms are usually described in RDF, following the transform part
yading@11	114 of the Vamp plugin ontology (http://purl.org/ontology/vamp/). A
yading@11	115 Transform may use any Vamp plugin that is currently installed and
yading@11	116 available on the system. You can obtain a list of available plugin
yading@11	117 outputs by running Sonic Annotator with the -l option, and you can
yading@11	118 obtain a skeleton transform description for one of these plugins with
yading@11	119 the -s option.
yading@11	120
yading@11	121 For example, if the example plugins from the Vamp plugin SDK are
yading@11	122 available and no other plugins are installed, you might have an
yading@11	123 exchange like this:
yading@11	124
yading@11	125 $ sonic-annotator -l
yading@11	126 vamp:vamp-example-plugins:amplitudefollower:amplitude
yading@11	127 vamp:vamp-example-plugins:fixedtempo:acf
yading@11	128 vamp:vamp-example-plugins:fixedtempo:detectionfunction
yading@11	129 vamp:vamp-example-plugins:fixedtempo:filtered_acf
yading@11	130 vamp:vamp-example-plugins:fixedtempo:tempo
yading@11	131 vamp:vamp-example-plugins:fixedtempo:candidates
yading@11	132 vamp:vamp-example-plugins:percussiononsets:detectionfunction
yading@11	133 vamp:vamp-example-plugins:percussiononsets:onsets
yading@11	134 vamp:vamp-example-plugins:powerspectrum:powerspectrum
yading@11	135 vamp:vamp-example-plugins:spectralcentroid:linearcentroid
yading@11	136 vamp:vamp-example-plugins:spectralcentroid:logcentroid
yading@11	137 vamp:vamp-example-plugins:zerocrossing:counts
yading@11	138 vamp:vamp-example-plugins:zerocrossing:zerocrossings
yading@11	139 $ sonic-annotator -s vamp:vamp-example-plugins:fixedtempo:tempo
yading@11	140 @prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
yading@11	141 @prefix vamp: <http://purl.org/ontology/vamp/> .
yading@11	142 @prefix : <#> .
yading@11	143
yading@11	144 :transform a vamp:Transform ;
yading@11	145 vamp:plugin <http://vamp-plugins.org/rdf/plugins/vamp-example-plugins#fixedtempo> ;
yading@11	146 vamp:step_size "64"^^xsd:int ;
yading@11	147 vamp:block_size "256"^^xsd:int ;
yading@11	148 vamp:parameter_binding [
yading@11	149 vamp:parameter [ vamp:identifier "maxbpm" ] ;
yading@11	150 vamp:value "190"^^xsd:float ;
yading@11	151 ] ;
yading@11	152 vamp:parameter_binding [
yading@11	153 vamp:parameter [ vamp:identifier "maxdflen" ] ;
yading@11	154 vamp:value "10"^^xsd:float ;
yading@11	155 ] ;
yading@11	156 vamp:parameter_binding [
yading@11	157 vamp:parameter [ vamp:identifier "minbpm" ] ;
yading@11	158 vamp:value "50"^^xsd:float ;
yading@11	159 ] ;
yading@11	160 vamp:output <http://vamp-plugins.org/rdf/plugins/vamp-example-plugins#fixedtempo_output_tempo> .
yading@11	161 $
yading@11	162
yading@11	163 The output of -s is an RDF/Turtle document describing the default
yading@11	164 settings for the Tempo output of the Fixed Tempo Estimator plugin in
yading@11	165 the Vamp plugin SDK.
yading@11	166
yading@11	167 (The exact format of the RDF printed may differ -- e.g. if the
yading@11	168 plugin's RDF description is not installed and so its "home" URI is not
yading@11	169 known -- but the result should be functionally equivalent to this.)
yading@11	170
yading@11	171 You could run this transform by saving the RDF to a file and
yading@11	172 specifying that file with -t:
yading@11	173
yading@11	174 $ sonic-annotator -s vamp:vamp-example-plugins:fixedtempo:tempo > test.n3
yading@11	175 $ sonic-annotator -t test.n3 audio.wav -w csv --csv-stdout
yading@11	176 (... logging output on stderr, then ...)
yading@11	177 "audio.wav",0.002902494,5.196916099,68.7916,"68.8 bpm"
yading@11	178 $
yading@11	179
yading@11	180 The single line of output above consists of the audio file name, the
yading@11	181 timestamp and duration for a single feature, the value of that feature
yading@11	182 (the estimated tempo of the given region of time from that file, in
yading@11	183 bpm -- the plugin in question performs a single tempo estimation and
yading@11	184 nothing else) and the feature's label.
yading@11	185
yading@11	186 A quicker way to achieve the above is to use the -d (default) option
yading@11	187 to tell Sonic Annotator to use directly the default configuration for
yading@11	188 a named transform:
yading@11	189
yading@11	190 $ sonic-annotator -d vamp:vamp-example-plugins:fixedtempo:tempo audio.wav -w csv --csv-stdout
yading@11	191 (... some log output on stderr, then ...)
yading@11	192 "audio.wav",0.002902494,5.196916099,68.7916,"68.8 bpm"
yading@11	193 $
yading@11	194
yading@11	195 Although handy for experimentation, the -d option is inadvisable in
yading@11	196 any "production" situation because the plugin configuration is not
yading@11	197 guaranteed to be the same each time (for example if an updated version
yading@11	198 of a plugin changes some of its defaults). It's better to save a
yading@11	199 well-defined transform to file and refer to that, even if it is simply
yading@11	200 the transform created by the skeleton option.
yading@11	201
yading@11	202 To run more than one transform on the same audio files, just put more
yading@11	203 than one set of transform RDF descriptions in the same file, or give
yading@11	204 the -t option more than once with separate transform description
yading@11	205 files. Remember that if you want to specify more than one transform
yading@11	206 in the same file, they will need to have distinct URIs (that is, the
yading@11	207 ":transform" part of the example above, which may be any arbitrary
yading@11	208 name, must be distinct for each described transform).
yading@11	209
yading@11	210
yading@11	211 3. How and where to write the results
yading@11	212
yading@11	213 Sonic Annotator supports various different output modules (and it is
yading@11	214 fairly easy for the developer to add new ones). You have to choose at
yading@11	215 least one output module; use the -w (writer) option to do so. Each
yading@11	216 module has its own set of parameters which can be adjusted on the
yading@11	217 command line, as well as its own default rules about where to write
yading@11	218 the results.
yading@11	219
yading@11	220 The following writers are currently supported. (Others exist, but are
yading@11	221 not properly implemented or not supported.)
yading@11	222
yading@11	223 * csv
yading@11	224
yading@11	225 Writes the results into comma-separated data files.
yading@11	226
yading@11	227 One file is created for each transform applied to each input audio
yading@11	228 file, named after the input audio file and transform name with .csv
yading@11	229 suffix and ":" replaced by "_" throughout, placed in the same
yading@11	230 directory as the audio file.
yading@11	231
yading@11	232 To instruct Sonic Annotator to place the output files in another
yading@11	233 location, use --csv-basedir with a directory name.
yading@11	234
yading@11	235 To write a single file with all data in it, use --csv-one-file.
yading@11	236
yading@11	237 To write all data to stdout instead of to a file, use --csv-stdout.
yading@11	238
yading@11	239 Sonic Annotator will not write to an output file that already
yading@11	240 exists. If you want to make it do this, use --csv-force to
yading@11	241 overwrite or --csv-append to append to it.
yading@11	242
yading@11	243 The data generated consists of one line for each result feature,
yading@11	244 containing the feature timestamp, feature duration if present, all
yading@11	245 of the feature's bin values in order, followed by the feature's
yading@11	246 label if present. If the --csv-one-file or --csv-stdout option is
yading@11	247 specified, then an additional column will appear before any of the
yading@11	248 above, containing the audio file name from which the feature was
yading@11	249 extracted, if it differs from that of the previous row.
yading@11	250
yading@11	251 The default column separator is a comma; you can specify a
yading@11	252 different one with the --csv-separator option.
yading@11	253
yading@11	254 * rdf
yading@11	255
yading@11	256 Writes the results into RDF/Turtle documents following the Audio
yading@11	257 Features ontology (http://purl.org/ontology/af/).
yading@11	258
yading@11	259 One file is created for each input audio file containing the
yading@11	260 features extracted by all transforms applied to that file, named
yading@11	261 after the input audio file with .n3 extension, placed in the same
yading@11	262 directory as the audio file.
yading@11	263
yading@11	264 To instruct Sonic Annotator to place the output files in another
yading@11	265 location, use --rdf-basedir with a directory name.
yading@11	266
yading@11	267 To write a single file with all data (from all input audio files)
yading@11	268 in it, use --rdf-one-file.
yading@11	269
yading@11	270 To write one file for each transform applied to each input audio
yading@11	271 file, named after the input audio file and transform name with .n3
yading@11	272 suffix and ":" replaced by "_" throughout, use --rdf-many-files.
yading@11	273
yading@11	274 To write all data to stdout instead of to a file, use --rdf-stdout.
yading@11	275
yading@11	276 Sonic Annotator will not write to an output file that already
yading@11	277 exists. If you want to make it do this, use --rdf-force to
yading@11	278 overwrite or --rdf-append to append to it.
yading@11	279
yading@11	280 Sonic Annotator will use plugin description RDF if available to
yading@11	281 enhance its output (for example identifying note onset times as
yading@11	282 note onset times, if the plugin's RDF says that is what it
yading@11	283 produces, rather than writing them as plain events). Best results
yading@11	284 will be obtained if an RDF document is provided with your plugins
yading@11	285 (for example, vamp-example-plugins.n3) and you have this installed
yading@11	286 in the same location as the plugins. To override this enhanced
yading@11	287 output and write plain events for all features, use --rdf-plain.
yading@11	288
yading@11	289 The output RDF will include an available_as property linking the
yading@11	290 results to the original audio signal URI. By default, this will
yading@11	291 point to the URI of the file or resource containing the audio that
yading@11	292 Sonic Annotator processed, such as the file:/// location on disk.
yading@11	293 To override this, for example to process a local copy of a file
yading@11	294 while generating RDF that describes a copy of it available on a
yading@11	295 network, you can use the --rdf-signal-uri option to specify an
yading@11	296 alternative signal URI.
yading@11	297
yading@11	298
yading@11	299 4. Optionally, how to summarise the features
yading@11	300
yading@11	301 Sonic Annotator can also calculate and write summaries of features,
yading@11	302 such as mean and median values.
yading@11	303
yading@11	304 To obtain a summary as well as the feature results, just use the -S
yading@11	305 option, naming the type of summary you want (min, max, mean, median,
yading@11	306 mode, sum, variance, sd or count). You can also tell it to produce
yading@11	307 only the summary, not the individual features, with --summary-only.
yading@11	308
yading@11	309 Alternatively, you can specify a summary in a transform description.
yading@11	310 The following example tells Sonic Annotator to write both the times of
yading@11	311 note onsets estimated by the simple percussion onset detector example
yading@11	312 plugin, and the variance of the plugin's onset detection function.
yading@11	313 (It will only process the audio file and run the plugin once.)
yading@11	314
yading@11	315 @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>.
yading@11	316 @prefix vamp: <http://purl.org/ontology/vamp/>.
yading@11	317 @prefix examples: <http://vamp-plugins.org/rdf/plugins/vamp-example-plugins#>.
yading@11	318 @prefix : <#>.
yading@11	319
yading@11	320 :transform1 a vamp:Transform;
yading@11	321 vamp:plugin examples:percussiononsets ;
yading@11	322 vamp:output examples:percussiononsets_output_onsets .
yading@11	323
yading@11	324 :transform0 a vamp:Transform;
yading@11	325 vamp:plugin examples:percussiononsets ;
yading@11	326 vamp:output examples:percussiononsets_output_detectionfunction ;
yading@11	327 vamp:summary_type "variance" .
yading@11	328
yading@11	329 Sonic Annotator can also summarise in segments -- if you provide a
yading@11	330 comma-separated list of times as an argument to the --segments option,
yading@11	331 it will calculate one summary for each segment bounded by the times
yading@11	332 you provided. For example,
yading@11	333
yading@11	334 $ sonic-annotator -d vamp:vamp-example-plugins:percussiononsets:detectionfunction -S variance --sumary-only --segments 1,2,3 -w csv --csv-stdout audio.wav
yading@11	335 (... some log output on stderr, then ...)
yading@11	336 ,0.000000000,1.000000000,variance,1723.99,"(variance, continuous-time average)"
yading@11	337 ,1.000000000,1.000000000,variance,1981.75,"(variance, continuous-time average)"
yading@11	338 ,2.000000000,1.000000000,variance,1248.79,"(variance, continuous-time average)"
yading@11	339 ,3.000000000,7.031020407,variance,1030.06,"(variance, continuous-time average)"
yading@11	340
yading@11	341 Here the first row contains a summary covering the time period from 0
yading@11	342 to 1 second, the second from 1 to 2 seconds, the third from 2 to 3
yading@11	343 seconds and the fourth from 3 seconds to the end of the (short) audio
yading@11	344 file.
yading@11	345

Mercurial > hg > pmhd

annotate sonic-annotator-0.7-osx-x86_64/README @ 13:844d341cf643 tip