annotate publications/sempre2014/mauch_sempre2014_abstract.txt @ 203:833374f16e93 v0.2

0.2
author Chris Cannam
date Wed, 05 Mar 2014 18:11:07 +0000
parents 2eb71b65e4ef
children
rev   line source
matthiasm@166 1 Paper title.
gyorgyf@176 2 Matthias Mauch, Chris Cannam and György Fazekas: Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications
matthiasm@166 3
matthiasm@166 4 Abstract.
matthiasm@179 5 We present **Tony**, a free, open-source software tool for computer-aided pitch track and note annotation of melodic audio content. The accurate annotation of fundamental frequencies and notes is essential to the scientific study of intonation in singing and other instruments. Unlike commercial applications for singers and producers or other academic tools for generic music annotation and visualisation **Tony** has been designed for the scientific study of monophonic music: a) it implements state-of-the art algorithms for pitch and note estimation from audio, b) it provides visual and auditory feedback of the extracted pitches for the identification of detection errors, c) it provides an intelligent graphical user interface through which the user can identify and rapidly correct estimation errors, d) it provides functions for exporting pitch track and note track enabling further processing in spreadsheets or other applications. Software versions for Windows, OSX and Linux platforms can be downloaded from http://code.soundsoftware.ac.uk/projects/tony
matthiasm@166 6
matthiasm@166 7 Keyword 1.
matthiasm@172 8 Pitch/Note Analysis
matthiasm@166 9
matthiasm@166 10 Keyword 2.
matthiasm@172 11 Software
matthiasm@166 12
matthiasm@166 13 Keyword 3.
matthiasm@172 14 Singing.
matthiasm@166 15
matthiasm@166 16 Aims.
gyorgyf@176 17
matthiasm@179 18 Our goal is to make the annotation of melodic content for scientific analysis more efficient. Music psychologists interested in the analysis of pitch and intonation usually use software programs originally aimed at the analysis of speech (e.g. Praat http://www.fon.hum.uva.nl/praat/) or generic audio annotation tools (e.g. Sonic Visualiser http://www.sonicvisualiser.org/) to extract pitches of notes from audio recordings. Since these programs were not conceived for musical pitch analysis, the process of extracting note frequencies remains laborious and can take many times the duration of the recording. Commercial tools such as Melodyne (http://www.celemony.com/), Songs2See (http://www.songs2see.com/) or Sing&See (http://www.singandsee.com/) also exists for these purposes. However, their frequency estimation procedures are typically not public (proprietary code), and they do not provide export formats well suited for scientific analysis. A pitch annotation system [1] developed for scientific use exists, but it does not feature note extraction, and it is not available as free and open-source software. This is why, during research on intonation [2], we decided to develop our own tool that would avoid the above shortcomings.
matthiasm@166 19
matthiasm@166 20 Methods.
matthiasm@179 21
matthiasm@179 22 For automatic pitch and note estimation we use the pYIN method [3]. The method provides precise pitch and note estimates and automatically determines which parts of the recording are voiced. The graphical user interface is based upon open source software libraries originally developed for the Sonic Visualiser software. It shows the audio waveform, a spectrogram representation, the pitch track and notes. Users can scroll and zoom in time. **Tony** does not only play back the original audio, but also, optionally, sonifications of the pitch track (melody line) and the note track (discrete pitches with durations). Notes' pitches are estimated using the median of the pitch track corresponding to the time extent (duration) of the note. The user can delete, move, cut, merge, crop and extend notes, and the notes’ frequency is adapted accordingly. The user can delete spurious parts of the pitch track and shift the pitch track in frequency. In order to efficiently correct erroneous pitch tracks, the user can select a time interval, and **Tony** will provide various alternative pitch tracks. The user can then pick the correct one.
matthiasm@166 23
matthiasm@166 24 Outcomes.
matthiasm@179 25
matthiasm@179 26 The system is currently used for two projects: 1) the generation of new training and test data for Music Informatics research, and 2) a research project on intonation in unaccompanied solo singing. Preliminary feedback by the users suggests that the system does indeed facilitate pitch annotation and provides vital features that cannot be found in other tools.
matthiasm@166 27
matthiasm@166 28 Title for final section.
matthiasm@166 29 Conclusions
matthiasm@166 30
matthiasm@166 31 [Q37].
matthiasm@179 32 We presented **Tony**, a new software tool for computer-assisted annotation of melodic audio content for scientific analysis. No other existing program combines pitch and note estimation, a graphical user interface with auditory feedback, rapid, computer-aided correction of pitches and extensive exporting facilities. **Tony** is freely available for use on Windows, OSX and Linux platforms from http://code.soundsoftware.ac.uk/projects/tony/.
matthiasm@166 33
matthiasm@179 34 Acknowledgements.
matthiasm@179 35
matthiasm@179 36 Matthias Mauch is funded by the Royal Academy of Engineering. We would like to thank Justin Salamon, Rachel Bittner and Juan Bello for their comments and coding help.
matthiasm@166 37
matthiasm@166 38 Three key references. (APA v6)
matthiasm@172 39 [1] Pant, S., Rao, V., & Rao, P. (2010). A melody detection user interface for polyphonic music. 2010 National Conference On Communications (NCC), 2010.
matthiasm@172 40 [2] Mauch, M., Frieler, K., & Dixon, S. (under review). Intonation in Unaccompanied Singing: Accuracy, Drift and a Model of Intonation Memory.
matthiasm@172 41 [3] Mauch, M., & Dixon, S. (2014). pYIN : a Fundamental Frequency Estimator Using Probabilistic Threshold Distributions. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014).
matthiasm@166 42
matthiasm@166 43 Comments/queries to organisers.
matthiasm@166 44