Matthias Mauch, 2012-11-05 03:10 PM

h1. Wiki

h2. Specification

The Tony tool will be a very simple user interface for the exact annotation of notes, note pitches and performance in recordings of human singing and of monophonic instruments.

h3. Users

We expect users to be unfamiliar with programming but familiar with music and with research in general. To illustrate, here are some possible users:

# Friedrich (56) is a musicologist at a university in northern Germany. Sample publication: "Zur Ästhetik der Stimme bei Wagner - Eine vergleichende Analyse historischer und moderner Aufnahmen". He uses a Windows Vista laptop, mainly for email correspondence and for keeping lists of audio recordings in an Excel spreadsheet. He listens to music from CDs, noting down singing characteristics on a paper notepad. He plays the cello and organises outings to the Berlin Philharmonic with his seminar students.
# Sam (27) is an ethnomusicologist studying the music of Brazilian immigrants in Portugal. She records the immigrants' music on location in Lisbon and compares the rhythmic and 
# Peter (23) is part of a 

h3. Components

* automatic pitch and note transcription methods
* GUI for correction, and additional note-based or phrase-based annotations
* export to RDF and graphics
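The automatic pitch transcription component could be sketched roughly as follows. This is a simplified, illustrative Yin-style frame analysis, not the Tony implementation: the names @PitchFrame@ and @yin_frame@, the 0.15 threshold, and the definition of salience as one minus the normalised difference are all assumptions made for this example. It illustrates the extended outputs asked for under signal processing in the feature list: a frequency estimate that is reported even when no pitch is confidently detected, a voiced flag, and a per-lag salience curve.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class PitchFrame:
    frequency_hz: float    # always reported, even for unvoiced frames
    voiced: bool           # True if a dip fell below the threshold
    salience: List[float]  # assumed: 1 - normalised difference, index = lag


def yin_frame(samples: Sequence[float], sample_rate: float,
              max_lag: int, threshold: float = 0.15) -> PitchFrame:
    """Analyse one frame with a simplified Yin-style difference function."""
    window = len(samples) - max_lag
    if window <= 0:
        raise ValueError("frame must be longer than max_lag")

    # Step 1: squared difference for each candidate lag.
    diff = [0.0] * (max_lag + 1)
    for lag in range(1, max_lag + 1):
        diff[lag] = sum((samples[j] - samples[j + lag]) ** 2
                        for j in range(window))

    # Step 2: cumulative mean normalised difference (1.0 at lag 0).
    norm = [1.0] * (max_lag + 1)
    running = 0.0
    for lag in range(1, max_lag + 1):
        running += diff[lag]
        norm[lag] = diff[lag] * lag / running if running > 0 else 1.0

    # Step 3: first dip below the threshold, followed to its local minimum.
    best = None
    for lag in range(1, max_lag + 1):
        if norm[lag] < threshold:
            while lag < max_lag and norm[lag + 1] < norm[lag]:
                lag += 1
            best = lag
            break

    voiced = best is not None
    if best is None:
        # No confident pitch: still report the lag with the deepest dip.
        best = min(range(1, max_lag + 1), key=lambda k: norm[k])

    salience = [1.0 - value for value in norm]  # may be negative
    return PitchFrame(sample_rate / best, voiced, salience)


# Usage: a 220 Hz sine sampled at 8 kHz (hypothetical test signal).
sr = 8000
tone = [math.sin(2 * math.pi * 220 * n / sr) for n in range(600)]
frame = yin_frame(tone, sr, max_lag=200)
print(frame.voiced, round(frame.frequency_hz, 1))
```

Because the sketch uses integer lags without interpolation, the estimate for the sine is only approximately 220 Hz. Returning the deepest dip for unvoiced frames is one possible way to provide a frequency estimate where Yin finds no pitch.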

h3. GUI

The most important requirement for the GUI is simplicity, because the users are unlikely to be familiar with complicated programs.

* User A: German musicologist
** is used to Windows 95 but recently acquired a laptop with Windows 7 installed
** uses Word to write papers and Outlook for email, but no other computer programs
* User B:

# Signal processing: mainly pitch extraction via Yin, but with more outputs than the original Yin, in particular a frequency estimate even where Yin detects no pitch, and the underlying salience function.
# Note alignment: align notes, given a pitch file.
# Note detection: this may be implemented as a variant of the HMM I made, with three states per output.
# Interactive GUI with these key features:
#* Scrolling/zooming
#** scroll through piece horizontally (in time) and vertically (in pitch)
#** vertical zoom
#* Note editing
#** edit note start and end times
#** edit continuous note pitch
#** choose note pitch estimate (mean, median, manual)
#** turn note to rest (unpitched)
#** lock notes (i.e. protect them from re-estimation)
#* Audio playback
#** play back original audio
#** sonify notes
#** loop around current note
#* Undo note and pitch track changes
#* Export notes and pitches
# Further possible GUI features
#* automatic re-estimation after edits (e.g. some form of maximum a posteriori estimation)
#* export as MIDI
# Explicitly unwanted features
#* delete note (notes can only be turned into rests, since we assume a contiguous, monophonic melody)
#* horizontal zoom
#* polyphonic processing