Notes on first meeting » History » Version 32

Gyorgy Fazekas, 2012-02-24 12:58 PM

1 6 Gyorgy Fazekas
These notes are based on our initial meeting on 24 January 2012. The aim was to collect some use cases and have an initial idea on what needs to be done to extend or revise the existing Audio Features Ontology.
2 1 Gyorgy Fazekas
3 14 Gyorgy Fazekas
h1. Topics discussed 
4 1 Gyorgy Fazekas
5 14 Gyorgy Fazekas
A rough list of topics discussed during the first meeting:
6 14 Gyorgy Fazekas
7 1 Gyorgy Fazekas
** What are the main research use cases for an Audio Features Ontology (AF) ?
8 2 Gyorgy Fazekas
** Are they served well by the existing AF ? 
9 1 Gyorgy Fazekas
** If not, what are the most important extensions we need to do?
10 1 Gyorgy Fazekas
** Does the fundamental structure of the ontology need to be changed?
11 1 Gyorgy Fazekas
** What is the relation of AF to existing software, including:
12 10 Gyorgy Fazekas
13 8 Gyorgy Fazekas
 * software like: Sonic Annotator, Sonic Visualiser, SAWA, AudioDB other tools... 
14 1 Gyorgy Fazekas
 * and projects like: OMRAS2, EASAIER, SALAMI, new Semantic Media/Semantic Audio grants...
15 7 Gyorgy Fazekas
** Personal Objectives: what are we going to do with a modified/re-engineered ontology?
16 1 Gyorgy Fazekas
17 2 Gyorgy Fazekas
18 4 Gyorgy Fazekas
h1. Use cases:
19 1 Gyorgy Fazekas
20 12 Gyorgy Fazekas
Use cases discussed so far:
21 12 Gyorgy Fazekas
22 12 Gyorgy Fazekas
+Thomas:+
23 1 Gyorgy Fazekas
  
24 1 Gyorgy Fazekas
** drive audio effects -> adaptive effect (controlling effects)
25 1 Gyorgy Fazekas
** KM like use case: association of audio effects and audio features e.g. pitch shifter won’t change onsets
26 1 Gyorgy Fazekas
** part of the AFX ontology
27 1 Gyorgy Fazekas
** more audio features
28 1 Gyorgy Fazekas
** technical classification of audio effects
29 1 Gyorgy Fazekas
30 1 Gyorgy Fazekas
31 12 Gyorgy Fazekas
+Steve:+
32 1 Gyorgy Fazekas
** Finding structure, repeated sequences of features
33 28 Gyorgy Fazekas
** Beat related stuff, BPM (tempo, major/minor is it an audio feature)
34 28 Gyorgy Fazekas
** Chords, Chord sequences => Chord ontology
35 1 Gyorgy Fazekas
** Melody and notes
36 1 Gyorgy Fazekas
37 1 Gyorgy Fazekas
38 12 Gyorgy Fazekas
+George:+
39 1 Gyorgy Fazekas
** Improve SAWA
40 1 Gyorgy Fazekas
** Facilitate the development of intelligent music production systems
41 1 Gyorgy Fazekas
** Release large content based metadata repositories in RDF
42 1 Gyorgy Fazekas
** Re-release the MSD in RDF (??)
43 1 Gyorgy Fazekas
** Deploy a knowledge based environment for content-based audio analysis based on the concept of the Knowledge Machine that can combine multiple modalities
44 1 Gyorgy Fazekas
** Research reproducibility using Ontologies as a model to exchange research data.
45 1 Gyorgy Fazekas
46 6 Gyorgy Fazekas
47 18 Gyorgy Fazekas
h1. Fundamental structure of the existing AF Ontology:
48 18 Gyorgy Fazekas
49 18 Gyorgy Fazekas
The Audio Features Ontology currently provides a core model which distinguishes between audio features based on two attributes:
50 18 Gyorgy Fazekas
51 18 Gyorgy Fazekas
# Temporal characteristics
52 18 Gyorgy Fazekas
# Data density
53 18 Gyorgy Fazekas
54 18 Gyorgy Fazekas
The first dichotomy allows for describing features either instantaneous events (e.g. note onsets, tempo change), or features with a known time duration (notes, structural segments, harmonic segments, the extent of an STFT or Chromagram frame).
55 18 Gyorgy Fazekas
56 18 Gyorgy Fazekas
The second dichotomy addresses a representational issue, and allows for describing how a feature relates to the extent of an audio file: 
57 18 Gyorgy Fazekas
** whether it is scattered and irregularly occurs during the course of a track (i.e. sparse),
58 18 Gyorgy Fazekas
** or occurs regularly and have a fixed duration (i.e. dense).
59 19 Gyorgy Fazekas
60 18 Gyorgy Fazekas
Alternative conceptualisations and some examples are summarised below:
61 18 Gyorgy Fazekas
62 18 Gyorgy Fazekas
!http://isophonics.net/sites/isophonics.net/files/FeatureConceptualisations.png!
63 30 Gyorgy Fazekas
*Fig 1.* Conceptualisations of content-based features. [maybe commit my relevant thesis chapter]
64 18 Gyorgy Fazekas
65 18 Gyorgy Fazekas
The main scope of the existing ontology is to provide a framework for communication and to describe the association of features and audio signals. It does not classify features, describe their interrelationships or their computation. It deals with data density, and temporal characteristics only and differentiates between dense signal-like features of various dimensionality, (chromagrams, detection functions) and sparse features that are scattered across the signal timeline. This core model is shown in the following diagram:
66 18 Gyorgy Fazekas
67 18 Gyorgy Fazekas
!http://isophonics.net/sites/isophonics.net/files/AF_ontology_small.png!
68 18 Gyorgy Fazekas
*Fig. 2.* Core model of the existing Audio Features Ontology
69 18 Gyorgy Fazekas
70 25 Gyorgy Fazekas
h2. RDF Example:
71 1 Gyorgy Fazekas
72 26 Gyorgy Fazekas
AF heavily relies on the event and timeline ontologies to refer to event in time and timelines corresponding to the duration of an audio signal or a dense signal-like feature. Here's an RDF example produced by SAWA/Sonic Annotator describing temporal segments using the ontology:
73 25 Gyorgy Fazekas
74 23 Gyorgy Fazekas
<pre>
75 23 Gyorgy Fazekas
<http://isophonics.net/sawa/audiofile/temp/AU775621fe> a mo:AudioFile ;
76 21 Gyorgy Fazekas
    dc:title """music-test.wav""" ;
77 21 Gyorgy Fazekas
    mo:encodes :signal_1.
78 21 Gyorgy Fazekas
79 21 Gyorgy Fazekas
:signal_1 a mo:Signal ;
80 21 Gyorgy Fazekas
    mo:time [
81 21 Gyorgy Fazekas
        a tl:Interval ;
82 21 Gyorgy Fazekas
        tl:onTimeLine :signal_timeline_1
83 21 Gyorgy Fazekas
    ] .
84 21 Gyorgy Fazekas
85 21 Gyorgy Fazekas
:signal_timeline_1 a tl:Timeline .
86 21 Gyorgy Fazekas
87 21 Gyorgy Fazekas
:event_2 a <http://purl.org/ontology/af/StructuralSegment> ;
88 21 Gyorgy Fazekas
    event:time [ 
89 21 Gyorgy Fazekas
        a tl:Interval ;
90 21 Gyorgy Fazekas
        tl:onTimeLine :signal_timeline_1 ;
91 21 Gyorgy Fazekas
        tl:at "PT19.600000000S"^^xsd:duration ;
92 21 Gyorgy Fazekas
        tl:duration "PT10.500000000S"^^xsd:duration ;
93 21 Gyorgy Fazekas
    ] ;
94 1 Gyorgy Fazekas
    af:feature "9" .
95 23 Gyorgy Fazekas
</pre>
96 26 Gyorgy Fazekas
97 21 Gyorgy Fazekas
98 1 Gyorgy Fazekas
h1. Open issues:
99 6 Gyorgy Fazekas
100 1 Gyorgy Fazekas
Some important questions to be decided on:
101 12 Gyorgy Fazekas
102 1 Gyorgy Fazekas
h2. Domain boundaries and scope:
103 1 Gyorgy Fazekas
104 27 Gyorgy Fazekas
** What is the ideal domain a revised AF?
105 19 Gyorgy Fazekas
106 19 Gyorgy Fazekas
 * Are Musicological concepts outside the domain of an AF ?
107 31 Gyorgy Fazekas
 * Should the currently included speech features moved to another ontology?
108 31 Gyorgy Fazekas
 ** They were added only to satisfy EASAIER requirements.
109 19 Gyorgy Fazekas
 * How about Physical features: 
110 19 Gyorgy Fazekas
 ** Acoustic features, 
111 19 Gyorgy Fazekas
 ** Perceptual Features, 
112 19 Gyorgy Fazekas
 ** DSP type feature, 
113 19 Gyorgy Fazekas
 ** Musical Features (musically meaningful features related to acoustics) 
114 1 Gyorgy Fazekas
115 11 Gyorgy Fazekas
** The scope of the revised ontology may be:
116 1 Gyorgy Fazekas
117 1 Gyorgy Fazekas
 * Facilitate data-exchange for various purposes: (e.g. Linked Open Data, Research reproducibility, etc...)
118 17 Gyorgy Fazekas
 * Facilitate building intelligent/knowledge-based systems:
119 17 Gyorgy Fazekas
 ** How expressive the Ontology should be?
120 6 Gyorgy Fazekas
 ** What kind of reasoning services should be supported?
121 17 Gyorgy Fazekas
122 6 Gyorgy Fazekas
h2. What are the strength and weaknesses of the existing ontology?
123 17 Gyorgy Fazekas
124 20 Gyorgy Fazekas
* Does it serve us well?
125 20 Gyorgy Fazekas
* For example, loudness is defined as a segment in AF, and it does not fit a perceptual attribute well.
126 20 Gyorgy Fazekas
* What depth do we want ? (both in terms of scope and the level of detail) 
127 20 Gyorgy Fazekas
** do we want to describe feature extraction workflows using this or another ontology
128 20 Gyorgy Fazekas
* How AF relates to the DSP workflows used when extracting them?
129 1 Gyorgy Fazekas
130 1 Gyorgy Fazekas
131 18 Gyorgy Fazekas
h1. Existing resources :
132 2 Gyorgy Fazekas
133 18 Gyorgy Fazekas
h2. Some work related to Steve's use cases, segmentation and Ontologies:
134 2 Gyorgy Fazekas
135 2 Gyorgy Fazekas
** SALAMI Project: Kevin Page, DaveDeRoure http://salami.music.mcgill.ca/
136 2 Gyorgy Fazekas
** The Segment Ontology: http://users.ox.ac.uk/~oerc0033/preprints/admire2011.pdf
137 2 Gyorgy Fazekas
** PopStructure Ontology: Kurt Jacobson Unpublished. 
138 2 Gyorgy Fazekas
(Example available: http://wiki.musicontology.com/index.php/Structural_annotations_of_%22Can%27t_buy_me_love%22_by_the_Beatles) 
139 1 Gyorgy Fazekas
** Similarity Ontology: Kurt Jacobson http://grasstunes.net/ontology/musim/musim.html
140 1 Gyorgy Fazekas
141 2 Gyorgy Fazekas
142 18 Gyorgy Fazekas
h1. Ideas/resources for new Ontologies:
143 2 Gyorgy Fazekas
144 1 Gyorgy Fazekas
** Steve has worked on Acoustics related ontology
145 2 Gyorgy Fazekas
146 24 Gyorgy Fazekas
* Creating a DSP ontology:
147 24 Gyorgy Fazekas
* include processing steps down to math operations 
148 24 Gyorgy Fazekas
** this can take advantage to the log and math:namespaces in CWM: 
149 24 Gyorgy Fazekas
*** http://www.w3.org/DesignIssues/Notation3.html
150 24 Gyorgy Fazekas
*** http://markmail.org/download.xqy?id=6xj4qlauo442ymme&number=2
151 24 Gyorgy Fazekas
* describe common DSP parameters
152 1 Gyorgy Fazekas
153 24 Gyorgy Fazekas
* create an Acoustics Ontology
154 24 Gyorgy Fazekas
* describe Musicological concepts
155 24 Gyorgy Fazekas
* describe concepts related to cognitive and perceptual issues
156 1 Gyorgy Fazekas
157 1 Gyorgy Fazekas
158 2 Gyorgy Fazekas
h2. Currently missing features
159 1 Gyorgy Fazekas
160 1 Gyorgy Fazekas
* MFCC-s 
161 30 Gyorgy Fazekas
** the problem here is that the definition of MFCC is slightly ambiguous in the literature, 
162 30 Gyorgy Fazekas
** this is a case for describing the workflow of computation...
163 30 Gyorgy Fazekas
* Rythmogram, Beatspectrum etc...
164 30 Gyorgy Fazekas
* combined/composite features 
165 30 Gyorgy Fazekas
** weighted combinations or 
166 30 Gyorgy Fazekas
** statistical averages over features (maybe only relevant if separately defined somewhere...)
167 30 Gyorgy Fazekas
* RMS energy (still not included?)
168 31 Gyorgy Fazekas
169 31 Gyorgy Fazekas
This list needs to be extended based on thorough literature review.
170 1 Gyorgy Fazekas
171 1 Gyorgy Fazekas
172 1 Gyorgy Fazekas
h2. Development issues
173 1 Gyorgy Fazekas
174 2 Gyorgy Fazekas
** chaining, combination, weighting
175 1 Gyorgy Fazekas
** how you associate features with arbitrary data
176 2 Gyorgy Fazekas
** summary feature types 
177 1 Gyorgy Fazekas
** SM (similarity matrix) are they part of the ontoogy?
178 1 Gyorgy Fazekas
** how to describe salience, can you hear it, can you perceive, is there an agreement
179 1 Gyorgy Fazekas
** how to describe weighting, confidence
180 1 Gyorgy Fazekas
** mood, music psychology, cognition, emotion, (perception ?)
181 1 Gyorgy Fazekas
** provenance => music provenance
182 1 Gyorgy Fazekas
** deprecation and versioning
183 1 Gyorgy Fazekas
184 18 Gyorgy Fazekas
185 1 Gyorgy Fazekas
h1. Objectives:
186 18 Gyorgy Fazekas
187 1 Gyorgy Fazekas
Long term goals and some concrete tasks that can be done as the outcome of the collaboration:
188 1 Gyorgy Fazekas
189 1 Gyorgy Fazekas
** A version of Sonic Annotator that produces output adhering the new ontology
190 2 Gyorgy Fazekas
** Are we making people happier by doing so?
191 1 Gyorgy Fazekas
** gradual transition period?
192 1 Gyorgy Fazekas
** extend other software toolkits; e.g. a verison of Marsyas in C++
193 1 Gyorgy Fazekas
** multitrack processing using Sonic Annotator (this feature might come along soon)
194 1 Gyorgy Fazekas
195 18 Gyorgy Fazekas
196 1 Gyorgy Fazekas
h2. Some immediate tasks before the next meeting:
197 2 Gyorgy Fazekas
198 1 Gyorgy Fazekas
** collect more resources 
199 2 Gyorgy Fazekas
** Verify the relationship between AF as is, and other feature/segmentation Ontologies
200 2 Gyorgy Fazekas
** what other software uses it?
201 2 Gyorgy Fazekas
** papers and literature review
202 1 Gyorgy Fazekas
** relation to projects e.g. SIEMAC
203 1 Gyorgy Fazekas
** collect features that we need
204 2 Gyorgy Fazekas
** define scope (extend the diagram of the set of ontologies: )
205 1 Gyorgy Fazekas
** collect specific application examples from existing processing chain / workflow
206 18 Gyorgy Fazekas
207 2 Gyorgy Fazekas
Collect software/projects that use/produce audio features:
208 18 Gyorgy Fazekas
209 18 Gyorgy Fazekas
** plugins (analysis, adaptive effects, adaptive synthesis)
210 18 Gyorgy Fazekas
** LADSPA, 
211 18 Gyorgy Fazekas
** VAMP, 
212 18 Gyorgy Fazekas
** Marsyas, 
213 18 Gyorgy Fazekas
** CLAM, 
214 18 Gyorgy Fazekas
** libextract, 
215 18 Gyorgy Fazekas
** COMirva, 
216 18 Gyorgy Fazekas
** MIRtoolbox, 
217 18 Gyorgy Fazekas
** Supercollider, 
218 18 Gyorgy Fazekas
** other frameworks
219 3 Gyorgy Fazekas
220 3 Gyorgy Fazekas
221 32 Gyorgy Fazekas
See graph for how this relates to existing ontologies, ontology libraries:
222 32 Gyorgy Fazekas
http://www.isophonics.net/sites/isophonics.net/files/combined-frameworks.png