comparison docs/WAC2016/WAC2016.tex @ 1950:6eec4abc85e7

Minor edits to paper, updated installation instructions, added to be ignored files (e.g. LaTeX auxiliary files)
author Brecht De Man <b.deman@qmul.ac.uk>
date Wed, 30 Sep 2015 18:34:40 +0200
parents 162d571a71d0
children 28b385057ded
comparison
equal deleted inserted replaced
1949:162d571a71d0 1950:6eec4abc85e7
1 \documentclass{sig-alternate} 1 \documentclass{sig-alternate}
2 \usepackage{hyperref} 2 \usepackage{hyperref} % make links (like references, links to Sections, ...) clickable
3 \usepackage{enumitem} % tighten itemize etc by appending '[noitemsep,nolistsep]'
3 4
4 \begin{document} 5 \begin{document}
5 6
6 % Copyright 7 % Copyright
7 \setcopyright{waclicense} 8 \setcopyright{waclicense}
130 % Listening tests/perceptual audio evaluation: what are they, why are they important 131 % Listening tests/perceptual audio evaluation: what are they, why are they important
131 % As opposed to limited scope of WAC15 paper: also musical features, realism of sound effects / sound synthesis, performance of source separation and other algorithms... 132 % As opposed to limited scope of WAC15 paper: also musical features, realism of sound effects / sound synthesis, performance of source separation and other algorithms...
132 Perceptual evaluation of audio, in the form of listening tests, is a powerful way to assess anything from audio codec quality over realism of sound synthesis to the performance of source separation, automated music production and 133 Perceptual evaluation of audio, in the form of listening tests, is a powerful way to assess anything from audio codec quality over realism of sound synthesis to the performance of source separation, automated music production and
133 In less technical areas, the framework of a listening test can be used to measure emotional response to music or test cognitive abilities. % maybe some references? If there's space. 134 In less technical areas, the framework of a listening test can be used to measure emotional response to music or test cognitive abilities. % maybe some references? If there's space.
134 135
136 % check out http://link.springer.com/article/10.1007/s10055-015-0270-8 - only paper that cited WAC15 paper
137
135 % Why difficult? Challenges? What constitutes a good interface? 138 % Why difficult? Challenges? What constitutes a good interface?
136 Technical, interfaces, user friendliness, reliability 139 Technical, interfaces, user friendliness, reliability
137 140
138 Note that the design of an effective listening test further poses many challenges unrelated to interface design, which are beyond the scope of this paper \cite{bech}. 141 Note that the design of an effective listening test further poses many challenges unrelated to interface design, which are beyond the scope of this paper \cite{bech}.
139 142
190 193
191 If the experimenter is willing to trade some degree of control for a higher number of participants, the test can be hosted on a web server so that subjects can take part remotely. This way, a link can be shared widely in the hope of attracting a large amount of subjects, while listening conditions and subject reliability may be less ideal. However, a sound system calibration page and a wide range of metrics logged during the test mitigate these problems. Note also that in some experiments, it may be preferred that the subject has a `real life', familiar listening set-up, for instance when perceived quality differences on everyday sound systems are investigated. 194 If the experimenter is willing to trade some degree of control for a higher number of participants, the test can be hosted on a web server so that subjects can take part remotely. This way, a link can be shared widely in the hope of attracting a large amount of subjects, while listening conditions and subject reliability may be less ideal. However, a sound system calibration page and a wide range of metrics logged during the test mitigate these problems. Note also that in some experiments, it may be preferred that the subject has a `real life', familiar listening set-up, for instance when perceived quality differences on everyday sound systems are investigated.
192 Furthermore, a fully browser-based test, where the collection of the results is automatic, is more efficient and technically reliable even when the test still takes place under lab conditions. 195 Furthermore, a fully browser-based test, where the collection of the results is automatic, is more efficient and technically reliable even when the test still takes place under lab conditions.
193 196
194 The following features allow easy and effective remote testing: 197 The following features allow easy and effective remote testing:
195 \begin{itemize} 198 \begin{itemize}[noitemsep,nolistsep]
196 \item PHP script to collect result XML files 199 \item PHP script to collect result XML files
197 \item Randomly pick specified number of audioholders 200 \item Randomly pick specified number of audioholders
198 \item Calibration 201 \item Calibration
199 \item Functionality to participate multiple times 202 \item Functionality to participate multiple times
200 \begin{itemize} 203 \begin{itemize}[noitemsep,nolistsep]
201 \item Possible to log in with unique ID (no password) 204 \item Possible to log in with unique ID (no password)
202 \item Pick `new user' (need new, unique ID) or `already participated' (need already available ID) 205 \item Pick `new user' (need new, unique ID) or `already participated' (need already available ID)
203 \item Store XML on server with IDs plus which audioholders have already been listened to 206 \item Store XML on server with IDs plus which audioholders have already been listened to
204 \item Don't show `post-test' survey after first time 207 \item Don't show `post-test' survey after first time
205 \item Pick `new' audioholders if available 208 \item Pick `new' audioholders if available
218 221
219 Established tests (see below) included as `presets' in the build-your-own-test page. 222 Established tests (see below) included as `presets' in the build-your-own-test page.
220 223
221 224
222 We could add more interfaces, such as: 225 We could add more interfaces, such as:
223 \begin{itemize} 226 \begin{itemize}[noitemsep,nolistsep]
224 \item (APE style) \cite{ape} 227 \item (APE style) \cite{ape}
225 \item Multi attribute ratings 228 \item Multi attribute ratings
226 \item MUSHRA (ITU-R BS. 1534)~\cite{recommendation20031534} 229 \item MUSHRA (ITU-R BS. 1534)~\cite{recommendation20031534}
227 \item Interval Scale~\cite{zacharov1999round} 230 \item Interval Scale~\cite{zacharov1999round}
228 \item Rank Scale~\cite{pascoe1983evaluation} 231 \item Rank Scale~\cite{pascoe1983evaluation}
264 \item 2 point Scale - Better or Worse - (not sure how to default this - they default everything to better, which is an interesting choice) 267 \item 2 point Scale - Better or Worse - (not sure how to default this - they default everything to better, which is an interesting choice)
265 \end{itemize} 268 \end{itemize}
266 \end{itemize} 269 \end{itemize}
267 270
268 There are also the following interfaces, which would require a slightly different `engine' underneath, e.g. loading a different page for every possible pair. 271 There are also the following interfaces, which would require a slightly different `engine' underneath, e.g. loading a different page for every possible pair.
269 \begin{itemize} 272 \begin{itemize}[noitemsep,nolistsep]
270 \item AB Test~\cite{lipshitz1981great} 273 \item AB Test~\cite{lipshitz1981great}
271 \item ABX Test~\cite{clark1982high} 274 \item ABX Test~\cite{clark1982high}
272 \item JND 275 \item JND
273 \end{itemize} 276 \end{itemize}
274 277
278 % don't mention Python scripts 281 % don't mention Python scripts
279 It would be great to have easy-to-use analysis tools to visualise the collected data and even do science with it. Even better would be to have all this in the browser. Complete perfection would be achieved if and when only limited setup, installation time, and expertise are required for the average non-CS researcher to use this. 282 It would be great to have easy-to-use analysis tools to visualise the collected data and even do science with it. Even better would be to have all this in the browser. Complete perfection would be achieved if and when only limited setup, installation time, and expertise are required for the average non-CS researcher to use this.
280 283
281 The following could be nice: 284 The following could be nice:
282 285
283 \begin{itemize} 286 \begin{itemize}[noitemsep,nolistsep]
284 \item Web page showing all audioholder IDs, file names, subject IDs, audio element IDs, ... in the collected XMLs so far (\texttt{saves/*.xml}) 287 \item Web page showing all audioholder IDs, file names, subject IDs, audio element IDs, ... in the collected XMLs so far (\texttt{saves/*.xml})
285 \item Check/uncheck each of the above for analysis (e.g. zoom in on a certain song, or exclude a subset of subjects) 288 \item Check/uncheck each of the above for analysis (e.g. zoom in on a certain song, or exclude a subset of subjects)
286 \item Click a mix to hear it (follow path in XML setup file, which is also embedded in the XML result file) 289 \item Click a mix to hear it (follow path in XML setup file, which is also embedded in the XML result file)
287 \item Box plot, confidence plot, scatter plot of values (for a given audioholder) 290 \item Box plot, confidence plot, scatter plot of values (for a given audioholder)
288 \item Timeline for a specific subject (see Python scripts), perhaps re-playing the experiment in X times realtime. (If actual realtime, you could replay the audio...) 291 \item Timeline for a specific subject (see Python scripts), perhaps re-playing the experiment in X times realtime. (If actual realtime, you could replay the audio...)