annotate docs/WAC2016/WAC2016.tex @ 320:1f7c3d9ccb94 WAC2016

Paper: title and meeting notes
author Brecht De Man <b.deman@qmul.ac.uk>
date Mon, 12 Oct 2015 11:03:57 +0100
parents 85f05a29a01a
children 107e70b2a927
rev   line source
b@308 1 \documentclass{sig-alternate}
b@318 2 \usepackage{hyperref} % make links (like references, links to Sections, ...) clickable
b@318 3 \usepackage{enumitem} % tighten itemize etc by appending '[noitemsep,nolistsep]'
b@308 4
b@308 5 \begin{document}
b@308 6
b@308 7 % Copyright
b@308 8 \setcopyright{waclicense}
b@308 9
b@308 10
b@308 11 %% DOI
b@308 12 %\doi{10.475/123_4}
b@308 13 %
b@308 14 %% ISBN
b@308 15 %\isbn{123-4567-24-567/08/06}
b@308 16 %
b@308 17 %%Conference
b@308 18 %\conferenceinfo{PLDI '13}{June 16--19, 2013, Seattle, WA, USA}
b@308 19 %
b@308 20 %\acmPrice{\$15.00}
b@308 21
b@308 22 %
b@308 23 % --- Author Metadata here ---
b@308 24 \conferenceinfo{Web Audio Conference WAC-2016,}{April 4--6, 2016, Atlanta, USA}
b@308 25 \CopyrightYear{2016} % Allows default copyright year (20XX) to be over-ridden - IF NEED BE.
b@308 26 %\crdata{0-12345-67-8/90/01} % Allows default copyright data (0-89791-88-6/97/05) to be over-ridden - IF NEED BE.
b@308 27 % --- End of Author Metadata ---
b@308 28
b@320 29 \title{Web Audio Evaluation Tool: A framework for subjective assessment of audio}
b@308 30 %\subtitle{[Extended Abstract]
b@308 31 %\titlenote{A full version of this paper is available as
b@308 32 %\textit{Author's Guide to Preparing ACM SIG Proceedings Using
b@308 33 %\LaTeX$2_\epsilon$\ and BibTeX} at
b@308 34 %\texttt{www.acm.org/eaddress.htm}}}
b@308 35 %
b@308 36 % You need the command \numberofauthors to handle the 'placement
b@308 37 % and alignment' of the authors beneath the title.
b@308 38 %
b@308 39 % For aesthetic reasons, we recommend 'three authors at a time'
b@308 40 % i.e. three 'name/affiliation blocks' be placed beneath the title.
b@308 41 %
b@308 42 % NOTE: You are NOT restricted in how many 'rows' of
b@308 43 % "name/affiliations" may appear. We just ask that you restrict
b@308 44 % the number of 'columns' to three.
b@308 45 %
b@308 46 % Because of the available 'opening page real-estate'
b@308 47 % we ask you to refrain from putting more than six authors
b@308 48 % (two rows with three columns) beneath the article title.
b@308 49 % More than six makes the first-page appear very cluttered indeed.
b@308 50 %
b@308 51 % Use the \alignauthor commands to handle the names
b@308 52 % and affiliations for an 'aesthetic maximum' of six authors.
b@308 53 % Add names, affiliations, addresses for
b@308 54 % the seventh etc. author(s) as the argument for the
b@308 55 % \additionalauthors command.
b@308 56 % These 'additional authors' will be output/set for you
b@308 57 % without further effort on your part as the last section in
b@308 58 % the body of your article BEFORE References or any Appendices.
b@308 59
b@316 60 % FIVE authors instead of four, to leave space between first two authors.
d@310 61 \numberofauthors{5} % in this sample file, there are a *total*
b@308 62 % of EIGHT authors. SIX appear on the 'first-page' (for formatting
b@308 63 % reasons) and the remaining two appear in the \additionalauthors section.
b@308 64 %
b@308 65 \author{
b@308 66 % You can go ahead and credit any number of authors here,
b@308 67 % e.g. one 'row of three' or two rows (consisting of one row of three
b@308 68 % and a second row of one, two or three).
b@308 69 %
b@308 70 % The command \alignauthor (no curly braces needed) should
b@308 71 % precede each author name, affiliation/snail-mail address and
b@308 72 % e-mail address. Additionally, tag each line of
b@308 73 % affiliation/address with \affaddr, and tag the
b@308 74 % e-mail address with \email.
b@308 75 %
b@308 76 % 1st. author
b@308 77 \alignauthor Nicholas Jillings\\
b@308 78 \email{n.g.r.jillings@se14.qmul.ac.uk}
b@316 79 % dummy author for nicer spacing
b@316 80 \alignauthor
b@308 81 % 2nd. author
b@308 82 \alignauthor Brecht De Man\\
b@308 83 \email{b.deman@qmul.ac.uk}
b@308 84 \and % use '\and' if you need 'another row' of author names
b@308 85 % 3rd. author
b@308 86 \alignauthor David Moffat\\
b@308 87 \email{d.j.moffat@qmul.ac.uk}
b@308 88 % 4th. author
b@308 89 \alignauthor Joshua D. Reiss\\
b@308 90 \email{joshua.reiss@qmul.ac.uk}
b@316 91 \and % new line for address
b@308 92 \affaddr{Centre for Digital Music}\\
b@308 93 \affaddr{School of Electronic Engineering and Computer Science}\\
b@308 94 \affaddr{Queen Mary University of London}\\
b@308 95 \affaddr{Mile End Road,}
b@308 96 \affaddr{London E1 4NS}\\
b@308 97 \affaddr{United Kingdom}\\
b@308 98 }
b@308 99 %Centre for Digital Music, School of Electronic Engineering and Computer Science, Queen Mary University of London
b@308 100 %% 5th. author
b@308 101 %\alignauthor Sean Fogarty\\
b@308 102 % \affaddr{NASA Ames Research Center}\\
b@308 103 % \affaddr{Moffett Field}\\
b@308 104 % \email{fogartys@amesres.org}
b@308 105 %% 6th. author
b@308 106 %\alignauthor Charles Palmer\\
b@308 107 % \affaddr{Palmer Research Laboratories}\\
b@308 108 % \affaddr{8600 Datapoint Drive}\\
b@308 109 % \email{cpalmer@prl.com}
b@308 110 %}
b@308 111 % There's nothing stopping you putting the seventh, eighth, etc.
b@308 112 % author on the opening page (as the 'third row') but we ask,
b@308 113 % for aesthetic reasons that you place these 'additional authors'
b@308 114 % in the \additional authors block, viz.
b@308 115 %\additionalauthors{Additional authors: John Smith (The Th{\o}rv{\"a}ld Group,
b@308 116 %email: {\texttt{jsmith@affiliation.org}}) and Julius P.~Kumquat
b@308 117 %(The Kumquat Consortium, email: {\texttt{jpkumquat@consortium.net}}).}
b@308 118 \date{1 October 2015}
b@308 119 % Just remember to make sure that the TOTAL number of authors
b@308 120 % is the number that will appear on the first page PLUS the
b@308 121 % number that will appear in the \additionalauthors section.
b@308 122
b@308 123 \maketitle
b@308 124 \begin{abstract}
b@308 125 Here comes the abstract.
b@308 126 \end{abstract}
b@308 127
b@308 128
b@308 129 \section{Introduction}
b@317 130
b@317 131 % Listening tests/perceptual audio evaluation: what are they, why are they important
b@317 132 % As opposed to limited scope of WAC15 paper: also musical features, realism of sound effects / sound synthesis, performance of source separation and other algorithms...
b@317 133 Perceptual evaluation of audio, in the form of listening tests, is a powerful way to assess anything from audio codec quality over realism of sound synthesis to the performance of source separation, automated music production and
b@317 134 In less technical areas, the framework of a listening test can be used to measure emotional response to music or test cognitive abilities. % maybe some references? If there's space.
b@317 135
b@318 136 % check out http://link.springer.com/article/10.1007/s10055-015-0270-8 - only paper that cited WAC15 paper
b@318 137
b@317 138 % Why difficult? Challenges? What constitutes a good interface?
b@317 139 Technical, interfaces, user friendliness, reliability
b@317 140
b@317 141 Note that the design of an effective listening test further poses many challenges unrelated to interface design, which are beyond the scope of this paper \cite{bech}.
b@317 142
b@317 143 % Why in the browser?
b@317 144 Web Audio API has made some essential features like sample manipulation of audio streams possible \cite{schoeffler2015mushra}.
b@308 145
b@316 146 Situating the Web Audio Evaluation Tool between other currently available evaluation tools, ...
b@316 147
b@316 148 % only browser-based?
b@316 149 \begin{table*}[htdp]
b@316 150 \caption{Table with existing listening test platforms and their features}
b@316 151 \begin{center}
b@316 152 \begin{tabular}{|*{6}{l|}}
b@316 153 \hline
b@317 154 \textbf{Name} & \textbf{Ref.} & \textbf{Language} & \textbf{Interfaces} & \textbf{Remote} & \textbf{All UI} \\
b@316 155 \hline
b@317 156 APE & \cite{ape} & MATLAB & multiple stimulus one axis & & \\
b@316 157 BeaqleJS & \cite{beaqlejs} & JavaScript & & not natively supported & \\
b@317 158 HULTI-GEN & \cite{hultigen} & MAX & & & \checkmark \\
b@317 159 \textbf{WAET} & \cite{waet} & JavaScript & \textbf{all of the above} & \checkmark & \checkmark \\
b@316 160 \hline
b@316 161 \end{tabular}
b@316 162 \end{center}
b@316 163 \label{tab:toolboxes}
b@316 164 \end{table*}%
b@316 165
b@316 166 % about BeaqleJS
b@316 167 ... However, BeaqleJS \cite{beaqlejs} does not make use of the Web Audio API, %requires programming knowledge...
b@316 168
b@316 169 %
b@317 170 Selling points: remote tests, visualisaton, create your own test in the browser, many interfaces, few/no dependencies, flexibility
b@317 171
b@317 172 As recruiting participants can be very time-consuming, and as for some tests a large number of participants is needed, browser-based tests \cite{schoeffler2015mushra}. However, to our knowledge, no tool currently exists that allows the creation of a remotely accessible listening test. % I wonder what you can do with Amazon Mechanical Turk and the likes.
b@317 173
b@317 174 [Talking about what we do in the various sections of this paper. Referring to \cite{waet}. ]
b@316 175
b@320 176 % MEETING 8 OCTOBER
b@320 177 \subsection{Meeting 8 October}
b@320 178 \begin{itemize}
b@320 179 \item Do we manipulate audio?\\
b@320 180 \begin{itemize}
b@320 181 \item Add loudness equalisation? (test\_create.html) Tag with gains.
b@320 182 \item Add volume slider?
b@320 183 \item Cross-fade (in interface node): default 0, number of seconds
b@320 184 \item Also: we use the playback buffer to present metrics of which portion is listened to
b@320 185 \end{itemize}
b@320 186 \item Logging system information: whichever are possible (justify others)
b@320 187 \item Input streams as audioelements
b@320 188 \item Capture microphone to estimate loudness (especially Macbook)
b@320 189 \item Test page (in-built oscillators): left-right calibration, ramp up test tone until you hear it; optional compensating EQ (future work implementing own filters) --> Highlight issues!
b@320 190 \item Record IP address (PHP function, grab and append to XML file)
b@320 191 \item Expand anchor/reference options
b@320 192 \item AB / ABX
b@320 193 \end{itemize}
b@320 194
b@320 195 \subsubsection{Issues}
b@320 196 \begin{itemize}
b@320 197 \item Filters not consistent (Nick to test across browsers)
b@320 198 \item Playback audiobuffers need to be destroyed and rebuilt each time
b@320 199 \item Can't get channel data, hardware input/output...
b@320 200 \end{itemize}
b@320 201
b@316 202
b@317 203 \section{Architecture} % title? 'back end'? % NICK
b@308 204 A slightly technical overview of the system. Talk about XML, JavaScript, Web Audio API, HTML5.
b@316 205 Describe and/or visualise audioholder-audioelement-... structure.
b@316 206
b@317 207 % see also SMC12 - less detail here
b@317 208
b@317 209 Which type of files? % WAV, anything else? Perhaps not exhaustive list, but say something along the lines of 'whatever browser supports'
b@315 210
b@316 211 Streaming audio? % probably not, unless it's easy
b@316 212
b@317 213 Compatibility? % not IE, everything else fine?
b@317 214
b@317 215
b@315 216
b@315 217
b@316 218 \section{Remote tests} % with previous?
b@317 219
b@317 220 If the experimenter is willing to trade some degree of control for a higher number of participants, the test can be hosted on a web server so that subjects can take part remotely. This way, a link can be shared widely in the hope of attracting a large amount of subjects, while listening conditions and subject reliability may be less ideal. However, a sound system calibration page and a wide range of metrics logged during the test mitigate these problems. Note also that in some experiments, it may be preferred that the subject has a `real life', familiar listening set-up, for instance when perceived quality differences on everyday sound systems are investigated.
b@317 221 Furthermore, a fully browser-based test, where the collection of the results is automatic, is more efficient and technically reliable even when the test still takes place under lab conditions.
b@317 222
b@315 223 The following features allow easy and effective remote testing:
b@318 224 \begin{itemize}[noitemsep,nolistsep]
b@315 225 \item PHP script to collect result XML files
b@315 226 \item Randomly pick specified number of audioholders
b@317 227 \item Calibration
b@315 228 \item Functionality to participate multiple times
b@318 229 \begin{itemize}[noitemsep,nolistsep]
b@315 230 \item Possible to log in with unique ID (no password)
b@315 231 \item Pick `new user' (need new, unique ID) or `already participated' (need already available ID)
b@315 232 \item Store XML on server with IDs plus which audioholders have already been listened to
b@315 233 \item Don't show `post-test' survey after first time
b@315 234 \item Pick `new' audioholders if available
b@315 235 \item Copy survey information first time to new XMLs
b@315 236 \end{itemize}
b@315 237 \item Intermediate saves
b@315 238 \item Collect IP address information (privacy issues?) --> geo-related API?
b@317 239 \item Time measurement - see before or
b@315 240 \end{itemize}
b@315 241
b@308 242
b@316 243 \section{Interfaces} % title? 'Front end'? % Dave
b@316 244 `Build your own test'
b@316 245
b@317 246 Elements present to build any of the following interfaces, and many more: axes, markers, labels, anchors, references, reference signal button, stop button, comment boxes, radio buttons, checkboxes, transport/scrubber bar
b@317 247
b@317 248 Established tests (see below) included as `presets' in the build-your-own-test page.
b@317 249
b@317 250
b@308 251 We could add more interfaces, such as:
b@318 252 \begin{itemize}[noitemsep,nolistsep]
b@316 253 \item (APE style) \cite{ape}
b@308 254 \item Multi attribute ratings
d@310 255 \item MUSHRA (ITU-R BS. 1534)~\cite{recommendation20031534}
d@310 256 \item Interval Scale~\cite{zacharov1999round}
d@310 257 \item Rank Scale~\cite{pascoe1983evaluation}
b@308 258
d@310 259 \item 2D Plane rating - e.g. Valence vs. Arousal~\cite{carroll1969individual}
d@310 260 \item Likert scale~\cite{likert1932technique}
d@309 261
b@316 262 \item {\bf All the following are the interfaces available in HULTI-GEN~\cite{hultigen} }
d@310 263 \item ABC/HR (ITU-R BS. 1116)~\cite{recommendation19971116}
d@309 264 \begin{itemize}
d@309 265 \item Continuous Scale (5-1) Imperceptible, Perceptible but not annoying, slightly annoying, annoying, very annoying. (default Inaudible?)
d@309 266 \end{itemize}
d@309 267 \item -50 to 50 Bipolar with Ref
d@309 268 \begin{itemize}
d@309 269 \item Scale -50 to 50 on Mushra with default values as 0 in middle and a comparison ``Reference'' to compare to 0 value
d@309 270 \end{itemize}
d@310 271 \item Absolute Category Rating (ACR) Scale~\cite{rec1996p}
d@309 272 \begin{itemize}
d@309 273 \item 5 point Scale - Bad, Poor, Fair, Good, Excellent (Default fair?)
d@309 274 \end{itemize}
d@310 275 \item Degredation Category Rating (DCR) Scale~\cite{rec1996p}
d@309 276 \begin{itemize}
d@309 277 \item 5 point Scale - Inaudible, Audible but not annoying, slightly annoying, annoying, very annoying. (default Inaudible?) - {\it Basically just quantised ABC/HR?}
d@309 278 \end{itemize}
d@310 279 \item Comparison Category Rating (CCR) Scale~\cite{rec1996p}
d@309 280 \begin{itemize}
d@309 281 \item 7 point scale: Much Better, Better, Slightly Better, About the same, slightly worse, worse, much worse - Default about the same with reference to compare to
d@309 282 \end{itemize}
d@310 283 \item 9 Point Hedonic Category Rating Scale~\cite{peryam1952advanced}
d@309 284 \begin{itemize}
d@309 285 \item 9 point scale: Like Extremely, Like Very Much, Like Moderate, Like Slightly, Neither Like nor Dislike, dislike Extremely, dislike Very Much, dislike Moderate, dislike Slightly - Default Neither Like nor Dislike with reference to compare to
d@309 286 \end{itemize}
d@310 287 \item ITU-R 5 Point Continuous Impairment Scale~\cite{rec1997bs}
d@309 288 \begin{itemize}
d@309 289 \item 5 point Scale (5-1) Imperceptible, Perceptible but not annoying, slightly annoying, annoying, very annoying. (default Inaudible?)- {\it Basically just quantised ABC/HR, or Different named DCR}
d@309 290 \end{itemize}
d@310 291 \item Pairwise Comparison (Better/Worse)~\cite{david1963method}
d@309 292 \begin{itemize}
d@309 293 \item 2 point Scale - Better or Worse - (not sure how to default this - they default everything to better, which is an interesting choice)
d@309 294 \end{itemize}
b@308 295 \end{itemize}
b@308 296
b@308 297 There are also the following interfaces, which would require a slightly different `engine' underneath, e.g. loading a different page for every possible pair.
b@318 298 \begin{itemize}[noitemsep,nolistsep]
d@310 299 \item AB Test~\cite{lipshitz1981great}
d@310 300 \item ABX Test~\cite{clark1982high}
b@308 301 \item JND
b@308 302 \end{itemize}
b@308 303
b@308 304 A screenshot would be nice.
b@308 305
b@308 306 \section{Analysis and diagnostics}
b@317 307 % don't mention Python scripts
b@308 308 It would be great to have easy-to-use analysis tools to visualise the collected data and even do science with it. Even better would be to have all this in the browser. Complete perfection would be achieved if and when only limited setup, installation time, and expertise are required for the average non-CS researcher to use this.
b@308 309
b@312 310 The following could be nice:
b@312 311
b@318 312 \begin{itemize}[noitemsep,nolistsep]
b@312 313 \item Web page showing all audioholder IDs, file names, subject IDs, audio element IDs, ... in the collected XMLs so far (\texttt{saves/*.xml})
b@312 314 \item Check/uncheck each of the above for analysis (e.g. zoom in on a certain song, or exclude a subset of subjects)
b@312 315 \item Click a mix to hear it (follow path in XML setup file, which is also embedded in the XML result file)
b@312 316 \item Box plot, confidence plot, scatter plot of values (for a given audioholder)
b@312 317 \item Timeline for a specific subject (see Python scripts), perhaps re-playing the experiment in X times realtime. (If actual realtime, you could replay the audio...)
b@312 318 \item Distribution plots of any radio button and number questions (drop-down menu with `pretest', `posttest', ...; then drop-down menu with question `IDs' like `gender', `age', ...; make pie chart/histogram of these values over selected range of XMLs)
b@312 319 \item All `comments' on a specific audioelement
b@312 320 \item A `download' button for a nice CSV of various things (values, survey responses, comments) people might want to use for analysis, e.g. when XML scares them
b@315 321 \item Validation of setup XMLs (easily spot `errors', like duplicate IDs or URLs, missing/dangling tags, ...)
b@312 322 \end{itemize}
b@312 323
b@312 324 A subset of the above would already be nice for this paper.
b@312 325
b@308 326 Some pictures here please.
b@308 327
b@316 328 \section{Concluding remarks and future work}
b@308 329
b@317 330 The code and documentation can be pulled or downloaded from \url{code.soundsoftware.ac.uk/projects/webaudioevaluationtool}.
b@308 331
b@317 332 [Talking a little bit about what else might happen. Unless we really want to wrap this up. ]
b@316 333
b@317 334 Use \cite{schoeffler2015mushra} as a `checklist', even though it only considers subjective evaluation of audio systems (and focuses on the requirements for a MUSHRA test).
b@317 335 % remote
b@317 336 % language support (not explicitly stated)
b@317 337 % crossfades
b@317 338 % choosing speakers/sound device from within browser?
b@317 339 % collect information about software and sound system
b@317 340 % buttons, scales, ... UI elements
b@317 341 % must be able to load uncompressed PCM
b@317 342
b@317 343 [What can we not do? `Method of adjustment', as in \cite{schoeffler2015mushra} is another can of worms, because, like, you could adjust lots of things (volume is just one of them, that could be done quite easily). Same for using input signals like the participant's voice. Either leave out, or mention this requires modification of the code we provide.]
b@308 344
b@308 345 %
b@308 346 % The following two commands are all you need in the
b@308 347 % initial runs of your .tex file to
b@308 348 % produce the bibliography for the citations in your paper.
b@308 349 \bibliographystyle{abbrv}
b@308 350 \bibliography{WAC2016} % sigproc.bib is the name of the Bibliography in this case
b@308 351 % You must have a proper ".bib" file
b@308 352 % and remember to run:
b@308 353 % latex bibtex latex latex
b@308 354 % to resolve all references
b@308 355 %
b@308 356 % ACM needs 'a single self-contained file'!
b@308 357 %
b@308 358 \end{document}