webaudioevaluationtool: docs/WAC2016/WAC2016.tex annotate

annotate docs/WAC2016/WAC2016.tex @ 1949:162d571a71d0

Paper: bits of introduction and remote testing, minor edits

author	Brecht De Man <b.deman@qmul.ac.uk>
date	Tue, 29 Sep 2015 23:55:10 +0200
parents	a14124d3578b
children	3b637867eafe

rev	line source
b@1481	1 \documentclass{sig-alternate}
b@1949	2 \usepackage{hyperref}
b@1481	3
b@1481	4 \begin{document}
b@1481	5
b@1481	6 % Copyright
b@1481	7 \setcopyright{waclicense}
b@1481	8
b@1481	9
b@1481	10 %% DOI
b@1481	11 %\doi{10.475/123_4}
b@1481	12 %
b@1481	13 %% ISBN
b@1481	14 %\isbn{123-4567-24-567/08/06}
b@1481	15 %
b@1481	16 %%Conference
b@1481	17 %\conferenceinfo{PLDI '13}{June 16--19, 2013, Seattle, WA, USA}
b@1481	18 %
b@1481	19 %\acmPrice{\$15.00}
b@1481	20
b@1481	21 %
b@1481	22 % --- Author Metadata here ---
b@1481	23 \conferenceinfo{Web Audio Conference WAC-2016,}{April 4--6, 2016, Atlanta, USA}
b@1481	24 \CopyrightYear{2016} % Allows default copyright year (20XX) to be over-ridden - IF NEED BE.
b@1481	25 %\crdata{0-12345-67-8/90/01} % Allows default copyright data (0-89791-88-6/97/05) to be over-ridden - IF NEED BE.
b@1481	26 % --- End of Author Metadata ---
b@1481	27
b@1945	28 \title{Web Audio Evaluation Tool something something}
b@1481	29 %\subtitle{[Extended Abstract]
b@1481	30 %\titlenote{A full version of this paper is available as
b@1481	31 %\textit{Author's Guide to Preparing ACM SIG Proceedings Using
b@1481	32 %\LaTeX$2_\epsilon$\ and BibTeX} at
b@1481	33 %\texttt{www.acm.org/eaddress.htm}}}
b@1481	34 %
b@1481	35 % You need the command \numberofauthors to handle the 'placement
b@1481	36 % and alignment' of the authors beneath the title.
b@1481	37 %
b@1481	38 % For aesthetic reasons, we recommend 'three authors at a time'
b@1481	39 % i.e. three 'name/affiliation blocks' be placed beneath the title.
b@1481	40 %
b@1481	41 % NOTE: You are NOT restricted in how many 'rows' of
b@1481	42 % "name/affiliations" may appear. We just ask that you restrict
b@1481	43 % the number of 'columns' to three.
b@1481	44 %
b@1481	45 % Because of the available 'opening page real-estate'
b@1481	46 % we ask you to refrain from putting more than six authors
b@1481	47 % (two rows with three columns) beneath the article title.
b@1481	48 % More than six makes the first-page appear very cluttered indeed.
b@1481	49 %
b@1481	50 % Use the \alignauthor commands to handle the names
b@1481	51 % and affiliations for an 'aesthetic maximum' of six authors.
b@1481	52 % Add names, affiliations, addresses for
b@1481	53 % the seventh etc. author(s) as the argument for the
b@1481	54 % \additionalauthors command.
b@1481	55 % These 'additional authors' will be output/set for you
b@1481	56 % without further effort on your part as the last section in
b@1481	57 % the body of your article BEFORE References or any Appendices.
b@1481	58
b@1948	59 % FIVE authors instead of four, to leave space between first two authors.
me@1944	60 \numberofauthors{5} % in this sample file, there are a total
b@1481	61 % of EIGHT authors. SIX appear on the 'first-page' (for formatting
b@1481	62 % reasons) and the remaining two appear in the \additionalauthors section.
b@1481	63 %
b@1481	64 \author{
b@1481	65 % You can go ahead and credit any number of authors here,
b@1481	66 % e.g. one 'row of three' or two rows (consisting of one row of three
b@1481	67 % and a second row of one, two or three).
b@1481	68 %
b@1481	69 % The command \alignauthor (no curly braces needed) should
b@1481	70 % precede each author name, affiliation/snail-mail address and
b@1481	71 % e-mail address. Additionally, tag each line of
b@1481	72 % affiliation/address with \affaddr, and tag the
b@1481	73 % e-mail address with \email.
b@1481	74 %
b@1481	75 % 1st. author
b@1481	76 \alignauthor Nicholas Jillings\\
b@1481	77 \email{n.g.r.jillings@se14.qmul.ac.uk}
b@1948	78 % dummy author for nicer spacing
b@1948	79 \alignauthor
b@1481	80 % 2nd. author
b@1481	81 \alignauthor Brecht De Man\\
b@1481	82 \email{b.deman@qmul.ac.uk}
b@1481	83 \and % use '\and' if you need 'another row' of author names
b@1481	84 % 3rd. author
b@1481	85 \alignauthor David Moffat\\
b@1481	86 \email{d.j.moffat@qmul.ac.uk}
b@1481	87 % 4th. author
b@1481	88 \alignauthor Joshua D. Reiss\\
b@1481	89 \email{joshua.reiss@qmul.ac.uk}
b@1948	90 \and % new line for address
b@1481	91 \affaddr{Centre for Digital Music}\\
b@1481	92 \affaddr{School of Electronic Engineering and Computer Science}\\
b@1481	93 \affaddr{Queen Mary University of London}\\
b@1481	94 \affaddr{Mile End Road,}
b@1481	95 \affaddr{London E1 4NS}\\
b@1481	96 \affaddr{United Kingdom}\\
b@1481	97 }
b@1481	98 %Centre for Digital Music, School of Electronic Engineering and Computer Science, Queen Mary University of London
b@1481	99 %% 5th. author
b@1481	100 %\alignauthor Sean Fogarty\\
b@1481	101 % \affaddr{NASA Ames Research Center}\\
b@1481	102 % \affaddr{Moffett Field}\\
b@1481	103 % \email{fogartys@amesres.org}
b@1481	104 %% 6th. author
b@1481	105 %\alignauthor Charles Palmer\\
b@1481	106 % \affaddr{Palmer Research Laboratories}\\
b@1481	107 % \affaddr{8600 Datapoint Drive}\\
b@1481	108 % \email{cpalmer@prl.com}
b@1481	109 %}
b@1481	110 % There's nothing stopping you putting the seventh, eighth, etc.
b@1481	111 % author on the opening page (as the 'third row') but we ask,
b@1481	112 % for aesthetic reasons that you place these 'additional authors'
b@1481	113 % in the \additional authors block, viz.
b@1481	114 %\additionalauthors{Additional authors: John Smith (The Th{\o}rv{\"a}ld Group,
b@1481	115 %email: {\texttt{jsmith@affiliation.org}}) and Julius P.~Kumquat
b@1481	116 %(The Kumquat Consortium, email: {\texttt{jpkumquat@consortium.net}}).}
b@1481	117 \date{1 October 2015}
b@1481	118 % Just remember to make sure that the TOTAL number of authors
b@1481	119 % is the number that will appear on the first page PLUS the
b@1481	120 % number that will appear in the \additionalauthors section.
b@1481	121
b@1481	122 \maketitle
b@1481	123 \begin{abstract}
b@1481	124 Here comes the abstract.
b@1481	125 \end{abstract}
b@1481	126
b@1481	127
b@1481	128 \section{Introduction}
b@1949	129
b@1949	130 % Listening tests/perceptual audio evaluation: what are they, why are they important
b@1949	131 % As opposed to limited scope of WAC15 paper: also musical features, realism of sound effects / sound synthesis, performance of source separation and other algorithms...
b@1949	132 Perceptual evaluation of audio, in the form of listening tests, is a powerful way to assess anything from audio codec quality over realism of sound synthesis to the performance of source separation, automated music production and
b@1949	133 In less technical areas, the framework of a listening test can be used to measure emotional response to music or test cognitive abilities. % maybe some references? If there's space.
b@1949	134
b@1949	135 % Why difficult? Challenges? What constitutes a good interface?
b@1949	136 Technical, interfaces, user friendliness, reliability
b@1949	137
b@1949	138 Note that the design of an effective listening test further poses many challenges unrelated to interface design, which are beyond the scope of this paper \cite{bech}.
b@1949	139
b@1949	140 % Why in the browser?
b@1949	141 Web Audio API has made some essential features like sample manipulation of audio streams possible \cite{schoeffler2015mushra}.
b@1481	142
b@1948	143 Situating the Web Audio Evaluation Tool between other currently available evaluation tools, ...
b@1948	144
b@1948	145 % only browser-based?
b@1948	146 \begin{table*}[htdp]
b@1948	147 \caption{Table with existing listening test platforms and their features}
b@1948	148 \begin{center}
b@1948	149 \begin{tabular}{\|*{6}{l\|}}
b@1948	150 \hline
b@1949	151 \textbf{Name} & \textbf{Ref.} & \textbf{Language} & \textbf{Interfaces} & \textbf{Remote} & \textbf{All UI} \\
b@1948	152 \hline
b@1949	153 APE & \cite{ape} & MATLAB & multiple stimulus one axis & & \\
b@1948	154 BeaqleJS & \cite{beaqlejs} & JavaScript & & not natively supported & \\
b@1949	155 HULTI-GEN & \cite{hultigen} & MAX & & & \checkmark \\
b@1949	156 \textbf{WAET} & \cite{waet} & JavaScript & \textbf{all of the above} & \checkmark & \checkmark \\
b@1948	157 \hline
b@1948	158 \end{tabular}
b@1948	159 \end{center}
b@1948	160 \label{tab:toolboxes}
b@1948	161 \end{table*}%
b@1948	162
b@1948	163 % about BeaqleJS
b@1948	164 ... However, BeaqleJS \cite{beaqlejs} does not make use of the Web Audio API, %requires programming knowledge...
b@1948	165
b@1948	166 %
b@1949	167 Selling points: remote tests, visualisaton, create your own test in the browser, many interfaces, few/no dependencies, flexibility
b@1949	168
b@1949	169 As recruiting participants can be very time-consuming, and as for some tests a large number of participants is needed, browser-based tests \cite{schoeffler2015mushra}. However, to our knowledge, no tool currently exists that allows the creation of a remotely accessible listening test. % I wonder what you can do with Amazon Mechanical Turk and the likes.
b@1949	170
b@1949	171 [Talking about what we do in the various sections of this paper. Referring to \cite{waet}. ]
b@1948	172
b@1948	173
b@1949	174 \section{Architecture} % title? 'back end'? % NICK
b@1481	175 A slightly technical overview of the system. Talk about XML, JavaScript, Web Audio API, HTML5.
b@1948	176 Describe and/or visualise audioholder-audioelement-... structure.
b@1948	177
b@1949	178 % see also SMC12 - less detail here
b@1949	179
b@1949	180 Which type of files? % WAV, anything else? Perhaps not exhaustive list, but say something along the lines of 'whatever browser supports'
b@1947	181
b@1948	182 Streaming audio? % probably not, unless it's easy
b@1948	183
b@1949	184 Compatibility? % not IE, everything else fine?
b@1949	185
b@1949	186
b@1947	187
b@1947	188
b@1948	189 \section{Remote tests} % with previous?
b@1949	190
b@1949	191 If the experimenter is willing to trade some degree of control for a higher number of participants, the test can be hosted on a web server so that subjects can take part remotely. This way, a link can be shared widely in the hope of attracting a large amount of subjects, while listening conditions and subject reliability may be less ideal. However, a sound system calibration page and a wide range of metrics logged during the test mitigate these problems. Note also that in some experiments, it may be preferred that the subject has a `real life', familiar listening set-up, for instance when perceived quality differences on everyday sound systems are investigated.
b@1949	192 Furthermore, a fully browser-based test, where the collection of the results is automatic, is more efficient and technically reliable even when the test still takes place under lab conditions.
b@1949	193
b@1947	194 The following features allow easy and effective remote testing:
b@1947	195 \begin{itemize}
b@1947	196 \item PHP script to collect result XML files
b@1947	197 \item Randomly pick specified number of audioholders
b@1949	198 \item Calibration
b@1947	199 \item Functionality to participate multiple times
b@1947	200 \begin{itemize}
b@1947	201 \item Possible to log in with unique ID (no password)
b@1947	202 \item Pick `new user' (need new, unique ID) or `already participated' (need already available ID)
b@1947	203 \item Store XML on server with IDs plus which audioholders have already been listened to
b@1947	204 \item Don't show `post-test' survey after first time
b@1947	205 \item Pick `new' audioholders if available
b@1947	206 \item Copy survey information first time to new XMLs
b@1947	207 \end{itemize}
b@1947	208 \item Intermediate saves
b@1947	209 \item Collect IP address information (privacy issues?) --> geo-related API?
b@1949	210 \item Time measurement - see before or
b@1947	211 \end{itemize}
b@1947	212
b@1481	213
b@1948	214 \section{Interfaces} % title? 'Front end'? % Dave
b@1948	215 `Build your own test'
b@1948	216
b@1949	217 Elements present to build any of the following interfaces, and many more: axes, markers, labels, anchors, references, reference signal button, stop button, comment boxes, radio buttons, checkboxes, transport/scrubber bar
b@1949	218
b@1949	219 Established tests (see below) included as `presets' in the build-your-own-test page.
b@1949	220
b@1949	221
b@1481	222 We could add more interfaces, such as:
b@1481	223 \begin{itemize}
b@1948	224 \item (APE style) \cite{ape}
b@1481	225 \item Multi attribute ratings
me@1944	226 \item MUSHRA (ITU-R BS. 1534)~\cite{recommendation20031534}
me@1944	227 \item Interval Scale~\cite{zacharov1999round}
me@1944	228 \item Rank Scale~\cite{pascoe1983evaluation}
b@1481	229
me@1944	230 \item 2D Plane rating - e.g. Valence vs. Arousal~\cite{carroll1969individual}
me@1944	231 \item Likert scale~\cite{likert1932technique}
me@1943	232
b@1948	233 \item {\bf All the following are the interfaces available in HULTI-GEN~\cite{hultigen} }
me@1944	234 \item ABC/HR (ITU-R BS. 1116)~\cite{recommendation19971116}
me@1943	235 \begin{itemize}
me@1943	236 \item Continuous Scale (5-1) Imperceptible, Perceptible but not annoying, slightly annoying, annoying, very annoying. (default Inaudible?)
me@1943	237 \end{itemize}
me@1943	238 \item -50 to 50 Bipolar with Ref
me@1943	239 \begin{itemize}
me@1943	240 \item Scale -50 to 50 on Mushra with default values as 0 in middle and a comparison ``Reference'' to compare to 0 value
me@1943	241 \end{itemize}
me@1944	242 \item Absolute Category Rating (ACR) Scale~\cite{rec1996p}
me@1943	243 \begin{itemize}
me@1943	244 \item 5 point Scale - Bad, Poor, Fair, Good, Excellent (Default fair?)
me@1943	245 \end{itemize}
me@1944	246 \item Degredation Category Rating (DCR) Scale~\cite{rec1996p}
me@1943	247 \begin{itemize}
me@1943	248 \item 5 point Scale - Inaudible, Audible but not annoying, slightly annoying, annoying, very annoying. (default Inaudible?) - {\it Basically just quantised ABC/HR?}
me@1943	249 \end{itemize}
me@1944	250 \item Comparison Category Rating (CCR) Scale~\cite{rec1996p}
me@1943	251 \begin{itemize}
me@1943	252 \item 7 point scale: Much Better, Better, Slightly Better, About the same, slightly worse, worse, much worse - Default about the same with reference to compare to
me@1943	253 \end{itemize}
me@1944	254 \item 9 Point Hedonic Category Rating Scale~\cite{peryam1952advanced}
me@1943	255 \begin{itemize}
me@1943	256 \item 9 point scale: Like Extremely, Like Very Much, Like Moderate, Like Slightly, Neither Like nor Dislike, dislike Extremely, dislike Very Much, dislike Moderate, dislike Slightly - Default Neither Like nor Dislike with reference to compare to
me@1943	257 \end{itemize}
me@1944	258 \item ITU-R 5 Point Continuous Impairment Scale~\cite{rec1997bs}
me@1943	259 \begin{itemize}
me@1943	260 \item 5 point Scale (5-1) Imperceptible, Perceptible but not annoying, slightly annoying, annoying, very annoying. (default Inaudible?)- {\it Basically just quantised ABC/HR, or Different named DCR}
me@1943	261 \end{itemize}
me@1944	262 \item Pairwise Comparison (Better/Worse)~\cite{david1963method}
me@1943	263 \begin{itemize}
me@1943	264 \item 2 point Scale - Better or Worse - (not sure how to default this - they default everything to better, which is an interesting choice)
me@1943	265 \end{itemize}
b@1481	266 \end{itemize}
b@1481	267
b@1481	268 There are also the following interfaces, which would require a slightly different `engine' underneath, e.g. loading a different page for every possible pair.
b@1481	269 \begin{itemize}
me@1944	270 \item AB Test~\cite{lipshitz1981great}
me@1944	271 \item ABX Test~\cite{clark1982high}
b@1481	272 \item JND
b@1481	273 \end{itemize}
b@1481	274
b@1481	275 A screenshot would be nice.
b@1481	276
b@1481	277 \section{Analysis and diagnostics}
b@1949	278 % don't mention Python scripts
b@1481	279 It would be great to have easy-to-use analysis tools to visualise the collected data and even do science with it. Even better would be to have all this in the browser. Complete perfection would be achieved if and when only limited setup, installation time, and expertise are required for the average non-CS researcher to use this.
b@1481	280
b@1945	281 The following could be nice:
b@1945	282
b@1945	283 \begin{itemize}
b@1945	284 \item Web page showing all audioholder IDs, file names, subject IDs, audio element IDs, ... in the collected XMLs so far (\texttt{saves/*.xml})
b@1945	285 \item Check/uncheck each of the above for analysis (e.g. zoom in on a certain song, or exclude a subset of subjects)
b@1945	286 \item Click a mix to hear it (follow path in XML setup file, which is also embedded in the XML result file)
b@1945	287 \item Box plot, confidence plot, scatter plot of values (for a given audioholder)
b@1945	288 \item Timeline for a specific subject (see Python scripts), perhaps re-playing the experiment in X times realtime. (If actual realtime, you could replay the audio...)
b@1945	289 \item Distribution plots of any radio button and number questions (drop-down menu with `pretest', `posttest', ...; then drop-down menu with question `IDs' like `gender', `age', ...; make pie chart/histogram of these values over selected range of XMLs)
b@1945	290 \item All `comments' on a specific audioelement
b@1945	291 \item A `download' button for a nice CSV of various things (values, survey responses, comments) people might want to use for analysis, e.g. when XML scares them
b@1947	292 \item Validation of setup XMLs (easily spot `errors', like duplicate IDs or URLs, missing/dangling tags, ...)
b@1945	293 \end{itemize}
b@1945	294
b@1945	295 A subset of the above would already be nice for this paper.
b@1945	296
b@1481	297 Some pictures here please.
b@1481	298
b@1948	299 \section{Concluding remarks and future work}
b@1481	300
b@1949	301 The code and documentation can be pulled or downloaded from \url{code.soundsoftware.ac.uk/projects/webaudioevaluationtool}.
b@1481	302
b@1949	303 [Talking a little bit about what else might happen. Unless we really want to wrap this up. ]
b@1948	304
b@1949	305 Use \cite{schoeffler2015mushra} as a `checklist', even though it only considers subjective evaluation of audio systems (and focuses on the requirements for a MUSHRA test).
b@1949	306 % remote
b@1949	307 % language support (not explicitly stated)
b@1949	308 % crossfades
b@1949	309 % choosing speakers/sound device from within browser?
b@1949	310 % collect information about software and sound system
b@1949	311 % buttons, scales, ... UI elements
b@1949	312 % must be able to load uncompressed PCM
b@1949	313
b@1949	314 [What can we not do? `Method of adjustment', as in \cite{schoeffler2015mushra} is another can of worms, because, like, you could adjust lots of things (volume is just one of them, that could be done quite easily). Same for using input signals like the participant's voice. Either leave out, or mention this requires modification of the code we provide.]
b@1481	315
b@1481	316 %
b@1481	317 % The following two commands are all you need in the
b@1481	318 % initial runs of your .tex file to
b@1481	319 % produce the bibliography for the citations in your paper.
b@1481	320 \bibliographystyle{abbrv}
b@1481	321 \bibliography{WAC2016} % sigproc.bib is the name of the Bibliography in this case
b@1481	322 % You must have a proper ".bib" file
b@1481	323 % and remember to run:
b@1481	324 % latex bibtex latex latex
b@1481	325 % to resolve all references
b@1481	326 %
b@1481	327 % ACM needs 'a single self-contained file'!
b@1481	328 %
b@1481	329 \end{document}

Mercurial > hg > webaudioevaluationtool

annotate docs/WAC2016/WAC2016.tex @ 1949:162d571a71d0