changeset 729:6a7f6a58bf11

Add interfaces section to WAC paper
author Dave <djmoffat@users.noreply.github.com>
date Mon, 12 Oct 2015 12:22:42 +0100
parents 28b385057ded
children a48b6a2c7489
files docs/WAC2016/WAC2016.pdf docs/WAC2016/WAC2016.tex
diffstat 2 files changed, 118 insertions(+), 58 deletions(-) [+]
line wrap: on
line diff
Binary file docs/WAC2016/WAC2016.pdf has changed
--- a/docs/WAC2016/WAC2016.tex	Mon Oct 12 11:03:57 2015 +0100
+++ b/docs/WAC2016/WAC2016.tex	Mon Oct 12 12:22:42 2015 +0100
@@ -1,6 +1,7 @@
 \documentclass{sig-alternate}
 \usepackage{hyperref}	% make links (like references, links to Sections, ...) clickable
 \usepackage{enumitem}	% tighten itemize etc by appending '[noitemsep,nolistsep]'
+\usepackage{cleveref}
 
 \begin{document}
 
@@ -146,7 +147,7 @@
 	Situating the Web Audio Evaluation Tool between other currently available evaluation tools, ...
 	
 	% only browser-based? 
-	\begin{table*}[htdp]
+	\begin{table*}[ht]
         \caption{Table with existing listening test platforms and their features}
         \begin{center}
         	\begin{tabular}{|*{6}{l|}}
@@ -241,67 +242,126 @@
 
 	
 \section{Interfaces} % title? 'Front end'? % Dave
+
+The purpose of this listening test framework is to allow any user the maximum flexibility to design a listening test for their exact application with minimum effort. To this end, a large range of standard listening test interfaces have been implemented. A review of existing listening test frameworks was undertaken and presented in~\Cref{tab:toolboxes}. HULTI-GEN~\cite{hultigen} is a single toolbox that presents the user with a large number of different test interfaces and allows for customisation of each test interface.
+
+To provide users with a flexible system, a large range of `standard' listening test interfaces have been implemented, including:
+	\begin{itemize}[noitemsep,nolistsep]
+		\item MUSHRA (ITU-R BS. 1534)~\cite{recommendation20031534}
+		\begin{itemize}
+			\item Multiple stimuli are presented and rated on a continuous scale, which includes a reference, hidden reference and hidden anchors.
+		\end{itemize}
+		\item Rank Scale~\cite{pascoe1983evaluation}
+		\begin{itemize}
+			\item Stimuli ranked on single horizontal scale, where they are ordered in preference order.
+		\end{itemize}
+		\item Likert scale~\cite{likert1932technique}
+		\begin{itemize}
+			\item Each stimuli has a five point scale with values: Strongly Agree, Agree, Neutral, Disagree and Strongly Disagree.
+		\end{itemize}
+		\item ABC/HR (ITU-R BS. 1116)~\cite{recommendation19971116} (Mean Opinion Score: MOS)
+		\begin{itemize}
+			\item Each stimulus has a continuous scale (5-1), labeled as Imperceptible, Perceptible but not annoying, slightly annoying, annoying, very annoying.
+		\end{itemize}
+		\item -50 to 50 Bipolar with Ref
+		\begin{itemize}
+			\item Each stimulus has a continuous scale -50 to 50 with default values as 0 in middle and a comparison. There is also a provided reference		\end{itemize}
+		\item Absolute Category Rating (ACR) Scale~\cite{rec1996p}
+		\begin{itemize}
+			\item Each stimuli has a five point scale with values: Bad, Poor, Fair, Good, Excellent
+		\end{itemize}
+		\item Degredation Category Rating (DCR) Scale~\cite{rec1996p}
+		\begin{itemize}
+			\item Each stimuli has a five point scale with values: (5) Inaudible, (4) Audible but not annoying, (3) slightly annoying, (2) annoying, (1) very annoying.
+		\end{itemize}
+		\item Comparison Category Rating (CCR) Scale~\cite{rec1996p}
+		\begin{itemize}
+			\item Each stimuli has a seven point scale with values: Much Better, Better, Slightly Better, About the same, slightly worse, worse, much worse. There is also a provided reference.
+		\end{itemize}
+		\item 9 Point Hedonic Category Rating Scale~\cite{peryam1952advanced}
+		\begin{itemize}
+			\item Each stimuli has a seven point scale with values: Like Extremely, Like Very Much, Like Moderate, Like Slightly, Neither Like nor Dislike, dislike Extremely, dislike Very Much, dislike Moderate, dislike Slightly. There is also a provided reference.
+			\end{itemize}
+		\item ITU-R 5 Point Continuous Impairment Scale~\cite{rec1997bs}
+		\begin{itemize}
+			\item Each stimuli has a five point scale with values: (5) Imperceptible, (4) Perceptible but not annoying, (3) slightly annoying, (2) annoying, (1) very annoying. There is also a provided reference.
+		\end{itemize}
+		\item Pairwise Comparison (Better/Worse)~\cite{david1963method}
+		\begin{itemize}
+			\item A reference is provided and ever stimulus is rated as being either better or worse than the reference.
+		\end{itemize}
+		\item APE style \cite{ape}
+		\begin{itemize}
+			\item Multiple stimuli on a single horizontal slider for inter-sample rating.
+		\end{itemize}
+		\item Multi attribute ratings
+		\begin{itemize}
+			\item Multiple stimuli as points on a 2D plane for inter-sample rating (eg. Valence Arousal)
+		\end{itemize}
+		\item AB Test~\cite{lipshitz1981great}
+		\begin{itemize}
+			\item Two stimuli are presented at a time and the participant has to select a preferred stimulus.
+		\end{itemize}
+		\item ABX Test~\cite{clark1982high}
+		\begin{itemize}
+			\item Two stimuli are presented along with a reference and the participant has to select a preferred stimulus, often the closest to the reference.
+		\end{itemize}
+	\end{itemize}
+	
+	While implementing all of these interfaces, it is possible to include any number of references, anchors, hidden references and hidden anchors into all of these listening test formats.
+	
+%%%%	\begin{itemize}[noitemsep,nolistsep]
+%%%%		\item (APE style) \cite{ape}
+%%%%		\item Multi attribute ratings
+%%%%		\item MUSHRA (ITU-R BS. 1534)~\cite{recommendation20031534}
+%%%%		\item Interval Scale~\cite{zacharov1999round}
+%%%%		\item Rank Scale~\cite{pascoe1983evaluation}
+%%%%		
+%%%%		\item 2D Plane rating - e.g. Valence vs. Arousal~\cite{carroll1969individual}
+%%%%		\item Likert scale~\cite{likert1932technique}
+%%%%		
+%%%%		\item {\bf All the following are the interfaces available in HULTI-GEN~\cite{hultigen} }
+%%%%		\item ABC/HR (ITU-R BS. 1116)~\cite{recommendation19971116}
+%%%%		\begin{itemize}
+%%%%			\item Continuous Scale (5-1) Imperceptible, Perceptible but not annoying, slightly annoying, annoying, very annoying. (default Inaudible?)
+%%%%		\end{itemize}
+%%%%		\item -50 to 50 Bipolar with Ref
+%%%%		\begin{itemize}
+%%%%			\item Scale -50 to 50 on Mushra with default values as 0 in middle and a comparison ``Reference'' to compare to 0 value
+%%%%		\end{itemize}
+%%%%		\item Absolute Category Rating (ACR) Scale~\cite{rec1996p}
+%%%%		\begin{itemize}
+%%%%			\item 5 point Scale - Bad, Poor, Fair, Good, Excellent (Default fair?)
+%%%%		\end{itemize}
+%%%%		\item Degredation Category Rating (DCR) Scale~\cite{rec1996p}
+%%%%		\begin{itemize}
+%%%%			\item 5 point Scale - Inaudible, Audible but not annoying, slightly annoying, annoying, very annoying. (default Inaudible?) - {\it Basically just quantised ABC/HR?}
+%%%%		\end{itemize}
+%%%%		\item Comparison Category Rating (CCR) Scale~\cite{rec1996p}
+%%%%		\begin{itemize}
+%%%%			\item 7 point scale: Much Better, Better, Slightly Better, About the same, slightly worse, worse, much worse - Default about the same with reference to compare to
+%%%%		\end{itemize}
+%%%%		\item 9 Point Hedonic Category Rating Scale~\cite{peryam1952advanced}
+%%%%		\begin{itemize}
+%%%%			\item 9 point scale: Like Extremely, Like Very Much, Like Moderate, Like Slightly, Neither Like nor Dislike, dislike Extremely, dislike Very Much, dislike Moderate, dislike Slightly  - Default Neither Like nor Dislike with reference to compare to
+%%%%		\end{itemize}
+%%%%		\item ITU-R 5 Point Continuous Impairment Scale~\cite{rec1997bs}
+%%%%		\begin{itemize}
+%%%%			\item 5 point Scale (5-1) Imperceptible, Perceptible but not annoying, slightly annoying, annoying, very annoying. (default Inaudible?)- {\it Basically just quantised ABC/HR, or Different named DCR}
+%%%%		\end{itemize}
+%%%%		\item Pairwise Comparison (Better/Worse)~\cite{david1963method}
+%%%%		\begin{itemize}
+%%%%			\item 2 point Scale - Better or Worse - (not sure how to default this - they default everything to better, which is an interesting choice)
+%%%%		\end{itemize}
+%%%%	\end{itemize}
+	
+{	\bf A screenshot would be nice. 
+
 	`Build your own test'
 
 	Elements present to build any of the following interfaces, and many more: axes, markers, labels, anchors, references, reference signal button, stop button, comment boxes, radio buttons, checkboxes, transport/scrubber bar
 
-	Established tests (see below) included as `presets' in the build-your-own-test page. 
-
-
-	We could add more interfaces, such as: 
-	\begin{itemize}[noitemsep,nolistsep]
-		\item (APE style) \cite{ape}
-		\item Multi attribute ratings
-		\item MUSHRA (ITU-R BS. 1534)~\cite{recommendation20031534}
-		\item Interval Scale~\cite{zacharov1999round}
-		\item Rank Scale~\cite{pascoe1983evaluation}
-		
-		\item 2D Plane rating - e.g. Valence vs. Arousal~\cite{carroll1969individual}
-		\item Likert scale~\cite{likert1932technique}
-		
-		\item {\bf All the following are the interfaces available in HULTI-GEN~\cite{hultigen} }
-		\item ABC/HR (ITU-R BS. 1116)~\cite{recommendation19971116}
-		\begin{itemize}
-			\item Continuous Scale (5-1) Imperceptible, Perceptible but not annoying, slightly annoying, annoying, very annoying. (default Inaudible?)
-		\end{itemize}
-		\item -50 to 50 Bipolar with Ref
-		\begin{itemize}
-			\item Scale -50 to 50 on Mushra with default values as 0 in middle and a comparison ``Reference'' to compare to 0 value
-		\end{itemize}
-		\item Absolute Category Rating (ACR) Scale~\cite{rec1996p}
-		\begin{itemize}
-			\item 5 point Scale - Bad, Poor, Fair, Good, Excellent (Default fair?)
-		\end{itemize}
-		\item Degredation Category Rating (DCR) Scale~\cite{rec1996p}
-		\begin{itemize}
-			\item 5 point Scale - Inaudible, Audible but not annoying, slightly annoying, annoying, very annoying. (default Inaudible?) - {\it Basically just quantised ABC/HR?}
-		\end{itemize}
-		\item Comparison Category Rating (CCR) Scale~\cite{rec1996p}
-		\begin{itemize}
-			\item 7 point scale: Much Better, Better, Slightly Better, About the same, slightly worse, worse, much worse - Default about the same with reference to compare to
-		\end{itemize}
-		\item 9 Point Hedonic Category Rating Scale~\cite{peryam1952advanced}
-		\begin{itemize}
-			\item 9 point scale: Like Extremely, Like Very Much, Like Moderate, Like Slightly, Neither Like nor Dislike, dislike Extremely, dislike Very Much, dislike Moderate, dislike Slightly  - Default Neither Like nor Dislike with reference to compare to
-		\end{itemize}
-		\item ITU-R 5 Point Continuous Impairment Scale~\cite{rec1997bs}
-		\begin{itemize}
-			\item 5 point Scale (5-1) Imperceptible, Perceptible but not annoying, slightly annoying, annoying, very annoying. (default Inaudible?)- {\it Basically just quantised ABC/HR, or Different named DCR}
-		\end{itemize}
-		\item Pairwise Comparison (Better/Worse)~\cite{david1963method}
-		\begin{itemize}
-			\item 2 point Scale - Better or Worse - (not sure how to default this - they default everything to better, which is an interesting choice)
-		\end{itemize}
-	\end{itemize}
-	
-	There are also the following interfaces, which would require a slightly different `engine' underneath, e.g. loading a different page for every possible pair. 
-	\begin{itemize}[noitemsep,nolistsep]
-		\item AB Test~\cite{lipshitz1981great}
-		\item ABX Test~\cite{clark1982high}
-		\item JND
-	\end{itemize}
-	
-	A screenshot would be nice. 
+	Established tests (see below) included as `presets' in the build-your-own-test page. }
 
 \section{Analysis and diagnostics}
 	% don't mention Python scripts