mtridoc: smc2013_cameraReady/smc2013.tex annotate

annotate smc2013_cameraReady/smc2013.tex @ 58:a63c438b3f65 tip

Squeezed it into the 6 page limit

author	Henrik Ekeus <hekeus@eecs.qmul.ac.uk>
date	Tue, 11 Jun 2013 15:17:21 +0100
parents	64631f786097
children

rev	line source
hekeus@55	1 % -----------------------------------------------
hekeus@55	2 % Template for SMAC SMC 2013
hekeus@55	3 % adapted and corrected from the template for SMC 2012, which was adapted from that of SMC 2011
hekeus@55	4 % -----------------------------------------------
hekeus@55	5
hekeus@55	6 \documentclass{article}
hekeus@55	7 \usepackage{smacsmc2013}
hekeus@55	8 \usepackage{times}
hekeus@55	9 \usepackage{ifpdf}
hekeus@55	10 \usepackage[english]{babel}
hekeus@55	11 \usepackage{cite}
hekeus@55	12 \usepackage{caption}
hekeus@55	13 \usepackage{subcaption}
hekeus@55	14
hekeus@55	15 %%%%%%%%%%%%%%%%%%%%%%%% Some useful packages %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
hekeus@55	16 %%%%%%%%%%%%%%%%%%%%%%%% See related documentation %%%%%%%%%%%%%%%%%%%%%%%%%%
hekeus@55	17 %\usepackage{amsmath} % popular packages from Am. Math. Soc. Please use the
hekeus@55	18 %\usepackage{amssymb} % related math environments (split, subequation, cases,
hekeus@55	19 %\usepackage{amsfonts}% multline, etc.)
hekeus@55	20 %\usepackage{bm} % Bold Math package, defines the command \bf{}
hekeus@55	21 %\usepackage{paralist}% extended list environments
hekeus@55	22 %%subfig.sty is the modern replacement for subfigure.sty. However, subfig.sty
hekeus@55	23 %%requires and automatically loads caption.sty which overrides class handling
hekeus@55	24 %%of captions. To prevent this problem, preload caption.sty with caption=false
hekeus@55	25 %\usepackage[caption=false]{caption}
hekeus@55	26 %\usepackage[font=footnotesize]{subfig}
hekeus@55	27
hekeus@55	28
hekeus@55	29
hekeus@55	30 %allowing the users to undertake an engaging form of citizen science
hekeus@55	31 %Mobile apps provide a new and unique opportunity for collecting data Ôin the wildÕ. This allows for a greater reach, and the ability to collect greater amounts of data than would be possible in traditional experimental scenarios.
hekeus@55	32 %they generally serve to combine the input from many individuals to find commonalities and trends; eliciting a kind of Ôwisdom of the crowdsÕ.
hekeus@55	33 % Such ÔcrowdsourcedÕ data collection is an increasingly common methodology in the research community, and can provide insights that would not be possible to achieve in any other way. For example in the creative sector, the evaluation of broad classification of musical aesthetics requires many individuals to make personal choices to produce robust patterns of data.
hekeus@55	34 %
hekeus@55	35
hekeus@55	36
hekeus@55	37 %user defined variables
hekeus@55	38 \def\papertitle{How Predictable Do We Like Our Music? Eliciting Aesthetic Preferences With The Melody Triangle Mobile App }
hekeus@55	39 \def\firstauthor{Henrik Ekeus}
hekeus@55	40 \def\secondauthor{Samer A. Abdallah}
hekeus@55	41 \def\thirdauthor{Peter W. McOwan}
hekeus@55	42 \def\fourthauthor{Mark D. Plumbley}
hekeus@55	43
hekeus@55	44 % adds the automatic
hekeus@55	45 % Saves a lot of ouptut space in PDF... after conversion with the distiller
hekeus@55	46 % Delete if you cannot get PS fonts working on your system.
hekeus@55	47
hekeus@55	48 % pdf-tex settings: detect automatically if run by latex or pdflatex
hekeus@55	49 \newif\ifpdf
hekeus@55	50 \ifx\pdfoutput\relax
hekeus@55	51 \else
hekeus@55	52 \ifcase\pdfoutput
hekeus@55	53 \pdffalse
hekeus@55	54 \else
hekeus@55	55 \pdftrue
hekeus@55	56 \fi
hekeus@55	57
hekeus@55	58 \ifpdf % compiling with pdflatex
hekeus@55	59 \usepackage[pdftex,
hekeus@55	60 pdftitle={\papertitle},
hekeus@55	61 pdfauthor={\firstauthor, \secondauthor, \thirdauthor},
hekeus@55	62 bookmarksnumbered, % use section numbers with bookmarks
hekeus@55	63 pdfstartview=XYZ % start with zoom=100% instead of full screen;
hekeus@55	64 % especially useful if working with a big screen :-)
hekeus@55	65 ]{hyperref}
hekeus@55	66 %\pdfcompresslevel=9
hekeus@55	67
hekeus@55	68 \usepackage[pdftex]{graphicx}
hekeus@55	69 % declare the path(s) where your graphic files are and their extensions so
hekeus@55	70 %you won't have to specify these with every instance of \includegraphics
hekeus@55	71 \graphicspath{{./figures/}}
hekeus@55	72 \DeclareGraphicsExtensions{.pdf,.jpeg,.png}
hekeus@55	73
hekeus@55	74 \usepackage[figure,table]{hypcap}
hekeus@55	75
hekeus@55	76 \else % compiling with latex
hekeus@55	77 \usepackage[dvips,
hekeus@55	78 bookmarksnumbered, % use section numbers with bookmarks
hekeus@55	79 pdfstartview=XYZ % start with zoom=100% instead of full screen
hekeus@55	80 ]{hyperref} % hyperrefs are active in the pdf file after conversion
hekeus@55	81
hekeus@55	82 \usepackage[dvips]{epsfig,graphicx}
hekeus@55	83 % declare the path(s) where your graphic files are and their extensions so
hekeus@55	84 %you won't have to specify these with every instance of \includegraphics
hekeus@55	85 \graphicspath{{./figures/}}
hekeus@55	86 \DeclareGraphicsExtensions{.eps}
hekeus@55	87
hekeus@55	88 \usepackage[figure,table]{hypcap}
hekeus@55	89 \fi
hekeus@55	90
hekeus@55	91 %setup the hyperref package - make the links black without a surrounding frame
hekeus@55	92 \hypersetup{
hekeus@55	93 colorlinks,%
hekeus@55	94 citecolor=black,%
hekeus@55	95 filecolor=black,%
hekeus@55	96 linkcolor=black,%
hekeus@55	97 urlcolor=black
hekeus@55	98 }
hekeus@55	99
hekeus@55	100 \usepackage{tools}
hekeus@55	101 \newcommand{\colfig}[2][1]{\includegraphics[width=#1\linewidth]{figs/#2}}
hekeus@55	102 \newcommand\past[1]{\overset{\rule{0pt}{0.2em}\smash{\leftarrow}}{#1}}
hekeus@55	103 \newcommand\fut[1]{\overset{\rule{0pt}{0.1em}\smash{\rightarrow}}{#1}}
hekeus@55	104
hekeus@55	105 % Title.
hekeus@55	106 % ------
hekeus@55	107 \title{\papertitle}
hekeus@55	108
hekeus@55	109 % Authors
hekeus@55	110 % Please note that submissions are NOT anonymous, therefore
hekeus@55	111 % authors' names have to be VISIBLE in your manuscript.
hekeus@55	112 %
hekeus@55	113 % Single address
hekeus@55	114 % To use with only one author or several with the same address
hekeus@55	115 % ---------------
hekeus@55	116 \oneauthor
hekeus@55	117 {\firstauthor \textsuperscript{1}, \secondauthor \textsuperscript{2}, \thirdauthor \textsuperscript{1}, \fourthauthor \textsuperscript{1}} {\textsuperscript{1}Centre for Digital Music, Queen Mary University of London \\\textsuperscript{2}Department of Computer Science, University College London\\%
hekeus@56	118 {\tt \href{mailto:hekeus@eecs.qmul.ac.uk,peter.mcowan@eecs.qmul.ac.uk,mark.plumbley@eecs.qmul.ac.uk,s.abdallah@ucl.ac.uk}{\{hekeus,peter.mcowan,mark.plumbley\}@eecs.qmul.ac.uk}}\\
hekeus@56	119 {\tt \href{mailto:hekeus@eecs.qmul.ac.uk,peter.mcowan@eecs.qmul.ac.uk,mark.plumbley@eecs.qmul.ac.uk,s.abdallah@ucl.ac.uk}{s.abdallah@ucl.ac.uk}}}
hekeus@55	120
hekeus@55	121 %Two addresses
hekeus@55	122 %--------------
hekeus@55	123 % \twoauthors
hekeus@55	124 % {\firstauthor} {Affiliation1 \\ %
hekeus@55	125 % {\tt \href{mailto:author1@smcnetwork.org}{author1@smcnetwork.org}}}
hekeus@55	126 % {\secondauthor} {Affiliation2 \\ %
hekeus@55	127 % {\tt \href{mailto:author2@smcnetwork.org}{author2@smcnetwork.org}}}
hekeus@55	128
hekeus@55	129 % Three addresses
hekeus@55	130 % --------------
hekeus@55	131 % \threeauthors
hekeus@55	132 % {\firstauthor} {Affiliation1 \\ %
hekeus@55	133 % {\tt \href{mailto:author1@smcnetwork.org}{author1@smcnetwork.org}}}
hekeus@55	134 % {\secondauthor} {Affiliation2 \\ %
hekeus@55	135 % {\tt \href{mailto:author2@smcnetwork.org}{author2@smcnetwork.org}}}
hekeus@55	136 % {\thirdauthor} { Affiliation3 \\ %
hekeus@55	137 % {\tt \href{mailto:author3@smcnetwork.org}{author3@smcnetwork.org}}}
hekeus@55	138
hekeus@55	139
hekeus@55	140 % *************************************** the document starts here *************
hekeus@55	141 \begin{document}
hekeus@55	142 %
hekeus@55	143 \capstartfalse
hekeus@55	144 \maketitle
hekeus@55	145 \capstarttrue
hekeus@55	146 %
hekeus@55	147 \begin{abstract}
hekeus@55	148 The Melody Triangle is a smartphone application for Android that lets users easily create musical patterns and textures.
hekeus@55	149 %The user creates melodies by specifying positions within a triangle, and these positions directly map to the information theoretic properties of the generated output sequences.
hekeus@55	150 The user creates melodies by specifying positions within a triangle, and these positions correspond to the information theoretic properties of generated musical sequences.
hekeus@55	151 A model of human expectation and surprise in the perception of music, \emph{information dynamics}, is used to `map out' a musical generative system's parameter space, in this case Markov chains. This enables a user to explore the possibilities afforded by Markov chains, not by directly selecting their parameters, but by specifying the subjective \emph{predictability} of the output sequence.
hekeus@55	152 As users of the app find melodies and patterns they like, they are encouraged to press a `like' button, where their setting are uploaded to our servers for analysis. Collecting the `liked' settings of many users worldwide will allow us to elicit trends and commonalities in aesthetic preferences across users of the app, and to investigate how these might relate to the information-dynamic model of human expectation and surprise.
hekeus@55	153 We outline some of the relevant ideas from information dynamics and how the Melody Triangle is defined in terms of these. We then describe the Melody Triangle mobile application, how it is being used to collect research data and how the collected data will be evaluated.
hekeus@55	154 \end{abstract}
hekeus@55	155 %
hekeus@55	156
hekeus@55	157 \section{Introduction}\label{sec:introduction}
hekeus@55	158 The use of generative stochastic processes in music composition has been widespread for
hekeus@55	159 decades---for instance Iannis Xenakis applied probabilistic mathematical models
hekeus@55	160 to the creation of musical materials\cite{Xenakis:1992ul}. However it can sometimes be difficult for a composer to find desirable parameters and navigate the possibilities of a generative algorithm intuitively.
hekeus@55	161
hekeus@56	162 \sloppy The Melody Triangle is an interface for the discovery of melodic content where the parameter space of a stochastic generative musical process, the Markov chain, is `mapped out' according to the \emph{predictability} of the output. The Melody Triangle was developed in the context of \emph{information dynamics}\cite{Abdallah:2009p4089}; an information theoretic approach to modelling human expectation and surprise in the perception of music.
hekeus@55	163 Users of the Melody Triangle do not select the parameters to generative processes directly, rather they provide input in the form of a position within a triangle, and this maps to the information theoretic properties of an output melody.
hekeus@55	164 For instance one corner of the triangle returns completely random melodies, while an other area yields entirely predictable and periodic patterns, the entirety of the triangle covering a spectrum of predictability of the output melodies.
hekeus@55	165
hekeus@56	166 \fussy In section \ref{s:Intro} we review the concepts and ideas behind information dynamics, and outline the information measures that lead to the development of the Melody Triangle, which have been described in greater detail in our previous work \cite{Abdallah:2009p4089}. In section \ref{makingthetriangle} we describe how these information measures are used to construct the Melody Triangle, and how the triangular interface is used to retrieve patterns of symbols that are then mapped to notes or percussive sounds. The Melody Triangle has in previous work been implemented as an interactive installation and as a desktop application, these implementations are described and evaluated in \cite{ekeusmt1}.
hekeus@55	167 In section \ref{theapp} we describe the Melody Triangle mobile app for Android, which is the main contribution of this paper. We outline its features, how it allows users to share their settings with each other, and how it is currently being used to collect data for research. We then describe how the collected data will be interpreted to identify trends and commonalities in aesthetic preferences across users of the app, and to determine if parallels between these preferences and the information dynamics models can be made.
hekeus@55	168
hekeus@55	169 \section{Information Dynamics}
hekeus@55	170 \label{s:Intro}
hekeus@55	171 The relationship between
hekeus@55	172 Shannon's \cite{Shannon48} information theory and music and art in general has been the
hekeus@55	173 subject of some interest since the 1950s
hekeus@55	174 \cite{Youngblood58,CoonsKraehenbuehl1958,Moles66,Meyer67,Cohen1962}.
hekeus@55	175 The general thesis is that perceptible qualities and subjective states
hekeus@55	176 like uncertainty, surprise, complexity, tension, and interestingness
hekeus@55	177 are closely related to information-theoretic quantities like
hekeus@55	178 entropy, relative entropy, and mutual information.
hekeus@55	179
hekeus@55	180 Music is an inherently dynamic process. %The idea that the musical experience is strongly shaped by the generation
hekeus@55	181 %and playing out of strong and weak expectations was put forward by, amongst others,
hekeus@55	182 %music theorists L. B. Meyer \cite{Meyer:1967} and Narmour \cite{Narmour:1977}.
hekeus@55	183 %Music theorists L.B. Meyer \cite{Meyer:1967} and Narmour \cite{Narmour:1977}
hekeus@55	184 An essential aspect of this is that music is experienced as a phenomenon
hekeus@55	185 that unfolds in time, rather than being apprehended as a static object
hekeus@56	186 presented in its entirety. Meyer \cite{Meyer67} and Narmour \cite{Narmour77} argued that the experience depends
hekeus@55	187 on how we change and revise our conceptions \emph{as events happen}, on
hekeus@55	188 how expectation and prediction interact with occurrence, and that, to a
hekeus@55	189 large degree, the way to understand the effect of music is to focus on
hekeus@55	190 this `kinetics' of expectation and surprise.
hekeus@55	191
hekeus@55	192 Prediction and expectation are essentially probabilistic concepts
hekeus@55	193 and can be treated mathematically using probability theory.
hekeus@55	194 We suppose that when we listen to music, expectations are created on the basis
hekeus@55	195 of our familiarity with various styles of music and our ability to
hekeus@55	196 detect and learn statistical regularities in the music as they emerge.
hekeus@55	197 There is experimental evidence that human listeners are able to internalise
hekeus@56	198 statistical knowledge about musical structure
hekeus@55	199 \cite{SaffranJohnsonAslin1999}, and also
hekeus@55	200 that statistical models can form an effective basis for computational
hekeus@56	201 analysis of music
hekeus@55	202 \cite{ConklinWitten95,PonsfordWigginsMellish1999,Pearce2005}.
hekeus@55	203
hekeus@55	204 Information dynamics considers several different kinds of predictability in musical patterns, how these might be quantified using the tools of information theory,
hekeus@55	205 and how they shape or affect the listening experience. Our working hypothesis is that listeners maintain a dynamically evolving probabilistic belief state that enables them to make predictions about how a piece of music will continue.
hekeus@55	206
hekeus@55	207 They do this using both the immediate context of the piece as well as using previous musical experience, such as a familiarity with musical styles and conventions. As the music unfolds, listeners continually revise this belief state, which includes predictive
hekeus@55	208 distributions over possible future events. These changes in probabilistic beliefs can be associated with
hekeus@55	209 quantities of information; these are the focus of information dynamics.
hekeus@55	210
hekeus@55	211 In this next section we briefly describe the information measures that we use to define the Melody Triangle, however a more complete overview of information dynamics and some of its applications can be found in \cite{Abdallah:2009p4089} and \cite{CIP}.
hekeus@55	212
hekeus@55	213 \subsection{Sequential Information Measures}\label{sec:Sequential_Information_Measures}
hekeus@55	214
hekeus@55	215 Consider a sequence of symbols from the viewpoint of an observer at a certain time, and split the
hekeus@55	216 sequence into a single symbol in the \emph{present} ($X_t$), an infinite \emph{past} ($\past{X}_t$) and the
hekeus@55	217 infinite \emph{future} ($\fut{X}_t$). The symbols arrive at a constant, uniform rate.
hekeus@55	218
hekeus@55	219 The \emph{entropy rate} of a random process is a well-known, basic measure of its randomness or
hekeus@55	220 unpredictablity. The entropy rate is the entropy, \emph{H}, of the \emph{present} given the \emph{past}:
hekeus@55	221 \begin{equation}
hekeus@55	222 \label{eq:entro-rate}
hekeus@55	223 h_\mu = H(X_t\|\past{X}_t).
hekeus@55	224 \end{equation}
hekeus@56	225 \sloppy that is, it represents our average uncertainty about the present symbol \emph{given}
hekeus@55	226 that we have observed everything before it. Processes with zero entropy rate can
hekeus@55	227 be predicted perfectly given enough of the preceding context.
hekeus@55	228
hekeus@56	229 \fussy The \emph{multi-information rate} $\rho_\mu$ \cite{Dubnov2004}
hekeus@55	230 is the mutual
hekeus@55	231 information, \emph{I}, between the `past' and the `present':
hekeus@55	232 \begin{equation}
hekeus@55	233 \label{eq:multi-info}
hekeus@55	234 \rho_\mu = I(\past{X}_t;X_t) = H(X_t) - H(X_t\|\past{X}_t).
hekeus@55	235 \end{equation}
hekeus@55	236
hekeus@55	237 Multi-information rate can be thought of as measures of \emph{redundancy}, quantifying the extent to which the same information is to be found in all parts of the sequence.
hekeus@55	238 It is a measure of how much the predictability of the process depends on knowing the
hekeus@55	239 preceding context. It is the difference between the entropy of a single element of the
hekeus@55	240 sequence in isolation (imagine choosing a note from a musical score at random with your
hekeus@55	241 eyes closed and then trying to guess the note) and its entropy after taking into account
hekeus@55	242 the preceding context:
hekeus@55	243 If the previous symbols reduce our uncertainty about the present symbol a great deal, then
hekeus@55	244 the redundancy is high. For example, if we know that a sequence consists of a repeating
hekeus@56	245 cycle such as \ldots b, c, d, a, b, c, d, a \ldots, but we do not know which was the first
hekeus@55	246 symbol, then the redundancy is high, as $H(X_t)$ is high (because we
hekeus@55	247 have no idea about the present symbol in isolation), but $H(X_t\|\past{X}_t)$
hekeus@55	248 is zero, because knowing the previous symbol immediately tells us what the present symbol is.
hekeus@55	249
hekeus@55	250 The \emph{predictive information rate} (PIR) \cite{Abdallah:2009p4089} brings in our uncertainty about the future. It is a
hekeus@55	251 measure of how much each symbol reduces our uncertainty about the future as it is
hekeus@55	252 observed, \emph{given} that we have observed the past:
hekeus@55	253 \begin{equation}
hekeus@55	254 \label{eq:PIR}
hekeus@55	255 b_\mu = I(X_t;\fut{X}_t\|\past{X}_t) = H(\fut{X}_t\|\past{X}_t) - H(\fut{X}_t\|X_t,\past{X}_t).
hekeus@55	256 \end{equation}
hekeus@56	257 \sloppy It is a measure of the mutual information between the `present' and the `future' given the `past'. In other words, it is a measure of the \emph{new} information in each symbol.
hekeus@55	258
hekeus@56	259 \fussy The behaviour of the predictive information rate make it interesting from a compositional point of view. The definition
hekeus@55	260 of the PIR is such that it is low both for extremely regular processes, such as constant
hekeus@55	261 or periodic sequences, \emph{and} low for extremely random processes, where each symbol
hekeus@55	262 is chosen independently of the others, in a kind of `white noise'. In the former case,
hekeus@55	263 the pattern, once established, is completely predictable and therefore there is no
hekeus@55	264 \emph{new} information in subsequent observations. In the latter case, the randomness
hekeus@55	265 and independence of all elements of the sequence means that, though potentially surprising,
hekeus@55	266 each observation carries no information about the ones to come.
hekeus@55	267 \begin{fig}{wundt}
hekeus@55	268 \raisebox{-4em}{\colfig[0.43]{wundt}}
hekeus@55	269 {\ {\large$\longrightarrow$}\ }
hekeus@55	270 \raisebox{-4em}{\colfig[0.43]{wundt2}}
hekeus@55	271 \caption{
hekeus@55	272 The Wundt curve relating randomness/complexity with
hekeus@55	273 perceived value. Repeated exposure sometimes results
hekeus@55	274 in a move to the left along the curve \cite{Berlyne71}.
hekeus@55	275 }
hekeus@55	276 \end{fig}
hekeus@55	277 Processes with high PIR maintain a certain kind of balance between
hekeus@55	278 predictability and unpredictability in such a way that the observer must continually
hekeus@55	279 pay attention to each new observation as it occurs in order to make the best
hekeus@55	280 possible predictions about the evolution of the sequence. This balance between predictability
hekeus@55	281 and unpredictability is reminiscent of the inverted `U' shape of the Wundt curve (see \Figrf{wundt}),
hekeus@55	282 which summarises the observations of Wundt \cite{Wundt1897} that stimuli are most
hekeus@55	283 pleasing at intermediate levels of novelty or disorder, where there is a balance between
hekeus@55	284 `order' and `chaos'.
hekeus@55	285
hekeus@55	286 \begin{fig}{tranmatrix}
hekeus@55	287 \centering
hekeus@55	288 \includegraphics[width=0.49\linewidth]{figs/PeriodicMatrix.pdf}
hekeus@55	289 \includegraphics[width=0.49\linewidth]{figs/NonPeriodicMatrix.pdf}
hekeus@55	290 \caption{Two transition matrixes representing Markov chains. The shade of grey represents the probabilities of transition from one symbol to the next (white=0, black=1). The current symbol is along the bottom, and the next symbol is along the left. The left hand matrix has no uncertainty; it represents a periodic pattern (a,d,c,b,a,d,c,b,a,d,c,b,a\dots). The right hand matrix contains unpredictability but nonetheless is not completely without perceivable structure (we know for instance that any `b' will always be followed by an `a' and preceded by a `c'), it is of a higher entropy rate. \label{TransitionMatrixes}}
hekeus@55	291 \end{fig}
hekeus@55	292
hekeus@55	293
hekeus@55	294 \begin{fig}{mtriscat}
hekeus@55	295 \centering
hekeus@55	296 \begin{subfigure}[b]{0.5\textwidth}
hekeus@55	297 \centering
hekeus@55	298 \includegraphics[width=\textwidth]{./figs/meltriplot_diag3_cropped.pdf}
hekeus@55	299 \caption{}
hekeus@55	300 \label{mtriscat_diag}
hekeus@55	301 \end{subfigure}
hekeus@55	302 \begin{subfigure}[b]{0.5\textwidth}
hekeus@55	303 \centering
hekeus@55	304 \includegraphics[width=0.66\textwidth]{./figs/meltriplot_top3_cropped.pdf}
hekeus@55	305 \label{mtriscat_top}
hekeus@55	306 \caption{}
hekeus@55	307 \end{subfigure}
hekeus@55	308 \caption{The population of hundreds of randomly generated 8-state transition matrices in the 3D space of
hekeus@55	309 entropy rate ($h_\mu$), redundancy ($\rho_\mu$) and predictive information rate ($b_\mu$),
hekeus@55	310 all in bits. As can be seen in (a) the distribution as a whole makes a curved sheet, with the highest PIR values found at intermediate entropy and redundancy. Although
hekeus@55	311 not visible in this plot, it is largely hollow in the middle.
hekeus@55	312 As can be seen in (b), the same plot with the PIR dimension projected out forms a right angled triangle, this is the triangle which corresponds to the interface of the Melody Triangle.
hekeus@55	313 %The concentrations of points along the redundancy axis correspond
hekeus@55	314 %to Markov chains which are roughly periodic.
hekeus@55	315 % with periods of 2 (redundancy 1 bit),
hekeus@55	316 %3, 4, \etc all the way to period 7 (redundancy 2.8 bits). The dotted line outlines the Wundt-like curve in the plot of PIR values,
hekeus@55	317 }
hekeus@55	318
hekeus@55	319 \end{fig}
hekeus@55	320
hekeus@55	321
hekeus@55	322 % \begin{fig}{mtriscat}
hekeus@55	323 % \colfig[1]{meltriplot_diag3_cropped}
hekeus@55	324 % \colfig[0.66]{meltriplot_top3_cropped}
hekeus@55	325 % \caption{The population of 12-state transition matrices in the 3D space of
hekeus@55	326 % entropy rate ($h_\mu$), redundancy ($\rho_\mu$) and predictive information rate ($b_\mu$),
hekeus@55	327 % all in bits. Note that the distribution as a whole makes a curved triangle. Although
hekeus@55	328 % not visible in this plot, it is largely hollow in the middle.
hekeus@55	329 % The concentrations of points along the redundancy axis correspond
hekeus@55	330 % to Markov chains which are roughly periodic with periods of 2 (redundancy 1 bit),
hekeus@55	331 % 3, 4, \etc all the way to period 7 (redundancy 2.8 bits). The dotted line outlines the Wundt-like curve in the plot of PIR values, note that the highest PIR values are found at intermediate entropy
hekeus@55	332 % and redundancy. \label{InfoDynEngine}}
hekeus@55	333 %\end{fig}
hekeus@55	334
hekeus@55	335
hekeus@55	336
hekeus@56	337 A similar shape is visible in the upper envelope of the plot in \Figrf{mtriscat}a, which is a 3-D scatter plot of the information measures for hundreds of first-order, eight state Markov chain transition matrices.%generated by a random sampling method.
hekeus@55	338 The coordinates of the `information space' are entropy rate ($h_\mu$), redundancy ($\rho_\mu$), and
hekeus@56	339 predictive information rate ($b_\mu$). The matrices are generated by % a random sampling method.
hekeus@56	340 a hierarchical Dirichlet sampling method\cite{Teh2006} to increase the probability of generating very sparse transition matrices, and get a good spread that reaches the edges and corners of the space. The points along the `redundancy' axis correspond
hekeus@55	341 to periodic Markov chains. Those along the `entropy' axis produce uncorrelated sequences
hekeus@55	342 with no temporal structure. Processes with high PIR are to be found at intermediate
hekeus@55	343 levels of entropy and redundancy.
hekeus@56	344
hekeus@55	345
hekeus@55	346 These observations led us to construct the `Melody Triangle'.
hekeus@55	347
hekeus@55	348
hekeus@55	349
hekeus@55	350
hekeus@55	351
hekeus@55	352
hekeus@55	353
hekeus@55	354 \section{The Melody Triangle}\label{makingthetriangle}
hekeus@55	355
hekeus@55	356 The Melody Triangle is an interface that is designed around this natural distribution of Markov chain transition
hekeus@55	357 matrices in the information space of entropy rate ($h_\mu$), redundancy ($\rho_\mu$) and predictive information rate ($b_\mu$), as illustrated in \Figrf{mtriscat}a.
hekeus@55	358
hekeus@55	359 The distribution of transition matrices in this space forms a relatively thin
hekeus@55	360 curved sheet. Thus, it is a reasonable simplification to project out the
hekeus@55	361 third dimension (the PIR) and present an interface that is just two dimensional, resulting in a right-angled triangle, as can be seen in \Figrf{mtriscat}b.
hekeus@55	362
hekeus@56	363 The right-angled triangle is rotated and stretched to form an equilateral triangle with
hekeus@55	364 the `redundancy'/`entropy rate' vertex at the top, the `redundancy' axis down the left-hand
hekeus@55	365 side, and the `entropy rate' axis down the right, as shown in \Figrf{TheTriangle}.
hekeus@55	366 This is our `Melody Triangle' and
hekeus@55	367 forms the interface by which the system is controlled.
hekeus@55	368
hekeus@55	369
hekeus@55	370
hekeus@55	371 \subsection{Usage}
hekeus@55	372 The user selects a point within the triangle, this is mapped into the
hekeus@55	373 information space and the nearest transition matrix is used to generate
hekeus@55	374 a sequence of values which are then sonified either as pitched notes or percussive
hekeus@55	375 sounds.
hekeus@55	376
hekeus@55	377 Though the interface is 2D, the third dimension (predictive information rate) is implicitly present, as
hekeus@55	378 transition matrices retrieved from
hekeus@55	379 along the centre line of the triangle will tend to have higher PIR.
hekeus@55	380 \begin{fig}{TheTriangle}
hekeus@55	381 \colfig[0.75]{TheTriangle.pdf}
hekeus@55	382 \caption{The Melody Triangle}
hekeus@55	383 \end{fig}
hekeus@55	384 As shown in \Figrf{TheTriangle}, the corners correspond to three different extremes of predictability and
hekeus@55	385 unpredictability, which could be loosely characterised as `periodicity', `noise'
hekeus@55	386 and `repetition'. Melodies from the `noise' corner (high $h_\mu$, low $\rho_\mu$
hekeus@55	387 and low $b_\mu$) have no discernible pattern;
hekeus@55	388 those along the `periodicity'
hekeus@55	389 to `repetition' edge are all cyclic patterns that get shorter as we approach
hekeus@55	390 the `repetition' corner, until each is just one repeating note. Those along the
hekeus@55	391 opposite edge consist of independent random notes from non-uniform distributions.
hekeus@55	392 Areas between the left and right edges will tend to have higher predictive information rate ($b_\mu$),
hekeus@55	393 and we hypothesise that, under
hekeus@55	394 the appropriate conditions, these will be perceived as more `interesting' or `melodic.'
hekeus@55	395 These melodies have some level of unpredictability, but are not completely random.
hekeus@55	396 Or, conversely, are predictable, but not entirely so.
hekeus@55	397
hekeus@55	398 Given coordinates corresponding to a point in the triangle, we select from a pre-built
hekeus@55	399 library of random processes, choosing one whose entropy rate and redundancy match the desired
hekeus@55	400 values. The implementations discussed in this paper use first order Markov chains as the content generator,
hekeus@55	401 since it is easy to compute the theoretically exact values of entropy rate, redundancy and predictive
hekeus@55	402 information rate given the transition matrix of the Markov chain. However, in principle, any generative system could be used to create the library of sequences, given an appropriate probabilistic listener model supporting
hekeus@55	403 the estimation of entropy rate and redundancy.
hekeus@55	404
hekeus@56	405 \sloppy The Markov chain based implementation generates streams of symbols in the abstract; the alphabet of symbols is then mapped to a set of distinct sounds, such as pitched notes in a scale or a set of percussive sounds. By layering these streams, intricate musical textures can be created.
hekeus@55	406 %The Melody Triangle does not take into account the statistical experience of our exposure to tonal music. Even if a particular stream of symbols is periodic and predictable, in mapping to the chromatic scale there is a chance that the melody may conflict with culturally defined expectations. A mapping to the diatonic scale however is less likely to lead to such conflicts, and mappings to the pentatonic scale even less so.
hekeus@55	407 %The symbols can also be mapped to a set of percussive sounds,
hekeus@55	408 The number of states in the generated Markov chains corresponds to the number of audio samples used, however the output of the Melody Triangle could even be mapped to non sonic outputs such as visible shapes, colours, or movements. %Further by layering these streams, intricate musical textures can be created.
hekeus@55	409
hekeus@56	410 \fussy The information measures that define the Melody Triangle assume a constant rate of symbols, and thus the output sequences proceed at a constant, uniform rate. Although the placing of events in time and rhythm has a strong effect on expectations, surprise and satisfaction in music, the system does not, as yet, address this temporal dimension. Additionally the system does not address the culturally defined expectations of melodic structure that result from our exposure to tonal music; all symbols are considered equal, regardless of what note in a scale they are mapped to.
hekeus@55	411
hekeus@55	412 %nto the expectations thataccount statistical experience of our exposure to tonal music and of consonance or melodic structure.
hekeus@55	413
hekeus@55	414 \section{The Mobile App}
hekeus@55	415 \label{theapp}
hekeus@55	416 The Melody Triangle has been implemented as an interactive multi-user installation, as a desktop composition tool, and most recently as a mobile app for the Android platform. It was launched on 28th March 2013, and is free to download from the Google Play app store.\footnote{The download link and some sample audio can be found at \href{http://melodytriangle.eecs.qmul.ac.uk/}{http://melodytriangle.eecs.qmul.ac.uk/}} A description of the interactive installation and the desktop versions of the Melody Triangle, as well as some user trials can be read in \cite{ekeusmt1}.
hekeus@55	417
hekeus@55	418 \begin{fig}{screenshot.png}
hekeus@55	419 \colfig[1]{screenshot.png}
hekeus@55	420 \caption{Screenshot of the Melody Triangle mobile app for Android. The letters on the tokens correspond to the instrument they are currently assigned to. P=piano, B=bass, D=drums. }
hekeus@55	421 \end{fig}
hekeus@55	422 To support the crowdsourcing of data, the app needs to provide enough musical variety to engage users. A simple implementation (with for instance, one single concurrent melody, at one single rate and timbre), would make data analysis easier and more straight forward, however the limit musical appeal would make it difficult to collect data from the public. In the next sections we outline the features of the app, describe how the data is collected, and how it will be analysed.
hekeus@55	423
hekeus@55	424 \subsection{Features}
hekeus@55	425 As seen in \Figrf{screenshot.png}, the app provides three tokens that can be dragged in to the triangle using the touch screen. It is with these tokens that the user selects the points in the triangle that will generate sequences, and thus three sequences can be played simultaneously. Each token can be assigned to one of three instruments: piano, bass, drums. The user can change what instrument is assigned to each token by pressing on the token's holder position on the top left. In addition to changing the instrument, the user can also change the register of the instrument; the piano has three octaves, the bass has two. Additionally the user can select the number of notes per beat, as well as specifying whether this token's notes should be delayed to come on the off-beat, allowing for syncopation between the sequences generated by the tokens.
hekeus@55	426
hekeus@55	427 There are also some global controls; the master beats-per-minute can be changed with the `+/-' buttons on the left, and there is an additional settings menu were the user can choose between the diatonic scale, harmonic scale or the pentatonic scale.
hekeus@55	428
hekeus@55	429 The mobile app is pre-populated with two sets of over 8000 matrixes that densely cover the triangular interface. For the diatonic and harmonic scale (and for the drums samples) the transition matrixes contain 8 states, and for the pentatonic scale 6 states. Whenever a transition matrix is selected by placing a token in the triangle, the symbol-to-note mapping is shuffled. This allows the same transition matrix to correspond to multiple melodies. One state for each of the matrixes is mapped to a rest, allowing for some rhythmic variety and to increase the musicality of the output. When a user taps one of the tokens in the triangle it re-shuffles the symbol-to-note mapping while keeping the same transition matrix.
hekeus@55	430
hekeus@55	431 The current transition matrices, settings of each token and the global settings constitute the `state' of the system. A user can save their favourites states locally as presets or share them with the world by pressing the `like' button.
hekeus@55	432
hekeus@55	433 \subsection{Collecting Data - `Likes' and the Melody Triangle `Radio'}
hekeus@58	434 Onscreen notifications encourage users of the app to press the `like' button (the heart icon on the right of the screen) whenever they enjoy what they are hearing. When they do so, the current state of the system is stored and assigned a unique 6 character hash code, referred to as a `song id'. The users are given the option to enter a username, or may choose to remain anonymous. This state is encoded into a small file and uploaded to our servers at Queen Mary, University of London. Geographical information is also stored.
hekeus@55	435
hekeus@58	436 %Uploaded states become available to other users of the app.
hekeus@58	437 It is possible for users of the app to share settings with each other.
hekeus@58	438 By pressing the cloud icon on the right of the screen, the user can type in any song id. When they do so the app downloads the state file from the server and loads the state on to the user's phone.
hekeus@55	439 Additionally the app can go into `radio mode', where the users can quickly and easily audition other users' uploaded states. Upon entering radio mode, the app downloads a randomly selected uploaded state. An additional button appears on the interface, a `skip' button, which whenever it is pressed the app downloads another randomly selected state. Again the users are encouraged (via on screen notifications) to whenever they enjoy one of the downloaded states, to press the `like' button. This allows us over time to build a kind of crowdsourced ranking of the uploaded states, as more popular states get more likes. Users can modify downloaded states and then `like' those, hence states can evolve from other states, and so any uploaded state keeps a history of previous states so that we may track their evolution.
hekeus@55	440
hekeus@55	441 To further encourage uploads and participations, there is a `Hall of Fame' (see \Figrf{halloffame.png}) available at the project website. It shows a list of the users who have contributed the most by uploading many states, as well as chart of most popular songs when `liked' in radio mode.
hekeus@55	442
hekeus@55	443 \begin{fig}{halloffame.png}
hekeus@55	444 \colfig[1]{halloffame.png}
hekeus@55	445 \caption{The Melody Triangle `Hall of Fame' as of 9th of June 2013. The top list shows the most prolific users who have shared the most settings by pressing the `like' button. The lower list shows the top ranked songs based on the number of `likes' a state has received by other users while in `Radio Mode'. The hall of fame can be found at \href{http://melodytriangle.eecs.qmul.ac.uk/}{http://melodytriangle.eecs.qmul.ac.uk/}}
hekeus@55	446 \end{fig}
hekeus@55	447
hekeus@55	448 In previous work \cite{ekeusmt1} we attempted to carry out a lab study to find links between the information theoretic measures of the Melody Triangle and aesthetic preferences, however it quickly became clear that lab conditions were not practical to get significant amounts of data. The Melody Triangle mobile phone app provides an alternative means of collecting data, while engaging crowds with a unique citizen science project.
hekeus@55	449
hekeus@55	450
hekeus@55	451 \subsection{Interpreting Crowdsourced Data}
hekeus@55	452
hekeus@55	453 By collecting many liked settings from users all over the world, it may be possible to identify trends and commonalities across these settings. A submitted setting contains all the information relating to the current state of the app, this forms a feature vector that includes the information measures of the currently playing Markov chains, the current note-to-symbol mappings, instrument/register choices, scale, notes per beat for every token and master BPM. Given a submitted state we can extract a number of additional features that are not explicitly stored in the data representing the state of the system, but that are implicitly available by observing the output. This includes the frequencies of notes and melodic intervals for each melody, and by looking across concurrent melodies, inter-melody intervals allowing us to extract harmonic information.
hekeus@55	454
hekeus@55	455 We can look for clusters in the feature space to answer a variety of questions. For instance we can identify what the most common intervals are, both within a melody, and across concurrent melodies, and whether these correspond to the more consonant intervals. We can look for the average information values of the Markov chains, and see how these vary based on the number of concurrent tokens, the rate at which notes are output, or register for instance. We can see if the states that receive the greatest like-to-download ratio in `Radio mode' have similar information properties to each other.
hekeus@55	456
hekeus@57	457 We are in the active state of research\footnote{As of June 9th 2013, there have been 173 submitted settings. The collected data is being made available to researchers at the project website: \href{http://melodytriangle.eecs.qmul.ac.uk/data}{http://melodytriangle.eecs.qmul.ac.uk/data}.}and a full analysis is yet to be carried out. However it is already clear from data collected so far that the more `predictable' half of the triangle (the half with lower entropy rate and higher redundancy) is preferred to the `unpredictable' half of the triangle. Additionally it has been observed that the visual layout of the interface has an influence on the parameter choices; a number of states contain tokens lined up in rows or columns. Approximately 20\% of states submitted so far contain only the drum sounds, and these may lend themselves to a more straight-forward information theoretic analysis as these are not subject to cultural melodic expectations.
hekeus@55	458
hekeus@56	459 Clusterings in the state-space of the data may provide us with the means to link the information dynamic models and its measures to aesthetic preferences. Additionally if we get enough entries, the geographical information may allow us to determine if there are any cultural differences between users based on countries or continents.
hekeus@56	460
hekeus@56	461
hekeus@55	462
hekeus@55	463 \section{Conclusion}
hekeus@55	464 We presented the Melody Triangle; an interface for the discovery of melodic content where the input --- positions within a triangle --- corresponds to the predictability of the output melodies. The Melody Triangle is contextualised in \emph{information dynamics}; an information theoretic approach to modelling human expectation and surprise.
hekeus@55	465 We outlined the relevant ideas behind information dynamics and described three key information theoretic measures; entropy rate, redundancy and a measure of \emph{predictive information rate}, which describes the gain in information made by current observations about the future, but which are not already known from past observations. We described how the natural distribution of randomly generated Markov chains in terms of these measures lead us to design the Melody Triangle.
hekeus@55	466
hekeus@55	467 We described the Melody Triangle mobile app, a free app for Android, and outlined how it collects data for research by uploading the `liked' settings of users to our servers. We describe the app's `radio mode' that enables users to quickly audition other uploaded states provide feedback to form a crowd-sourced rankings table of most popular settings. Finally we outline how the collected data will be used to look for trends and commonalities in the uploaded settings, and to help identify any relationship between the information-dynamic model of human expectation and aesthetic preference.
hekeus@55	468
hekeus@55	469 \begin{acknowledgments}
hekeus@55	470 This work is supported by an EPSRC Doctoral Training Centre EP/G03723X/1 (HE), GR/S82213/01 and \\EP/E045235/1(SA), an EPSRC Leadership Fellowship, \\EP/G007144/1 (MDP) and EPSRC IDyOM2 EP/H013059/1. The Melody Triangle mobile app was developed with QApps and supported by impactQM, funded by the EPSRC.
hekeus@55	471 \end{acknowledgments}
hekeus@55	472
hekeus@55	473 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
hekeus@55	474 %bibliography here
hekeus@55	475 \bibliography{all,c4dm,all3}
hekeus@55	476
hekeus@55	477 \end{document}

Mercurial > hg > mtridoc

annotate smc2013_cameraReady/smc2013.tex @ 58:a63c438b3f65 tip