diff nime2012/mtriange.tex @ 5:d5702ab76262

Starting to make coherent sentences :)
author Henrik Ekeus <hekeus@eecs.qmul.ac.uk>
date Thu, 02 Feb 2012 17:34:19 +0000
parents bd15a95a0c60
children 99b7caedab90
line wrap: on
line diff
--- a/nime2012/mtriange.tex	Wed Feb 01 17:16:12 2012 +0000
+++ b/nime2012/mtriange.tex	Thu Feb 02 17:34:19 2012 +0000
@@ -17,40 +17,81 @@
 \title{The Melody Triangle - Pattern and Predictability in Music}
 \numberofauthors{2}
 \author{
- \alignauthor Henrik Ekeus (1),  Samer Abdallah (1)\\
+ \alignauthor Henrik Ekeus (1),  Samer Abdallah (1), Mark D. Plumbley, Peter W. McOwan\\
      \affaddr{(1) Centre for Digital Music}\\
-     \affaddr{Queen Mary, Univ. of London}
+     \affaddr{Queen Mary University of London}
 }
 
 \begin{document}
 \maketitle
 \begin{abstract}
+The Melody Triangle is a musical pattern generating system.  It has two interfaces; one is a traditional screen based interface, the other a multi-user interactive installation.  In both cases, the Melody Triangle allows its users to interactively explore patterns of predictability in music. It makes use of statistical models developed as part of Information Dynamics of Music(IDyOM) project[ref], which seeks to model patterns of expectation and surprise in the perception of music.   
 
-This is the abstract.
+We outline the Information Dynamics model and how it forms the basis of the Melody Triangle.  We discuss both uses of the system, the multi-user installation where collaboration in a performative setting provides a playful yet informative way to explore expectation and surprise in music, and the screen based interface where the Melody Triangle becomes compositional tool.  Finally we outline a study where participants used the screen-based interface under various experimental conditions to allow us to determine the relationship between the Information Dynamics models and musical preference. We found that\dots   	
+
 \end{abstract}
 \keywords{Information dynamics, Markov chains, Collaborative performance, Aleatoric composition}
 
 \section{Introduction}
 
-The Melody Triangle is a musical patter generating system that 
+ Music generally involves patterns in time. Composers commonly, consciously or not, play with his or her audience's expectations by setting up patterns that seem more or less predictable, and thus manipulate expectations and surprise in the listener[ref].  The research into Information Dynamics explores several different kinds of predictability in musical patterns, how human listeners might perceive these, and how they shape or affect the listening experience.
 
-explores patterns of predictability in music in an interactive sound installation.  This installation makes use of some models developed as part of research into Information Dynamics of Music.  Music generally involves patterns in time, often a melodies or chord sequences are repeated, and times they are slightly modified.   Composers commonly, consciously or not, play with his or her audience's expectations, by setting up patterns at that seem more or less predictable, and thus manipulating expectations and surprise in the listener.  The research into information dynamics explores several different kinds of predictability in musical patterns, how human listeners might perceive these, and how they shape or affect the listening experience.
+
+\section{Information Dynamics and the Triangle }
+(some background on IDyOM and Markov chains)
+
+\begin{figure*}[t]
+\centering
+\includegraphics[width=1.0\textwidth]{InfoDynEngine.pdf}
+\caption{Screen Shot from the Information Dynamics engine - the current and next transition matrixes are on the left.  The upper one has no uncertainty and thus represents a periodic pattern. The lower one contains unpredictability but nonetheless is not completely without perceivable structure.  On the right we see the population of transition matrixes distributed along three axies of redundancy, entropy rate and predictive -information rate.  Note how the distribution makes triangle-like plane floating in 3d space.\label{InfoDynEngine}}
+\end{figure*}
+
+
+
+The Information Dynamics model operates on discreet symbols, only at the output stage is any symbol mapped to a particular note. Each stream of symbols is at any one time defined by a transition matrix.  A transition matrix defines the probabilistic distribution for the symbol following the current one.  In fig.\ref{InfoDynEngine}, on the left we see two transition matrices, the upper one having no uncertainty and therefore outlining a periodic pattern.   The lower one containing considerable unpredictability but is nonetheless not completely without perceivable structure, it is of a higher entropy.  The current symbol is along the bottom, and in this case there are twelve possibilities.  In the upper matrix in fig, we can see for example that symbol 4 must follow symbol 3, and that symbol 10 must follow symbol 4, and so on.   
Hundreds of transition matrixes are generated, and they are then placed in a 3d statistical space based on 3 information measures calculated from the matrix, these are redundancy, entropy rate, and predictive-information rate [see [cite]].  In fig.\ref{InfoDynEngine} on the right, we see a representation of these matrixes distributed; each one of these points corresponds to a transition matrix.  Entropy rate is the average uncertainty for the next symbol as we go through the sequence.  A looping sequence has 0 entropy, a sequence that is difficult to predict has high entropy rate.   Entropy rate is an average of ÔsurprisingnessÕ over time.  
+
Redundancy tells us the difference in uncertainty before we look at the context (the fixed point distribution) and the uncertainty after we look at context.  For instance a matrix with high redundancy, such as one that represents a long periodic sequence, would have high uncertainty before we look at the context but as soon as we look at the previous symbol, the uncertainty drops to zero because we now know what is coming next.

Predictive information rate tell us the average reduction in uncertainty upon perceiving a symbol; a system with high predictive information rate means that each symbol tells you more about the next one.  If we imagine a purely periodic sequence, each symbol tells you nothing about the next one, that we didn't already know as we already know how the pattern is going.  Similarly with a seemingly uncorrelated sequence,  seeing the next symbol does not tell us anymore because they are completely independent anyway; there is no pattern.   There is a subset of transition matrixes that have high predictive information rate, and it is neither the periodic ones, nor the completely un-corellated ones.  Rather they tend to yield output that have certain characteristic patterns, however a listener can't necessarily know when they occur.  However a certain sequence of symbols might tell us about which one of the characteristics patterns will show up next.  Each symbols tell a us little bit about the future but nothing about the infinite future, we only learn about that as time goes on; there is continual building of prediction.

When we look at the distribution of matrixes generated by a random sampling method in this 3d space of entropy rate, redundancy and predictive information rate, it forms an arch shape that is fairly thin, and it thus becomes a reasonable approximation to pretend that it is just a sheet in two dimensions(see fig.\ref{InfoDynEngine}).  It is this triangular sheetfig.\ref{TheTriangle} that is then mapped either to the screen, or in the case of the interactive installation, physical space.  Each corner corresponding to three different extremes of predictability/unpredictability, which could be loosely characterised as periodicity, noise and repetition.  
+
+\begin{figure*}[t]
+\centering
+\includegraphics[width=0.5\textwidth]{TheTriangle.pdf}
+\caption{The Melody Triangle - the triangle's axis corresponds to \label{TheTriangle}}
+\end{figure*}
+
+
+
+
+\section{User Interfaces}
+The Melody Triangle engine, developed in Prolog and MatLab, can be controlled by OSC messages and thus any number of interfaces could be developed to for it. Currently two different interfaces exist; a standard screen based interface where a user moves tokens with a mouse in and around a triangle on screen, and a multi-user interactive installation where a Kinnect camera tracks individuals in a space and maps their positions in the space to the triangle.  
+
+\subsection{The Multi-User Installation}
+
+
 
  the statistical properties of this melody is based on where in the physical room the participant is standing, as this is mapped to a statistical space (see below).  By exploring the physical space the participants thus explore the predictability of the melodic and rhythmical patterns, based on a simple model of how might guess the next musical event given the previous one.  
 \dots
 
 
-\section{ The Information Dynamics Model }
-(some background on IDyOM)
-The active space is triangular, with each corner corresponding to three different extremes of predictability/unpredictability.  When multiple people are in the space, they can  cooperate to create musical polyphonic textures.   For example, one person could lay down a predictable repeating bass line by keeping themselves to the periodicity/repetition side of the room, while a companion can generate a freer melodic line by being nearer the 'noise' part of the space.
+When multiple people are in the space, they can cooperate to create musical polyphonic textures.   For example, one person could lay down a predictable repeating bass line by keeping themselves to the periodicity/repetition side of the room, while a companion can generate a freer melodic line by being nearer the 'noise' part of the space.
 
-\subsection{User Interface}
+
+
+
+\subsection{The Screen Based Interface}
+
+[screen shot]
 On the screen is a triangle and a round token.
 
 With the mouse you can click and drag the red token and move it around the screen.
 When the red token is dragged into the triangle, the system will start generating a sequence of piano notes.  The pattern of notes depends on where in the triangle the token is
 
-The information dynamics engine has as input positions in space, and outputs a stream of symbols for each of those coordinates.  These symbols are then interpreted as notes in a scale, each individual thus generating a melody.  Each stream of symbols is at any one time defined by a transition matrix.  A transition matrix defines the probabilistic distribution for the symbol following the current one.  In fig x above, on the left we see two transition matrices, the upper one having no uncertainty and therefore outlining a periodic pattern.   The lower one containing considerable unpredictability but is nonetheless not completely without perceivable structure, it is of a higher entropy.  The current symbol is along the bottom, and in this case there are twelve possibilities.  In the upper matrix in fig, we can see for example that symbol 4 must follow symbol 3, and that symbol 10 must follow symbol 4, and so on.   
Hundreds of transition matrixes are generated, and they are then placed in a 3d statistical space based on 3 information measures calculated from the matrix, these are redundancy, entropy rate, and predictive-information rate [see [cite]].  In fig x on the right, we see a representation of these matrixes distributed; each one of these points corresponds to a transition matrix.  Entropy rate is the average uncertainty for the next symbol as we go through the sequence.  A looping sequence has 0 entropy, a sequence that is difficult to predict has high entropy rate.   Entropy rate is an average of ÔsurprisingnessÕ over time.  
Redundancy tells us the difference in uncertainty before we look at the context (the fixed point distribution) and the uncertainty after we look at context.  For instance a matrix with high redundancy, such as one that represents a long periodic sequence, would have high uncertainty before we look at the context but as soon as we look at the previous symbol, the uncertainty drops to zero because we now know what is coming next.

Predictive information rate tell us the average reduction in uncertainty upon perceiving a symbol; a system with high predictive information rate means that each symbol tells you more about the next one.  If we imagine a purely periodic sequence, each symbol tells you nothing about the next one, that we didn't already know as we already know how the pattern is going.  Similarly with a seemingly uncorrelated sequence,  seeing the next symbol does not tell us anymore because they are completely independent anyway; there is no pattern.   There is a subset of transition matrixes that have high predictive information rate, and it is neither the periodic ones, nor the completely un-corellated ones.  Rather they tend to yield output that have certain characteristic patterns, however a listener can't necessarily know when they occur.  However a certain sequence of symbols might tell us about which one of the characteristics patterns will show up next.  Each symbols tell a us little bit about the future but nothing about the infinite future, we only learn about that as time goes on; there is continual building of prediction.

When we look at the distribution of matrixes generated by a random sampling method in this 3d space of entropy rate, redundancy and predictive information rate, it forms an arch shape that is fairly thin, and it thus becomes a reasonable approximation to pretend that it is just a sheet in two dimensions.  It is this triangular sheet that is then mapped on to the screen. 
+
+
+\section{Information Dynamics and Musical Preference}
+
+
+
+
+
 
 
 \section{Technical Implementation (needed?)}