-Output written on chiliguano_msc_finalproject.pdf (16 pages, 76350 bytes).
-PDF statistics:
- 71 PDF objects out of 1000 (max. 8388607)
- 0 named destinations out of 1000 (max. 500000)
- 1 words of extra memory for PDF output out of 10000 (max. 10000000)
-
diff -r f1504bb2c552 -r 1dbd24575d44 Report/chiliguano_msc_finalproject.pdf
Binary file Report/chiliguano_msc_finalproject.pdf has changed
diff -r f1504bb2c552 -r 1dbd24575d44 Report/chiliguano_msc_finalproject.synctex.gz
Binary file Report/chiliguano_msc_finalproject.synctex.gz has changed
diff -r f1504bb2c552 -r 1dbd24575d44 Report/chiliguano_msc_finalproject.tex
--- a/Report/chiliguano_msc_finalproject.tex Tue Jul 28 21:14:27 2015 +0100
+++ b/Report/chiliguano_msc_finalproject.tex Tue Aug 04 12:13:47 2015 +0100
@@ -1,41 +1,100 @@
-\documentclass[12pt,draft,spaced,oneside,openright]{qmwphd}
+% \documentclass[12pt,doubledspaced,oneside,openright]{qmwphd}
% Final MSc project report using qmwphd.cls
-\usepackage{moreverb} % This defines \verbatiminput.
+% \usepackage{moreverb} % This defines \verbatiminput.
+
+\documentclass[a4paper,12pt,draft]{report}
+%PhD Thesis Template for the School of Electronic Engineering and Computer Science, Queen Mary University of London. Stripped from Dan Stowell's PhD.
+
+%BEFORE SUBMISSION DO THESE:
+% * deactivate all \includeonly
+% * ensure \doneit set to nothing
+% * ensure numbering CONTINUOUS from title page on through
+% * activate the includes of license, ack, etc
+% * check through for question mark errors in render
+% * make sure the bibliog doesn't have ugly urls in
+
+\usepackage{ifdraft}
+\usepackage{amsmath}
+\usepackage{amsfonts}
+\usepackage{amssymb}
+\usepackage{natbib}
+\usepackage{har2nat}
+\usepackage{rotating}
+\usepackage[breaklinks]{hyperref}
+\usepackage{subfig} % apparently subfig is the one to use not subfigure
+\usepackage{appendix}
+\usepackage{tipa}
+\usepackage{clrscode}
+\usepackage{setspace}
+\usepackage[absolute]{textpos}
+
+
\begin{document}
+
+\setlength{\TPHorizModule}{200mm}
+\setlength{\TPVertModule}{100mm}
+\textblockorigin{61mm}{19mm}
+
+%%%%% thanks alex mclean for super-useful onscreen reading tip:
+%\usepackage[top=0.1in, bottom=0.1in, left=0.3in, right=0.3in, paperwidth=11in, paperheight=7in]{geometry} % activate for ONSCREEN reading shape AT HOME
+%\usepackage[top=0.1in, bottom=0.1in, left=0.3in, right=0.3in, paperwidth=11in, paperheight=8.5in]{geometry} % activate for ONSCREEN reading shape AT WORK
-\frontmatter
+\doublespacing{}
-\author{Paulo Esteban Chiliguano Torres}
+% numbering starts from here:
+\pagenumbering{arabic}
+
+% titlepage stuff
\title{Hybrid music recommender using content-based and social information}
-\qualification{Master of Science}
+\author{Paulo Esteban Chiliguano Torres \\
+ \\
+ Project report 2015\\
+ \\
+ School of Electronic Engineering and Computer Science\\
+ Queen Mary University of London
+}
+\date{2015}
+
+% \frontmatter
+% \author{Paulo Esteban Chiliguano Torres}
+% \title{Hybrid music recommender using content-based and social information}
+% \qualification{Master of Science}
\maketitle
-\begin{summary}
-.
-\end{summary}
+
+% \include{acknowledgements/acknowledgements}
+
+% \begin{summary}
+% \end{summary}
+\include{abstract/abstract}
+
+\setcounter{page}{3}
\tableofcontents
\listoffigures
+\listoftables
+
% could also have a \listoftables, but this example doesn't include any
-\begin{acknowledgements}
-.
-\end{acknowledgements}
+%\mainmatter
+% Start the main context
+\include{chapter1/introduction}
-\mainmatter
-% Start the main context
-\include{ch1/ch1}
-\include{ch2/ch2}
-\include{ch3/ch3}
-\include{ch4/ch4}
-\include{ch5/ch5}
-\include{ch6/ch6}
+\include{chapter2/background}
-\bibliographystyle{plain}
-\bibliography{chiliguano_msc_finalproject}
+\include{chapter3/ch3}
-\backmatter
+\include{chapter4/evaluation}
+
+\include{chapter5/results}
+
+\include{chapter6/conclusions}
+
+\bibliographystyle{agsm}
+\bibliography{references}
+
+%\backmatter
\end{document}
\ No newline at end of file
diff -r f1504bb2c552 -r 1dbd24575d44 Report/chiliguano_msc_finalproject.toc
--- a/Report/chiliguano_msc_finalproject.toc Tue Jul 28 21:14:27 2015 +0100
+++ b/Report/chiliguano_msc_finalproject.toc Tue Aug 04 12:13:47 2015 +0100
@@ -1,27 +1,31 @@
-\contentsline {chapter}{\numberline {1}Introduction}{7}
-\contentsline {section}{\numberline {1.1}Outline of the thesis}{8}
-\contentsline {chapter}{\numberline {2}Background}{9}
-\contentsline {section}{\numberline {2.1}Recommender Systems}{9}
-\contentsline {subsection}{\numberline {2.1.1}Content-based Recommender Systems}{9}
-\contentsline {subsection}{\numberline {2.1.2}Collaborative filtering Recommender System}{9}
-\contentsline {subsection}{\numberline {2.1.3}Hybrid Recommender Systems}{10}
-\contentsline {section}{\numberline {2.2}Online Social Networks}{10}
-\contentsline {subsection}{\numberline {2.2.1}APIs}{10}
-\contentsline {section}{\numberline {2.3}Data Fusion Techniques}{10}
-\contentsline {chapter}{\numberline {3}Main contribution}{11}
-\contentsline {section}{\numberline {3.1}Methods}{11}
-\contentsline {subsection}{\numberline {3.1.1}Content based modelling}{11}
-\contentsline {subsection}{\numberline {3.1.2}Collaborative filtering}{11}
-\contentsline {section}{\numberline {3.2}Algorithms}{11}
-\contentsline {subsection}{\numberline {3.2.1}Deep Belief Networks}{11}
-\contentsline {subsubsection}{Convolutional Deep Belief Network (CDBN)}{11}
-\contentsline {chapter}{\numberline {4}Experiments}{12}
-\contentsline {section}{\numberline {4.1}Evaluation for recommender systems}{12}
-\contentsline {subsection}{\numberline {4.1.1}Types of experiments}{12}
-\contentsline {section}{\numberline {4.2}Evaluation settings}{13}
-\contentsline {subsection}{\numberline {4.2.1}Dataset}{13}
-\contentsline {subsection}{\numberline {4.2.2}Evaluation measures}{13}
-\contentsline {subsection}{\numberline {4.2.3}Experimentation aims}{13}
-\contentsline {chapter}{\numberline {5}Results}{14}
-\contentsline {chapter}{\numberline {6}Conclusion}{15}
-\contentsline {chapter}{Bibliography}{16}
+\contentsline {chapter}{\numberline {1}Introduction}{7}{chapter.1}
+\contentsline {section}{\numberline {1.1}Outline of the thesis}{8}{section.1.1}
+\contentsline {chapter}{\numberline {2}Background}{9}{chapter.2}
+\contentsline {section}{\numberline {2.1}Recommender Systems}{9}{section.2.1}
+\contentsline {subsection}{\numberline {2.1.1}Collaborative filtering (CF)}{9}{subsection.2.1.1}
+\contentsline {subsection}{\numberline {2.1.2}Content-based methods}{10}{subsection.2.1.2}
+\contentsline {subsection}{\numberline {2.1.3}Hybrid methods}{10}{subsection.2.1.3}
+\contentsline {section}{\numberline {2.2}Online Social Networks}{10}{section.2.2}
+\contentsline {section}{\numberline {2.3}Deep Learning}{11}{section.2.3}
+\contentsline {subsection}{\numberline {2.3.1}Convolutional Neural Networks (CNN)}{11}{subsection.2.3.1}
+\contentsline {section}{\numberline {2.4}Estimation of Distribution Algorithms (EDAs)}{11}{section.2.4}
+\contentsline {chapter}{\numberline {3}Methodology}{12}{chapter.3}
+\contentsline {section}{\numberline {3.1}Data collection}{12}{section.3.1}
+\contentsline {subsection}{\numberline {3.1.1}Taste profile subset filtering}{12}{subsection.3.1.1}
+\contentsline {subsection}{\numberline {3.1.2}Audio samples collection}{12}{subsection.3.1.2}
+\contentsline {subsection}{\numberline {3.1.3}Log-mel spectrograms}{12}{subsection.3.1.3}
+\contentsline {section}{\numberline {3.2}Algorithms}{12}{section.3.2}
+\contentsline {subsection}{\numberline {3.2.1}CNN implementation}{12}{subsection.3.2.1}
+\contentsline {subsubsection}{Genre classification}{12}{section*.4}
+\contentsline {subsection}{\numberline {3.2.2}Continuous Bayesian EDA}{12}{subsection.3.2.2}
+\contentsline {subsection}{\numberline {3.2.3}EDA-based hybrid recommender}{12}{subsection.3.2.3}
+\contentsline {chapter}{\numberline {4}Experiments}{13}{chapter.4}
+\contentsline {section}{\numberline {4.1}Evaluation for recommender systems}{13}{section.4.1}
+\contentsline {subsection}{\numberline {4.1.1}Types of experiments}{13}{subsection.4.1.1}
+\contentsline {section}{\numberline {4.2}Evaluation settings}{14}{section.4.2}
+\contentsline {subsection}{\numberline {4.2.1}Dataset}{14}{subsection.4.2.1}
+\contentsline {subsection}{\numberline {4.2.2}Evaluation measures}{15}{subsection.4.2.2}
+\contentsline {subsection}{\numberline {4.2.3}Experimentation aims}{15}{subsection.4.2.3}
+\contentsline {chapter}{\numberline {5}Results}{16}{chapter.5}
+\contentsline {chapter}{\numberline {6}Conclusion}{17}{chapter.6}
+\contentsline {section}{\numberline {6.1}Future work}{17}{section.6.1}
diff -r f1504bb2c552 -r 1dbd24575d44 Report/references.bib
--- /dev/null Thu Jan 01 00:00:00 1970 +0000
+++ b/Report/references.bib Tue Aug 04 12:13:47 2015 +0100
@@ -0,0 +1,165 @@
+@incollection{Lops2011,
+ year={2011},
+ isbn={978-0-387-85819-7},
+ booktitle={Recommender Systems Handbook},
+ editor={Ricci, Francesco and Rokach, Lior and Shapira, Bracha and Kantor, Paul B.},
+ doi={10.1007/978-0-387-85820-3_3},
+ title={Content-based Recommender Systems: State of the Art and Trends},
+ url={http://dx.doi.org/10.1007/978-0-387-85820-3\_3},
+ publisher={Springer US},
+ author={Lops, Pasquale and de Gemmis, Marco and Semeraro, Giovanni},
+ pages={73-105},
+ language={English}
+}
+
+@ARTICLE{Burke2002331,
+ author={Burke, R.},
+ title={Hybrid recommender systems: Survey and experiments},
+ journal={User Modelling and User-Adapted Interaction},
+ year={2002},
+ volume={12},
+ number={4},
+ pages={331-370},
+ doi={10.1023/A:1021240730564},
+ note={cited By 0},
+ url={http://www.scopus.com/inward/record.url?eid=2-s2.0-0036959356\&partnerID=40\&md5=28885a102109be826507abc2435117a7},
+ document_type={Article},
+ source={Scopus},
+}
+
+@ARTICLE{Yoshii2008435,
+ author={Yoshii, K. and Goto, M. and Komatani, K. and Ogata, T. and Okuno, H.G.},
+ title={An efficient hybrid music recommender system using an incrementally trainable probabilistic generative model},
+ journal={IEEE Transactions on Audio, Speech and Language Processing},
+ year={2008},
+ volume={16},
+ number={2},
+ pages={435-447},
+ doi={10.1109/TASL.2007.911503},
+ art_number={4432655},
+ note={cited By 0},
+ url={http://www.scopus.com/inward/record.url?eid=2-s2.0-39649112098\&partnerID=40\&md5=6827f82844ae1da58a6fa95caf5092d9},
+ document_type={Article},
+ source={Scopus},
+}
+
+
+@article{JCC4:JCC4393,
+author = {boyd, danah m. and Ellison, Nicole B.},
+title = {Social Network Sites: Definition, History, and Scholarship},
+journal = {Journal of Computer-Mediated Communication},
+volume = {13},
+number = {1},
+publisher = {Blackwell Publishing Inc},
+issn = {1083-6101},
+url = {http://dx.doi.org/10.1111/j.1083-6101.2007.00393.x},
+doi = {10.1111/j.1083-6101.2007.00393.x},
+pages = {210--230},
+year = {2007},
+}
+
+@ARTICLE{Castanedo2013,
+ author={Castanedo, F.},
+ title={A review of data fusion techniques},
+ journal={The Scientific World Journal},
+ year={2013},
+ volume={2013},
+ doi={10.1155/2013/704504},
+ art_number={704504},
+ note={cited By 0},
+ url={http://www.scopus.com/inward/record.url?eid=2-s2.0-84888882639\&partnerID=40\&md5=827fabc750db24f662fdae1c798f2507},
+ document_type={Review},
+ source={Scopus},
+}
+
+
+@CONFERENCE{Lee20091096,
+ author={Lee, H. and Yan, L. and Pham, P. and Ng, A.Y.},
+ title={Unsupervised feature learning for audio classification using convolutional deep belief networks},
+ journal={Advances in Neural Information Processing Systems 22 - Proceedings of the 2009 Conference},
+ year={2009},
+ pages={1096-1104},
+ note={cited By 0},
+ url={http://www.scopus.com/inward/record.url?eid=2-s2.0-84863380535\&partnerID=40\&md5=e872a6227c816850167f91bb2d41d8b7},
+ document_type={Conference Paper},
+ source={Scopus},
+}
+
+
+@TechReport {export:115396,
+abstract = {Recommender systems are now popular both commercially and in the research
+ community, where many approaches have been suggested for providing
+ recommendations. In many cases a system designer that wishes to employ a
+ recommendation system must choose between a set of candidate approaches. A first
+ step towards selecting an appropriate algorithm is to decide which properties of
+ the application to focus upon when making this choice. Indeed, recommendation
+ systems have a variety of properties that may affect user experience, such as
+ accuracy, robustness, scalability, and so forth. In this paper we discuss how to
+ compare recommenders based on a set of properties that are relevant for e
+ application. We focus on comparative studies, where a few algorithms are compared
+ using some evaluation metric, rather than absolute benchmarking of algorithms. We
+ describe experimental settings appropriate for making choices between algorithms.
+ We review three types of experiments, starting with an offline setting, where
+ recommendation approaches are compared without user interaction, then reviewing
+ user studies, where a small group of subjects experiment with the system and
+ report on the experience, and finally describe large scale online experiments,
+ where real user populations interact with the system. In each of these cases we
+ describe types of questions that can be answered, and suggest protocols for
+ experimentation. We also discuss how to draw trustworthy conclusions from e
+ conducted experiments. We then review a large set of properties, and explain how
+ to evaluate systems given relevant properties. We also survey a large set of
+ evaluation metrics in the context of the property that they evaluate.
},
+author = {Guy Shani and Asela Gunawardana},
+month = {November},
+number = {MSR-TR-2009-159},
+publisher = {Microsoft Research},
+title = {Evaluating Recommender Systems},
+url = {http://research.microsoft.com/apps/pubs/default.aspx?id=115396},
+year = {2009},
+}
+
+@phdthesis {1242,
+ title = {Music Recommendation and Discovery in the Long Tail},
+ year = {2008},
+ school = {Universitat Pompeu Fabra},
+ address = {Barcelona},
+ abstract = {
+Music consumption is biased towards a few popular artists. For instance, in 2007 only 1\% of
+all digital tracks accounted for 80\% of all sales. Similarly, 1,000 albums accounted for 50\%
+of all album sales, and 80\% of all albums sold were purchased less than 100 times. There is
+a need to assist people to filter, discover, personalise and recommend from the huge amount
+of music content available along the Long Tail.
+
+
+Current music recommendation algorithms try to
+accurately predict what people demand to listen to. However, quite
+often these algorithms tend to recommend popular -or well-known to the
+user- music, decreasing the effectiveness of the recommendations. These
+approaches focus on improving the accuracy of the recommendations. That
+is, try to make
+accurate predictions about what a user could listen to, or buy next,
+independently of how
+useful to the user could be the provided recommendations.
+
+
+In this Thesis we stress the importance of the user{\textquoteright}s
+perceived quality of the recommendations. We model the Long Tail curve
+of artist popularity to predict -potentially-
+interesting and unknown music, hidden in the tail of the popularity
+curve. Effective recommendation systems should promote novel and
+relevant material (non-obvious recommendations), taken primarily from
+the tail of a popularity distribution.
+
+
+The main contributions of this Thesis are: (i) a novel network-based approach for
+recommender systems, based on the analysis of the item (or user) similarity graph, and the
+popularity of the items, (ii) a user-centric evaluation that measures the user{\textquoteright}s relevance
+and novelty of the recommendations, and (iii) two prototype systems that implement the
+ideas derived from the theoretical work. Our findings have significant implications for
+recommender systems that assist users to explore the Long Tail, digging for content they
+might like.
+
+},
+ url = {http://mtg.upf.edu/static/media/PhD\_ocelma.pdf},
+ author = {Celma, \{`O}.},
+}