SoDaMaT Project » History » Version 56
Steve Welburn, 2012-11-20 12:42 PM
1 | 1 | Steve Welburn | h1. Sound Data Management Training (SoDaMaT) |
---|---|---|---|
2 | 1 | Steve Welburn | |
3 | 3 | Steve Welburn | {{>toc}} |
4 | 3 | Steve Welburn | |
5 | 34 | Steve Welburn | ??(for general information re. Research Data Management please see the "parent project Wiki":https://code.soundsoftware.ac.uk/projects/smdmrd/wiki)?? |
6 | 34 | Steve Welburn | |
7 | 55 | Steve Welburn | [[1 SoDaMaT Overview]] |
8 | 2 | Steve Welburn | |
9 | 56 | Steve Welburn | [[2 SoDaMaT Background]] |
10 | 3 | Steve Welburn | |
11 | 3 | Steve Welburn | h2. The Digital Music and Audio Researcher Profile |
12 | 3 | Steve Welburn | |
13 | 11 | Steve Welburn | A wealth of material for training researchers in data management has been produced by previous JISC-funded projects such as "Incremental":http://www.jisc.ac.uk/whatwedo/programmes/mrd/rdmi/incremental.aspx and those in the "RDMTrain":http://www.jisc.ac.uk/whatwedo/programmes/mrd/rdmtrain.aspx programme. The "Research Data Management Skills Support Initiative":http://www.dcc.ac.uk/training/data-management-courses-and-training/skills-frameworks (DaMSSI), which collected and compared the results from discipline-specific data management training projects in the "RDMTrain":http://www.jisc.ac.uk/whatwedo/programmes/mrd/rdmtrain.aspx programme, in its "final report":http://www.dcc.ac.uk/webfm_send/532 came to the conclusion that "participants respond well to discipline-specific examples and the opportunity to discuss issues with tutors and others in similar disciplines" and that "a discipline-specific approach is more likely to engage students - in many cases principles are the same across disciplines but are more interesting to students if these principles can be seen in the students' own context". "DaMSSI":http://www.dcc.ac.uk/training/data-management-courses-and-training/skills-frameworks also produced three discipline-specific researcher profiles - in the social sciences, in clinical psychology, and in archeology - and two generic data profiles - the conservator and the data manager. We believe that researchers at the Centre for Digital Music, and researchers in similar laboratories or institutions, do not fit in the above-mentioned profiles. |
14 | 3 | Steve Welburn | |
15 | 27 | Steve Welburn | The "Centre for Digital Music":http://www.elec.qmul.ac.uk/digitalmusic (C4DM) at QMUL is one of the leading research centres in the field of audio and music technology and signal processing. C4DM makes use of a variety of data as research inputs - most obviously audio datasets - and produces a variety of types of data as research outputs. These outputs include: |
16 | 27 | Steve Welburn | # manually annotated feature data ("reference annotations") such as expert chord and key transcriptions of existing music recordings which are used as comparative data for evaluating research work; |
17 | 27 | Steve Welburn | # automatically produced annotations such as those accompanying the publication of methods for audio feature analysis. |
18 | 3 | Steve Welburn | |
19 | 3 | Steve Welburn | The primary targets for the training material to be produced by the proposed project are postgraduate research students, and research and academic staff in C4DM, who perform research over a range of areas including music informatics, machine listening, audio engineering and interaction. C4DM is one of the leading research centres in the field of audio and music technology and signal processing. C4DM makes use of a variety of data as research inputs - most obviously audio datasets - and produces a variety of types of data as research outputs. A common use-case in C4DM research is to run a newly-developed analysis algorithm on a set of audio examples and evaluate the algorithm by comparing its output with that of a human annotator. Results are then compared with published results using the same input data to determine whether the newly proposed approach makes any improvement on the state of the art. |
20 | 3 | Steve Welburn | |
21 | 3 | Steve Welburn | The type of data used in digital music and audio research poses some challenges that need to be addressed in discipline-specific training material. These challenges include: |
22 | 31 | Steve Welburn | # Copyright: the copyright status of digital music data is often difficult to establish. For example, the owner of internally generated data might be unclear, or data purchased or downloaded from outside might have special license requirements that must be adhered to. This prevents researchers from publishing data in order to avoid unnecessary risk. Addressing this aspect in detail and emphasising the use of less restrictive licenses (e.g. "Creative Commons":http://creativecommons.org/ , "Open Data Commons":http://opendatacommons.org/ ), could lead to a larger amount of data being published in public repositories. |
23 | 9 | Steve Welburn | # Metadata: the line between data and metadata is often unclear. For example, descriptive metadata (e.g. a song's title, author, year of publication, or key) is in another context used as data. The training material will focus on defining what data and metadata are, on the importance of metadata standards, and on their use, together with standard protocols such as "OAI-PMH":http://www.openarchives.org/OAI/openarchivesprotocol.html and "SWORD":http://swordapp.org/, to exchange data among repositories. |
24 | 3 | Steve Welburn | # Ethical approval and participant agreement: experimental work based on human responses (e.g. perceptual listening tests) require ethical approval. The lack of information and experience on this topic leads people to write ethics forms that prohibit the release of data, preventing other researchers from reproducing or extending their results, when data could be safely released with the participants' consent if anonymised. %Data is often not published because, for lack of information, the creators tend to be exceedingly "safe" in this respect. The material will include information on how ethical approval works, how to obtain it, and information about publication of sensitive data. |
25 | 3 | Steve Welburn | |
26 | 21 | Steve Welburn | In addition to the recommendations from "DaMSSI":http://www.dcc.ac.uk/training/data-management-courses-and-training/skills-frameworks , the need for specific training material for digital music and audio researchers is justified by at least two additional factors. First, most of the researchers are either computer scientists or electrical engineers and have advanced IT skills. Second, the data is very heterogeneous, rapidly changing, and relatively small in size. As a result, it is usually managed by the creator of the data itself. Thus, the clear separation pointed out by the profiles produced by "DaMSSI":http://www.dcc.ac.uk/training/data-management-courses-and-training/skills-frameworks , as well as in "Pryor and Donnelly":http://www.ijdc.net/index.php/ijdc/article/view/126 (2009, p. 165), between the data creator and the data manager/librarian/scientist becomes blurred: all the different aspects can be, and often are, taken care of by the same person. |
27 | 3 | Steve Welburn | |
28 | 3 | Steve Welburn | h2. Evaluation |
29 | 15 | Steve Welburn | |
30 | 30 | Steve Welburn | Strong attention will be payed to evaluate the quality and impact on research practice of the training material. By taking advantage of the established collaborations, the material will be tested in different situations, including postgraduate courses, internal and external seminars and workshops, and tutorials at international conferences. The "International Society for Music Information Retrieval":http://www.ismir.net/ (ISMIR) serves the purposes of fostering the exchange of ideas between and among members whose activities, though diverse, stem from a common interest in music information retrieval. A tutorial proposal been submitted in collaboration with the "Sound Software":http://www.soundsoftware.ac.uk project to the "2012 ISMIR conference":http://ismir2012.ismir.net/ (8-12 October in Porto, Portugal). A tutorial proposal will also be submitted to "DAFx-12":http://dafx12.york.ac.uk/ (Digital Audio Effects conference, 17-21 September in York). |
31 | 1 | Steve Welburn | |
32 | 28 | Steve Welburn | The "QMUL Learning Institute":http://www.learninginstitute.qmul.ac.uk/ will provide support and know-how in evaluation methodologies and analysis. |
33 | 28 | Steve Welburn | |
34 | 28 | Steve Welburn | Feedback will be collected using: |
35 | 27 | Steve Welburn | # anonymous questionnaires after the tutorials/workshops, tailored to the specific audience; |
36 | 27 | Steve Welburn | # online questionnaires; |
37 | 27 | Steve Welburn | # standard course evaluation for postgraduate modules; |
38 | 1 | Steve Welburn | # focus groups interviewed a few months after the training to establish the longer-term impact of the training. |
39 | 28 | Steve Welburn | |
40 | 27 | Steve Welburn | The feedback will be used to iteratively improve the material. Revised versions of all training materials will be available by the end of the project. |
41 | 3 | Steve Welburn | |
42 | 3 | Steve Welburn | h2. Sustainability |
43 | 3 | Steve Welburn | |
44 | 3 | Steve Welburn | We aim to achieve sustainability in the longer term both in the digital music and audio research community, and within QMUL. Our goals are: |
45 | 38 | Steve Welburn | # *to make discipline-specific training sustainable in the digital music and audio research community.* Awareness will be raised by presenting the material in collaboration with the Sound Software project at similar UK research institutions, and at discipline-specific conferences (ISMIR and DAFx). Training material will be made available for reuse through the "Jorum":http://www.jorum.ac.uk/ repository. |
46 | 17 | Steve Welburn | # *to set an example within QMUL.* The project will be used as an example by the "QMUL Learning Institute":http://www.learninginstitute.qmul.ac.uk/ , the School of Electronic Engineering and Computer Science, and the IT Services to expand the data management training to other disciplines by adapting the material and methodologies, starting from related research areas such as Signal Processing, and more generally Electronic Engineering and Computer Science. Data management training will be integrated in postgraduate curricula: every PhD student is expected to take part in approximately 210 hours of development activities (including research methods courses) over the course of their studies and the points gained are mapped against the four domains of the "Vitae":http://www.vitae.ac.uk/ /RCUK "Researcher Development Framework":http://www.vitae.ac.uk/researchers/428241/Researcher-Development-Framework.html . Material for Continuous Professional Development courses for research and academic staff will also be adapted to other disciplines, and all face-to-face training will be complemented by online training material. |
47 | 3 | Steve Welburn | |
48 | 4 | Steve Welburn | h2. [[Workplan]] |
49 | 3 | Steve Welburn | |
50 | 1 | Steve Welburn | The work of the project is divided into four work packages (WP): |
51 | 42 | Steve Welburn | * WP1 Training Material Design |
52 | 1 | Steve Welburn | ** [[WP1_1 Research Of Available Resources|WP1.1 Research Of Available Resources]] |
53 | 43 | Steve Welburn | ** [[WP1_2 Online Training Material|WP1.2 Online Training Material]] |
54 | 42 | Steve Welburn | ** WP1.3 Research Staff Material |
55 | 42 | Steve Welburn | ** WP1.4 Post-Graduate Course Material |
56 | 42 | Steve Welburn | * WP2 Test and evaluation |
57 | 49 | Steve Welburn | ** [[WP2_1 Evaluation Strategy Design|WP2.1 Evaluation Strategy Design]] |
58 | 49 | Steve Welburn | ** WP2.2 Online Materials |
59 | 49 | Steve Welburn | ** WP2.3 Research Staff Materials |
60 | 49 | Steve Welburn | ** WP2.4 Postgraduate Materials |
61 | 54 | Steve Welburn | * [[WP3 Embedding]] |
62 | 53 | Steve Welburn | * [[WP4 Communication and Management]] |
63 | 42 | Steve Welburn | |
64 | 42 | Steve Welburn | An overview of the intended content of the work packages is [[Workplan|here]] |
65 | 3 | Steve Welburn | |
66 | 50 | Steve Welburn | h2. [[Training the Trainers]] |
67 | 50 | Steve Welburn | |
68 | 51 | Steve Welburn | h2. [[Additional Notes]] |
69 | 51 | Steve Welburn | |
70 | 3 | Steve Welburn | h2. References |
71 | 3 | Steve Welburn | |
72 | 4 | Steve Welburn | ??Pryor, G. and Donnelly, M. (2009). "Skilling up to do data: whose role, whose responsibility, whose career?":http://www.ijdc.net/index.php/ijdc/article/view/126 The International Journal of Digital Curation. Vol. 4(2), pp. 158--170.?? |
73 | 18 | Steve Welburn | |
74 | 18 | Steve Welburn | "Research Data Management Skills Support Initiative":http://www.dcc.ac.uk/training/data-management-courses-and-training/skills-frameworks (DaMSSI) "final report":http://www.dcc.ac.uk/webfm_send/532 |
75 | 52 | Steve Welburn | |
76 | 52 | Steve Welburn | [[Printable Version]] |