luis@11
|
1 In October 2010 we opened an online survey, advertised to a
|
luis@11
|
2 number of senior researchers in other groups around the UK. This
|
luis@11
|
3 survey asked for detailed information about the software usage and authorship practices of
|
luis@11
|
4 researchers, with the aim of obtaining a number of individual case
|
luis@11
|
5 points for further examination as well as some broad numerical
|
luis@11
|
6 results. The survey, started in November 2010, closed in April 2011,
|
luis@11
|
7 with 54 complete and 23 partially complete responses. There were
|
luis@11
|
8 responses from at least 16 different institutions.
|
luis@11
|
9
|
luis@11
|
10
|
luis@11
|
11 \subsection{Software Development}
|
luis@11
|
12
|
luis@11
|
13 Our survey shows that 66,6\% of researchers use more than one OS. Linux isn’t
|
luis@11
|
14 used as a single operating system by anybody, which seems to indicate
|
luis@11
|
15 that most Linux users use it for multi-platform developing or for
|
luis@11
|
16 specific software needs. SuperCollider, Android SDK/NDK, NET, PRAAT (Speech Researcher), CUDA-C
|
luis@11
|
17 (GPU Programming), Clojure, Presentation, R.
|
luis@11
|
18
|
luis@11
|
19 Most researchers (56\%) use version control. This kind of system is
|
luis@11
|
20 more widely used by PHD students and Postdocs/Research
|
luis@11
|
21 Assistants. When asked for what kind of technologies were used, SVN
|
luis@11
|
22 (14) and GIT (7) were the most popular systems. CVS (5) and Mercurial
|
luis@11
|
23 (4) were the other available options. Many users use more than one of
|
luis@11
|
24 these systems simultaneously.
|
luis@11
|
25
|
luis@11
|
26 When asked for the usage of code hosting services, 52\% of the researchers
|
luis@11
|
27 said their code stayed in their computers. The most used third-party
|
luis@11
|
28 source code hosting services is SourceForge (6 users). 10 users are
|
luis@11
|
29 using university source code version control tools. \textit{remove
|
luis@11
|
30 numbers, only leave percentages}
|
luis@11
|
31
|
luis@11
|
32 27 users do not produce or maintain software. 16 do, while 11 did not
|
luis@11
|
33 answer. \textit{percentages\ldots}
|
luis@11
|
34
|
luis@11
|
35 Most users (57\%) do not plan to make any software available.
|
luis@11
|
36
|
luis@11
|
37 When asked “Do you develop any software that you do not intend to
|
luis@11
|
38 publish?”, 52\% of the users answered no. Possible commercial use is
|
luis@11
|
39 the main justification given for not publishing the software at this
|
luis@11
|
40 point.
|
luis@11
|
41
|
luis@11
|
42 \subsection{Reproducible Research}
|
luis@11
|
43
|
luis@11
|
44 Most researchers (56\%) acknowledge they don't take the necessary
|
luis@11
|
45 steps to ensure sustainable and reproducible research. Many do not
|
luis@11
|
46 understand the concept of reproducible research. By analyzing this
|
luis@11
|
47 accordingly to the current position, we can see that PhD students are
|
luis@11
|
48 the ones that are less aware of the importance of reproducible
|
luis@11
|
49 research (even the ones that are almost finishing their PhD).
|
luis@11
|
50
|
luis@11
|
51 Many of the researchers that ensure they do the steps necessary to
|
luis@11
|
52 reproducibility say they only give the code and/or data to interested
|
luis@11
|
53 researchers. Some researchers also say that they publish their code in
|
luis@11
|
54 their own pages. At the same time, there are indications that this
|
luis@11
|
55 procedure can lead to unsustainability itself. Many researchers
|
luis@11
|
56 complain about the amount of time and/or complexity of making research
|
luis@11
|
57 reproducible. Also many of them make only parts of their work
|
luis@11
|
58 available. Some researchers also complain about copyright issues in
|
luis@11
|
59 releasing data.
|
luis@11
|
60
|
luis@11
|
61 Many researchers do not understand the full concept of
|
luis@11
|
62 reproducibility. Some assume that explaining the algorithm and the
|
luis@11
|
63 tools used is enough for other researchers to be able to reproduce
|
luis@11
|
64 their results. Finally, some typical (but not widely admitted) answers
|
luis@11
|
65 justify the decision not to embrace reproducibility due to messy code
|
luis@11
|
66 or code/data protection:
|
luis@11
|
67
|
luis@10
|
68
|
luis@10
|
69 %%% Local Variables:
|
luis@10
|
70 %%% mode: latex
|
luis@10
|
71 %%% TeX-master: "cannam"
|
luis@10
|
72 %%% End:
|