Evidence Promoting Good Data Management » History » Version 73

Steve Welburn, 2012-11-16 11:33 AM

h1. Evidence Promoting Good Data Management

{{>toc}}

If you have any additional examples that you would like to share, please email them to: rdm.c4dm at gmail.com

[[Disasters]]

[[Tales Of Lost Data]]

h2. The Lost Laptop Problem

* 2010 Ponemon Institute report for Intel re. US laptops
** On average, 2.3% of laptops assigned to employees are lost each year
** In education & research that rises to 3.7%, with 10.8% of laptops being lost before the end of their useful life
*** ~3 years i.e. within 1 PhD of allocation!
** 75% lost outside the workplace
* Very similar results from 2011 European report!

Intel 2010, The Billion Dollar Lost Laptop Problem - http://tinyurl.com/8c9m4bn

Intel 2011, The Billion Euro Laptop Problem - http://tinyurl.com/9wpbxn9
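As a rough check on how an annual loss rate compounds over a laptop's useful life, the following sketch may help. It assumes a constant annual loss rate, independence between years, and a 3-year useful life; these are simplifying assumptions made here, not the method used in the Intel/Ponemon reports, and the function name is hypothetical.

```python
def cumulative_loss(annual_rate: float, years: int) -> float:
    """Probability that a laptop is lost at least once within `years`.

    Assumes a constant, independent annual loss rate, which is a
    simplification and not necessarily how the reports derive their figures.
    """
    return 1.0 - (1.0 - annual_rate) ** years


if __name__ == "__main__":
    # Annual loss rates quoted in the 2010 report (US figures).
    rates = {"all sectors": 0.023, "education & research": 0.037}
    for sector, rate in rates.items():
        print(f"{sector}: {cumulative_loss(rate, 3):.1%} lost within 3 years")
```

Under these assumptions the 3.7% annual rate for education & research compounds to roughly 10.7% over 3 years, which is in line with the 10.8% figure quoted in the report.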

h2. [[Reliability]]

h3. Laptop Reliability

* 2011 PC World Laptop Reliability Survey of 63,000 readers:
** 22.6% had significant problems during the product's lifetime
** Of which...
*** 19% had OS problems, i.e. ~1 in 25 of all laptops
*** 18% had HDD problems, i.e. ~1 in 25 of all laptops
*** 10% had PSU problems, i.e. ~1 in 45 of all laptops

PC World 2011 - http://tinyurl.com/876qza5
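The "1 in N of all laptops" figures can be reproduced with a short calculation. This sketch assumes that the percentages listed under "Of which..." are shares of the laptops that had significant problems, so each is multiplied by the 22.6% overall problem rate; the function names are hypothetical.

```python
def share_of_all_laptops(problem_rate: float, share_of_problems: float) -> float:
    """Fraction of all laptops affected by one kind of problem.

    Assumes `share_of_problems` is a share of the laptops that had
    significant problems, not of all laptops surveyed.
    """
    return problem_rate * share_of_problems


def one_in(fraction: float) -> int:
    """Express a fraction as '1 in N', rounded to the nearest whole N."""
    return round(1.0 / fraction)


if __name__ == "__main__":
    PROBLEM_RATE = 0.226  # laptops with significant problems
    BREAKDOWN = {"OS": 0.19, "HDD": 0.18, "PSU": 0.10}
    for kind, share in BREAKDOWN.items():
        fraction = share_of_all_laptops(PROBLEM_RATE, share)
        print(f"{kind}: {fraction:.1%} of all laptops, about 1 in {one_in(fraction)}")
```

The survey summary above quotes these as round numbers, so small differences from the computed values are expected.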

h3. Hard Disk Failures

* Failure Trends In A Large Disk Drive Population
** USENIX Conference on File and Storage Technologies 2007 (FAST '07)
** Eduardo Pinheiro, Wolf-Dietrich Weber & Luiz André Barroso, Google Inc.
* Data collected from over 100,000 disk drives at Google
* As part of repair procedures:
** ~13% of disk drives replaced over 3 years
** ~20% of disk drives replaced over 4 years

Article: http://tinyurl.com/octz6b

[[Failure Trends In A Large Disk Drive Population|More info]]

h2. Data management in the cloud

See JISC/DCC document "Curation In The Cloud" - http://tinyurl.com/8nogtmv

Service agreements may grant the service provider wide-ranging rights over your data.

h3. Google Terms Of Service

1 March 2012 Google Terms of Service: http://tinyurl.com/89dc9fa

<pre>
When you upload or otherwise submit content to our Services, you give
Google (and those we work with) a worldwide license to use, host, store,
reproduce, modify, create derivative works (such as those resulting from
translations, adaptations or other changes we make so that your
content works better with our Services), communicate, publish, publicly
perform, publicly display and distribute such content. The rights you
grant in this license are for the limited purpose of operating, promoting,
and improving our Services, and to develop new ones. This license
continues even if you stop using our Services (for example, for a
business listing you have added to Google Maps).
</pre>

h3. Microsoft Services Agreement

19 October 2012 Microsoft Services Agreement: http://tinyurl.com/8e4kucy

<pre>
When you upload your content to the services, you agree that it may
be used, modified, adapted, saved, reproduced, distributed, and
displayed to the extent necessary to protect you and to provide, protect
and improve Microsoft products and services. For example, we may
occasionally use automated means to isolate information from email,
chats, or photos in order to help detect and protect against spam and
malware, or to improve the services with new features that makes them
easier to use. When processing your content, Microsoft takes steps to
help preserve your privacy.
</pre>

h2. Archiving Data

h3. BBC Domesday Project

1986 project to create a modern-day Domesday Book (an early example of crowd-sourcing)
* Used “BBC Master” computers with data on laserdisc
* Collected 147,819 pages of text and 23,225 photos
* Ageing media and obsolete technology put the data at risk!

Domesday Reloaded (2011)
* Required emulation of software
* Images restored from original masters
* http://www.bbc.co.uk/history/domesday

To allow long-term access to data:
* Don't use obscure formats!
* Don't use obscure media!
* Don't rely on technology being available!
* Do keep original source material!

Google images for "BBC Domesday":https://www.google.co.uk/search?tbm=isch&q=bbc+domesday

h2. Sharing Data

Piwowar, Heather A., Roger S. Day, and Douglas B. Fridsma. "Sharing detailed research data is associated with increased citation rate.":http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0000308
PLoS One 2.3 (2007): e308.

h2. Related Media

h3. Disk Drives Break

"DataCent collection of disk drive failure sounds":http://datacent.com/hard_drive_sounds.php

h3. Laptops Break / Get Broken

* "Shot laptop":http://lilysussman.wordpress.com/tag/laptop-destroyed/
* "Google images of broken laptops":https://www.google.co.uk/search?q=broken%20laptop&um=1&tbm=isch

h2. More To Read

Albers, S. "Editorial: Well Documented Articles Achieve More Impact":http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1568022
BuR Business Research Journal, Vol. 2, No. 2, May 2009

Anderson, Richard G., et al. "The role of data/code archives in the future of economic research.":http://www.tandfonline.com/doi/abs/10.1080/13501780801915574
Journal of Economic Methodology 15.1 (2008): 99-119.

Borgman, Christine L. "The conundrum of sharing research data."
Journal of the American Society for Information Science and Technology 63.6 (2012): 1059-1078.

Campbell, Eric G., et al. "Data withholding in academic genetics."
JAMA: The Journal of the American Medical Association 287.4 (2002): 473-480.

Evanschitzky, Heiner, et al. "Replication research's disturbing trend.":http://www.sciencedirect.com/science/article/pii/S0148296306002347
Journal of Business Research 60.4 (2007): 411-415.

Fischer, Beth A., and Michael J. Zigmond. "The essential nature of sharing in science."
Science and Engineering Ethics 16.4 (2010): 783-799.

Freckleton, R.P., P. Hulme, P. Giller and G. Kerby. 2005. "The changing face of applied ecology.":http://onlinelibrary.wiley.com/doi/10.1111/j.1365-2664.2005.00969.x/full
J. Appl. Ecol. 42:1–3.

Gleditsch, N.P., C. Metelits and H. Strand. 2003. "Posting your data: Will you be scooped or will you be famous?"
Int. Stud. Perspect. 4:89–97.

Lancaster, Larry, and Alan Rowe. "Measuring Real World Data Availability.":http://static.usenix.org/publications/library/proceedings/lisa2001/tech/full_papers/lancaster/lancaster_html/
Proceedings of the LISA 2001 15th Systems Administration Conference. 2001.

McCullough, Bruce D., Kerry Anne McGeary, and Teresa D. Harrison. "Lessons from the JMCB Archive.":http://muse.jhu.edu/journals/mcb/summary/v038/38.4mccullough.html
Journal of Money, Credit, and Banking 38.4 (2006): 1093-1107.

Piwowar, Heather A., and Wendy W. Chapman. "Public sharing of research datasets: a pilot study of associations."
Journal of Informetrics 4.2 (2010): 148-156.

Piwowar, Heather A., et al. "Towards a data sharing culture: recommendations for leadership from academic health centers."
PLoS Medicine 5.9 (2008): e183.

Schroeder, Bianca, and Garth A. Gibson. "Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you?":http://www.usenix.org/event/fast07/tech/schroeder/schroeder.pdf
Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST). 2007.

Vandewalle, Patrick, Jelena Kovacevic, and Martin Vetterli. "Reproducible research in signal processing."
Signal Processing Magazine, IEEE 26.3 (2009): 37-47.

Whitlock, Michael C. "Data archiving in ecology and evolution: best practices."
Trends in Ecology & Evolution 26.2 (2011): 61-65.

Whitlock, Michael C., et al. "Data archiving."
The American Naturalist 175.2 (2010): 145-146.

Wicherts, Jelte M., Marjan Bakker, and Dylan Molenaar. "Willingness to share research data is related to the strength of the evidence and the quality of reporting of statistical results."
PLoS One 6.11 (2011): e26828.

Thatcher. "Need for an international repository for original research data."
Science 70.1807 (16 August 1929): 167-168.
DOI: 10.1126/science.70.1807.167

Kleppner, Daniel, and Phillip A. Sharp. "Research data in the digital age."
Science 325.5939 (24 July 2009): 368.
DOI: 10.1126/science.1178927

Norman, Colin. "Sharing research data urged."
Science 229.4714 (16 August 1985): 632.
DOI: 10.1126/science.229.4714.632