Evidence Promoting Good Data Management » History » Version 71

Steve Welburn, 2012-11-16 10:51 AM

h1. Evidence Promoting Good Data Management

{{>toc}}

If you have any additional examples that you would like to share, please email them to: rdm.c4dm at gmail.com

{{include(Disasters)}}

h2. Anecdotal Tales Of Lost Data

h3. Recovery of Overwritten Hard Disk Data

5 October 2005 Linux Forums - http://tinyurl.com/8t7uaop

<pre>
Hi, a friend of mine just overwrote two months of her
PhD thesis with an older version. I know recovery of
overwritten data is possible, but wonder if I'd need
special hardware to do it. Does anyone know something
about this ?

Thank You.
</pre>

h3. Stolen laptop had PhD research

19 March 2008 Surrey Leader - http://tinyurl.com/9hmtlv4

<pre>
Thirty-five minutes spent in Langley’s Willowbrook
Shopping Centre cost a Surrey woman much more than
she had anticipated.

Langley RCMP say that while she was shopping from
1-1:35 p.m. last Monday, someone broke into her
vehicle and stole a number of items, including
a Mac iBook laptop containing the research she had
compiled as she worked towards her PhD.

“All that information was on that computer and she
has no back-up file,” said Langley RCMP spokesman
Cpl. Brenda Marshall.
</pre>

Google images of "Langley Willowbrook":https://www.google.co.uk/search?num=50&hl=en&q=Langley+Willowbrook+Shopping+Centre&&tbm=isch

h3. Happiness is the return of a stolen computer, with data intact

27 May 2010 The Press, NZ - http://tinyurl.com/38sznnh

<pre>
Never has a man been so happy to see a computer full of data
spreadsheets.

Claudio De Sassi's world fell apart when a car containing almost three
years work towards his PhD was stolen two weeks ago.
De Sassi, a Canterbury University academic, could not hide his joy
yesterday as police reunited him with his stolen laptop and backpack.
</pre>

h3. Thugs steal Christmas, doctoral dreams

22 December 2010 KRQE - http://tinyurl.com/9a5j56f

<pre>
A tiny television sits where a big screen used to, and a Christmas tree
stands with little underneath it...

Even worse than the gifts, the crooks stole a MacBook Pro laptop and a
LaCie hard drive.

The hard drive had … her dissertation and nearly seven years of
research for her doctoral degree she was set to finish in a few weeks.
Osuna had everything backed up on a separate hard drive in a safe, but
burglars made off with that too.

"All I could think about is that all that time is gone, all that effort,
everything is gone," Osuna said.
</pre>

h3. Laptop Stolen From OSU Doctoral Student

6 January 2011 NBC4i - http://tinyurl.com/bmybv9x

<pre>
...her car was broken into and her chrome Mac book pro was stolen.
She has a back-up for all but the last six months of research, but the
most important part of the research had happened recently.
</pre>

h3. Lost Thesis Poster

http://twitpic.com/45t7vu

!http://twitpic.com/show/thumb/45t7vu.jpg!

h3. Recovery

29 September 2011 PostgraduateForum.com > Current PhD Students, PhD Life - http://tinyurl.com/ct5e2no

<pre>
I've 'lost' my thesis

Yes, I 'lost' my thesis today, at around 12:42pm (thesis RIP), microsoft word couldn't
cope with the size of the document and my file got corrupted. I'd removed a small chunk
of it and did some formatting to decrease its size yesterday but that obviously didn't
stop it happening. After a few hours trying to recover it, I gave in and called for
help. I then found out that, even if I'd managed to recover it, it probably wouldn't
be the whole document, there could be parts missing, formatting gone awol, etc No sweat
though, I regularly back up my work so it's just today's work that's been lost, well
morning and lunch really as I spent the afternoon attempting to savage it,-) bit
stressful but hey ho, not the end of the world. So for those of you who don't back your
work up, start doing it now! And regularly! I can't possibly imagine what would have
happened to me if I'd really lost everything weeks before submission...
</pre>

h2. The Lost Laptop Problem

* 2010 Ponemon Institute report for Intel on US laptops
** On average, 2.3% of laptops assigned to employees are lost each year
** In education & research that rises to 3.7%, with 10.8% of laptops being lost before the end of their useful life
*** ~3 years, i.e. within one PhD of allocation!
** 75% are lost outside the workplace
* Very similar results from the 2011 European report!

Intel 2010, The Billion Dollar Lost Laptop Problem - http://tinyurl.com/8c9m4bn

Intel 2011, The Billion Euro Laptop Problem - http://tinyurl.com/9wpbxn9
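
As a rough sanity check on the loss figures above, the sketch below compounds an annual loss rate over a three-year laptop lifetime. It assumes a constant annual rate and independence between years; that is an illustrative simplification, not necessarily how the Ponemon report derived its 10.8% figure. The function name @cumulative_loss@ is hypothetical.

```python
def cumulative_loss(annual_rate: float, years: int) -> float:
    """Probability that a laptop is lost at least once within `years` years.

    Assumes the same loss rate every year and independence between years
    (a simplification made for this sketch, not taken from the report).
    """
    return 1.0 - (1.0 - annual_rate) ** years


if __name__ == "__main__":
    # Annual loss rates quoted above: 2.3% overall, 3.7% in education & research.
    for label, rate in [("all sectors", 0.023), ("education & research", 0.037)]:
        print(f"{label}: {cumulative_loss(rate, 3):.1%} lost within 3 years")
```

Under these assumptions the education and research rate compounds to roughly 10.7% over three years, close to the 10.8% quoted in the report.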

h2. Reliability

h3. Laptop Reliability

* 2011 PC World Laptop Reliability Survey of 63,000 readers:
** 22.6% had significant problems during the product's lifetime
** Of which...
*** 19% had OS problems (~1 in 25 of all laptops)
*** 18% had HDD problems (~1 in 25 of all laptops)
*** 10% had PSU problems (~1 in 50 of all laptops)

PC World 2011 - http://tinyurl.com/876qza5
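
The "1 in N" figures in the list above follow from multiplying each fault's share by the 22.6% of laptops that had significant problems. A minimal sketch of that arithmetic, using only the percentages quoted above (the function name @one_in_n@ is hypothetical):

```python
def one_in_n(share_of_problem_laptops: float, problem_rate: float = 0.226) -> float:
    """Return N such that roughly 1 in N of *all* laptops had this fault.

    `problem_rate` is the fraction of laptops with significant problems;
    `share_of_problem_laptops` is the fraction of those with a given fault.
    """
    overall = share_of_problem_laptops * problem_rate
    return 1.0 / overall


if __name__ == "__main__":
    # Shares quoted in the survey summary above.
    for fault, share in [("OS", 0.19), ("HDD", 0.18), ("PSU", 0.10)]:
        print(f"{fault}: about 1 in {one_in_n(share):.0f} of all laptops")
```

This gives about 1 in 23 (OS), 1 in 25 (HDD) and 1 in 44 (PSU), broadly in line with the rounded figures in the list.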

h3. Hard Disk Failures

* Failure Trends In A Large Disk Drive Population
** USENIX Conference on File and Storage Technologies 2007 (FAST '07)
** Eduardo Pinheiro, Wolf-Dietrich Weber & Luiz André Barroso, Google Inc.
* Data collected from over 100,000 disk drives at Google
* As part of repair procedures:
** ~13% of disk drives replaced over 3 years
** ~20% of disk drives replaced over 4 years

Article: http://tinyurl.com/octz6b

[[Failure Trends In A Large Disk Drive Population|More info]]

h2. Data management in the cloud

See JISC/DCC document "Curation In The Cloud" - http://tinyurl.com/8nogtmv

Service agreements may grant the service provider wide-ranging rights over your data.

h3. Google Terms Of Service

1 March 2012 Google Terms of Service - http://tinyurl.com/89dc9fa

<pre>
When you upload or otherwise submit content to our Services, you give
Google (and those we work with) a worldwide license to use, host, store,
reproduce, modify, create derivative works (such as those resulting from
translations, adaptations or other changes we make so that your
content works better with our Services), communicate, publish, publicly
perform, publicly display and distribute such content. The rights you
grant in this license are for the limited purpose of operating, promoting,
and improving our Services, and to develop new ones. This license
continues even if you stop using our Services (for example, for a
business listing you have added to Google Maps).
</pre>

h3. Microsoft Services Agreement

19 October 2012 Microsoft Services Agreement - http://tinyurl.com/8e4kucy

<pre>
When you upload your content to the services, you agree that it may
be used, modified, adapted, saved, reproduced, distributed, and
displayed to the extent necessary to protect you and to provide, protect
and improve Microsoft products and services. For example, we may
occasionally use automated means to isolate information from email,
chats, or photos in order to help detect and protect against spam and
malware, or to improve the services with new features that makes them
easier to use. When processing your content, Microsoft takes steps to
help preserve your privacy.
</pre>

h2. Archiving Data

h3. BBC Domesday Project

A 1986 project to create a modern-day Domesday Book (an early example of crowd-sourcing):
* Used “BBC Master” computers with data on laserdisc
* Collected 147,819 pages of text and 23,225 photos
* Expiring media and obsolete technology put the data at risk!

Domesday Reloaded (2011):
* Required emulation of software
* Images restored from original masters
* http://www.bbc.co.uk/history/domesday

To allow long-term access to data:
* Don't use obscure formats!
* Don't use obscure media!
* Don't rely on technology being available!
* Do keep original source material!

Google images for "BBC Domesday":https://www.google.co.uk/search?tbm=isch&q=bbc+domesday

h2. Sharing Data

Piwowar, Heather A., Roger S. Day, and Douglas B. Fridsma. "Sharing detailed research data is associated with increased citation rate.":http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0000308
PLoS One 2.3 (2007): e308.

h2. Related Media

h3. Disk Drives Break

"DataCent collection of disk drive failure sounds":http://datacent.com/hard_drive_sounds.php

h3. Laptops Break / Get Broken

* "Shot laptop":http://lilysussman.wordpress.com/tag/laptop-destroyed/
* "Google images of broken laptops":https://www.google.co.uk/search?q=broken%20laptop&um=1&tbm=isch

h2. More To Read

Albers, S. "Editorial: Well Documented Articles Achieve More Impact":http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1568022
BuR Business Research Journal, Vol. 2, No. 2, May 2009

Anderson, Richard G., et al. "The role of data/code archives in the future of economic research.":http://www.tandfonline.com/doi/abs/10.1080/13501780801915574
Journal of Economic Methodology 15.1 (2008): 99-119.

Borgman, Christine L. "The conundrum of sharing research data."
Journal of the American Society for Information Science and Technology 63.6 (2012): 1059-1078.

Campbell, Eric G., et al. "Data withholding in academic genetics."
JAMA: The Journal of the American Medical Association 287.4 (2002): 473-480.

Evanschitzky, Heiner, et al. "Replication research's disturbing trend.":http://www.sciencedirect.com/science/article/pii/S0148296306002347
Journal of Business Research 60.4 (2007): 411-415.

Fischer, Beth A., and Michael J. Zigmond. "The essential nature of sharing in science."
Science and Engineering Ethics 16.4 (2010): 783-799.

Freckleton, R.P., P. Hulme, P. Giller and G. Kerby. 2005. "The changing face of applied ecology.":http://onlinelibrary.wiley.com/doi/10.1111/j.1365-2664.2005.00969.x/full
J. Appl. Ecol. 42:1–3.

Gleditsch, N.P., C. Metelits and H. Strand. 2003. "Posting your data: Will you be scooped or will you be famous?"
Int. Stud. Perspect. 4:89–97.

Lancaster, Larry, and Alan Rowe. "Measuring Real World Data Availability.":http://static.usenix.org/publications/library/proceedings/lisa2001/tech/full_papers/lancaster/lancaster_html/
Proceedings of the LISA 2001 15th Systems Administration Conference. 2001.

McCullough, Bruce D., Kerry Anne McGeary, and Teresa D. Harrison. "Lessons from the JMCB Archive.":http://muse.jhu.edu/journals/mcb/summary/v038/38.4mccullough.html
Journal of Money, Credit, and Banking 38.4 (2006): 1093-1107.

Piwowar, Heather A., and Wendy W. Chapman. "Public sharing of research datasets: a pilot study of associations."
Journal of Informetrics 4.2 (2010): 148-156.

Piwowar, Heather A., et al. "Towards a data sharing culture: recommendations for leadership from academic health centers."
PLoS Medicine 5.9 (2008): e183.

Schroeder, Bianca, and Garth A. Gibson. "Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you.":http://www.usenix.org/event/fast07/tech/schroeder/schroeder.pdf
Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST). 2007.

Vandewalle, Patrick, Jelena Kovacevic, and Martin Vetterli. "Reproducible research in signal processing."
IEEE Signal Processing Magazine 26.3 (2009): 37-47.

Whitlock, Michael C. "Data archiving in ecology and evolution: best practices."
Trends in Ecology & Evolution 26.2 (2011): 61-65.

Whitlock, Michael C., et al. "Data archiving."
The American Naturalist 175.2 (2010): 145-146.

Wicherts, Jelte M., Marjan Bakker, and Dylan Molenaar. "Willingness to share research data is related to the strength of the evidence and the quality of reporting of statistical results."
PLoS One 6.11 (2011): e26828.

Thatcher. "Need for an international repository for original research data."
Science 70.1807 (16 August 1929): 167-168. DOI: 10.1126/science.70.1807.167

Kleppner, Daniel, and Phillip A. Sharp. "Research data in the digital age."
Science 325.5939 (24 July 2009): 368. DOI: 10.1126/science.1178927

Norman, Colin. "Sharing research data urged."
Science 229.4714 (16 August 1985): 632. DOI: 10.1126/science.229.4714.632