Make It Easier to (Re)use Your Data
Ethan P. White, Elita Baldridge, Zachary T. Brym, Kenneth J. Locey, Daniel J. McGlinn, and Sarah R. Supp: "Nine simple ways to make it easier to (re)use your data". Ideas in Ecology and Evolution, 6(2):1-10. DOI:10.4033/iee.2013.6b.6.f
Sharing data is increasingly considered to be an important part of the scientific process. Making your data publicly available allows original results to be reproduced and new analyses to be conducted. While sharing your data is the first step in allowing reuse, it is also important that the data be easy to understand and use. We describe nine simple ways to make it easy to reuse the data that you share and also make it easier to work with it yourself. Our recommendations focus on making your data understandable, easy to analyze, and readily available to the wider community of scientists.
Their nine specific recommendations (elaborated at readable length in the paper) are:
- Share your data.
- Provide metadata.
- Provide an unprocessed form of the data.
- Use standard data formats.
- Use good null values.
- Make it easy to combine your data with other datasets.
- Perform basic quality control.
- Use an established repository.
- Use an established and liberal license.
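A few of these recommendations are easy to illustrate in code. The sketch below is not taken from the paper: it assumes a hypothetical CSV table with columns `site`, `date`, and `mass_g`, in which missing measurements are left blank, and the function names are made up for illustration. It only aims to show how a standard format and an unambiguous null value keep a basic quality-control check short.

```python
import csv
import io

# Hypothetical example data: a plain CSV table with one observation per row.
# Missing measurements are left blank rather than coded as 0 or -999.
RAW = """site,date,mass_g
A,2013-05-01,12.5
A,2013-05-02,
B,2013-05-01,-3.0
B,2013-05-02,14.1
"""


def load_rows(text):
    """Parse CSV text, converting blank mass values to None."""
    rows = []
    for record in csv.DictReader(io.StringIO(text)):
        mass = record["mass_g"].strip()
        record["mass_g"] = float(mass) if mass else None
        rows.append(record)
    return rows


def quality_problems(rows):
    """Return (row_number, message) pairs for rows that need a second look."""
    problems = []
    for number, row in enumerate(rows, start=1):
        if row["mass_g"] is None:
            problems.append((number, "missing mass"))
        elif row["mass_g"] <= 0:
            problems.append((number, "mass must be positive"))
    return problems


rows = load_rows(RAW)
problems = quality_problems(rows)
for number, message in problems:
    print(f"row {number}: {message}")
```

Because the blank cell is read as `None` rather than as a number, the missing measurement cannot be mistaken for a real zero, and the negative mass is reported separately as a likely data-entry error.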
It's a great outline for a half-day introduction to data management as part of an "extended play" Software Carpentry course, particularly when combined with William Stafford Noble's "A Quick Guide to Organizing Computational Biology Projects". We hope to turn the pair into lessons by September.