Keep multiple copies: two copies of your data, in two different locations, one of them on the cloud (Reed’s Google Drive) unless not allowed due to data size or privacy. Email data@reed.edu if you think you might have special circumstances
Backup your data regularly. Backup recommendations from Reed.
Regularly audit your backups: are they actually happening? Are they happening the way you intended??? Check for expected number of files, matching file size between copies, can the files be opened.
Keep the 10 Steps to Safer Computing in mind
Use this | Instead of this |
Project1/SiteB/SiteB_2021-09-15_rawdata.txt | Project1/SiteB/2021/09/15/rawdata.txt |
sequence_readings_2021-09-15.csv | sequence_readings_3rd_sept.csv |
SiteB_2021-09-15_processed_v02.r | SiteB_2021-09-15_processed_redo.r |
Document the decisions you make about your data during the research process. It is easy to forget and you may need to refer to it later during the thesis writing process.
Describe the content of your data files in a data dictionary. List variables with definitions, units of measure, scope notes, coded values. Document how missing values are represented. List the file formats you are using.
If possible, migrate your data out of software specific formats to an open format (such as a csv). This is especially true for files that require an expensive and specailized software program to open - will you still be able to access your data after you graduate?
Consider uploading the data to the Reed College Dataverse
Data @ Reed offers R Resources help. Still need help? Send an email to them team, data@reed.edu
This work by the Reed College Library is licensed under a Creative Commons CC-BY Attribution 4.0 International License.
Reed College Library | Email: library@reed.edu | Phone: 503-777-7702 | 3203 Southeast Woodstock Boulevard, Portland, Oregon 97202-8199