Chat with a librarian or reference assistant
What is curation:
The active and ongoing management of data through its lifecycle of interest and usefulness to scholarship, science, and education. Data curation enables data discovery and retrieval, maintains data quality, adds value, and provides for re-use over time through activities including authentication, archiving, management, preservation, and representation.
Johnston, Lisa R. 2017. Curating Research Data, Volume One: Practical Strategies for Your Digital Repository. 1 edition. Chicago, Illinois: Amer Library Assn.
What WashU curators do:
Advise on repository selection
Review data and documentation
Provide suggestions to improve the FAIRness
Help you prepare a readme file to accompany your data
The Office of Science and Technology Policy charged a subgroup to explore and write a report on the Desirable Characteristic of Data Repositories for Federally Funded Research. Below is an overview of the recommendations. For WashU, we can address these characteristics directly, but because there are many options to choose from in the other categories, specificity is more difficult. Here is a desirable characteristics checklist, which you can copy and to help you evaluate whether a data repository meets these characteristics. Check out https://www.re3data.org/ to explore data repositories.
Guidance |
Institutional (WashU) | General | Domain |
Free and Easy Access | yes | varies | varies |
Clear Use Guidance | yes | varies | varies |
Risk Management | Yes and.... | varies | varies |
Retention Policy | yes | varies | varies |
Long-term Organizational Sustainability | yes | varies | varies |
Authentication | yes | yes | yes |
Long-term Technical Sustainability | yes | varies | varies |
Security and Integrity | yes | varies | varies |
Unique Persistent Identifiers | yes | varies | varies |
Metadata | yes | yes | yes |
Curation/ Quality Assurance | yes | usually not | varies |
Broad and Measured Reuse | yes | varies | varies |
Common Format | yes | varies | varies |
Provenance | yes | varies | varies |
Organization Infrastructure | |||
Technology | |||
Digital Object Management |
The Data Curation Network (DCN) is a membership organization of institutional and non-profit data repositories whose vision is to advance open research by making data more ethical, reusable, and understandable.
How FAIR is your data?
What is FAIR
In 2016, the ‘FAIR Guiding Principles for scientific data management and stewardship’ were published in Scientific Data. The authors intended to provide guidelines to improve the Findability, Accessibility, Interoperability, and Reuse of digital assets. The principles emphasise machine-actionability (i.e., the capacity of computational systems to find, access, interoperate, and reuse data with none or minimal human intervention) because humans increasingly rely on computational support to deal with data as a result of the increase in volume, complexity, and creation speed of data. (Go-FAIR)
Findable
In an indexed repository, with a unique, persistent ID, and rich metadata
Accessible
Repository uses open, standard protocols so the metadata and data can be accessed
Interoperable
data are in formal, standard, open application languages
Reusable
well documented, explicit provenance, open licenses, follows community standards