DATS, the data tag suite to enable discoverability of datasets
Open Access
- 6 June 2017
- journal article
- research article
- Published by Springer Science and Business Media LLC in Scientific Data
- Vol. 4 (1), 170059
- https://doi.org/10.1038/sdata.2017.59
Abstract
Today’s science increasingly requires effective ways to find and access existing datasets that are distributed across a range of repositories. For researchers in the life sciences, discoverability of datasets may soon become as essential as identifying the latest publications via PubMed. Through an international collaborative effort funded by the National Institutes of Health (NIH)’s Big Data to Knowledge (BD2K) initiative, we have designed and implemented the DAta Tag Suite (DATS) model to support the DataMed data discovery index. DataMed’s goal is to be for data what PubMed has been for the scientific literature. Akin to the Journal Article Tag Suite (JATS) used in PubMed, the DATS model enables submission of metadata on datasets to DataMed. DATS has a core set of elements, which are generic and applicable to any type of dataset, and an extended set that can accommodate more specialized data types. DATS is a platform-independent model also available as an annotated serialization in schema.org, which in turn is widely used by major search engines like Google, Microsoft, Yahoo and Yandex.Keywords
This publication has 14 references indexed in Scilit:
- Finding useful data across multiple biomedical data repositories using DataMedNature Genetics, 2017
- Discovering and linking public omics data sets using the Omics Discovery IndexNature Biotechnology, 2017
- A Data Citation Roadmap for Scholarly Data RepositoriesPublished by Cold Spring Harbor Laboratory ,2016
- BioSharing: curated and crowd-sourced metadata standards, databases and data policies in the life sciencesDatabase: The Journal of Biological Databases and Curation, 2016
- The FAIR Guiding Principles for scientific data management and stewardshipScientific Data, 2016
- Perspective: Sustaining the big-data ecosystemNature, 2015
- The NIH Big Data to Knowledge (BD2K) initiativeJournal of the American Medical Informatics Association, 2015
- The center for expanded data annotation and retrievalJournal of the American Medical Informatics Association, 2015
- Toward interoperable bioscience dataNature Genetics, 2012
- bioCADDIE white paper - Data Discovery Index