Data integration enables global biodiversity synthesis
Open Access
- 9 February 2021
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences of the United States of America
- Vol. 118 (6)
- https://doi.org/10.1073/pnas.2018093118
Abstract
The accessibility of global biodiversity information has surged in the past two decades, notably through widespread funding initiatives for museum specimen digitization and emergence of large-scale public participation in community science. Effective use of these data requires the integration of disconnected datasets, but the scientific impacts of consolidated biodiversity data networks have not yet been quantified. To determine whether data integration enables novel research, we carried out a quantitative text analysis and bibliographic synthesis of >4,000 studies published from 2003 to 2019 that use data mediated by the world’s largest biodiversity data network, the Global Biodiversity Information Facility (GBIF). Data available through GBIF increased 12-fold since 2007, a trend matched by global data use with roughly two publications using GBIF-mediated data per day in 2019. Data-use patterns were diverse by authorship, geographic extent, taxonomic group, and dataset type. Despite facilitating global authorship, legacies of colonial science remain. Studies involving species distribution modeling were most prevalent (31% of literature surveyed) but recently shifted in focus from theory to application. Topic prevalence was stable across the 17-y period for some research areas (e.g., macroecology), yet other topics proportionately declined (e.g., taxonomy) or increased (e.g., species interactions, disease). Although centered on biological subfields, GBIF-enabled research extends surprisingly across all major scientific disciplines. Biodiversity data mobilization through global data aggregation has enabled basic and applied research use at temporal, spatial, and taxonomic scales otherwise not possible, launching biodiversity sciences into a new era.Keywords
This publication has 57 references indexed in Scilit:
- Design and Update of a Classification System: The UCSD Map of SciencePLOS ONE, 2012
- Integrating biodiversity distribution knowledge: toward a global map of lifeTrends in Ecology & Evolution, 2012
- Darwin Core: An Evolving Community-Developed Biodiversity Data StandardPLOS ONE, 2012
- Species Distribution Models: Ecological Explanation and Prediction Across Space and TimeAnnual Review of Ecology, Evolution, and Systematics, 2009
- GenBankNucleic Acids Research, 2007
- How Global Is the Global Biodiversity Information Facility?PLOS ONE, 2007
- Maximum entropy modeling of species geographic distributionsEcological Modelling, 2006
- New developments in museum-based informatics and applications in biodiversity analysisTrends in Ecology & Evolution, 2004
- Visualizing knowledge domainsAnnual Review of Information Science and Technology, 2003
- 10.1162/jmlr.2003.3.4-5.993Applied Physics Letters, 2000