Building essential biodiversity variables (EBVs) of species distribution and abundance at a global scale
Open Access
- 2 August 2017
- journal article
- review article
- Published by Wiley in Biological Reviews
- Vol. 93 (1), 600-625
- https://doi.org/10.1111/brv.12359
Abstract
Much biodiversity data is collected worldwide, but it remains challenging to assemble the scattered knowledge for assessing biodiversity status and trends. The concept of Essential Biodiversity Variables (EBVs) was introduced to structure biodiversity monitoring globally, and to harmonize and standardize biodiversity data from disparate sources to capture a minimum set of critical variables required to study, report and manage biodiversity change. Here, we assess the challenges of a ‘Big Data’ approach to building global EBV data products across taxa and spatiotemporal scales, focusing on species distribution and abundance. The majority of currently available data on species distributions derives from incidentally reported observations or from surveys where presence‐only or presence–absence data are sampled repeatedly with standardized protocols. Most abundance data come from opportunistic population counts or from population time series using standardized protocols (e.g. repeated surveys of the same population from single or multiple sites). Enormous complexity exists in integrating these heterogeneous, multi‐source data sets across space, time, taxa and different sampling methods. Integration of such data into global EBV data products requires correcting biases introduced by imperfect detection and varying sampling effort, dealing with different spatial resolution and extents, harmonizing measurement units from different data sources or sampling methods, applying statistical tools and models for spatial inter‐ or extrapolation, and quantifying sources of uncertainty and errors in data and models. To support the development of EBVs by the Group on Earth Observations Biodiversity Observation Network (GEO BON), we identify 11 key workflow steps that will operationalize the process of building EBV data products within and across research infrastructures worldwide. 
These workflow steps take multiple sequential activities into account, including identification and aggregation of various raw data sources, data quality control, taxonomic name matching and statistical modelling of integrated data. We illustrate these steps with concrete examples from existing citizen science and professional monitoring projects, including eBird, the Tropical Ecology Assessment and Monitoring network, the Living Planet Index and the Baltic Sea zooplankton monitoring. The identified workflow steps are applicable to both terrestrial and aquatic systems and a broad range of spatial, temporal and taxonomic scales. They depend on clear, findable and accessible metadata, and we provide an overview of current data and metadata standards. Several challenges remain to be solved for building global EBV data products: (i) developing tools and models for combining heterogeneous, multi‐source data sets and filling data gaps in geographic, temporal and taxonomic coverage, (ii) integrating emerging methods and technologies for data collection such as citizen science, sensor networks, DNA‐based techniques and satellite remote sensing, (iii) solving major technical issues related to data product structure, data storage, execution of workflows and the production process/cycle as well as approaching technical interoperability among research infrastructures, (iv) allowing semantic interoperability by developing and adopting standards and tools for capturing consistent data and metadata, and (v) ensuring legal interoperability by endorsing open data or data that are free from restrictions on use, modification and sharing. Addressing these challenges is critical for biodiversity research and for assessing progress towards conservation policy targets and sustainable development goals.
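As a rough illustration of three of the activities named in the abstract (data quality control, taxonomic name matching and aggregation of multi-source records), the following Python sketch may help. It is not the authors' workflow: the record fields (`name`, `lat`, `lon`, `year`, `count`, `effort_hours`), the synonym table and the 1-degree grid are all assumptions made for this example, and dividing counts by sampling effort is only a crude stand-in for the bias corrections the abstract describes (it does not model imperfect detection).

```python
from collections import defaultdict
from math import floor

# Hypothetical synonym table; a real workflow would query a taxonomic name service.
SYNONYMS = {"Parus caeruleus": "Cyanistes caeruleus"}


def valid(rec):
    """Basic quality control: coordinates in range and positive sampling effort."""
    return (
        -90 <= rec["lat"] <= 90
        and -180 <= rec["lon"] <= 180
        and rec["effort_hours"] > 0
    )


def grid_cell(lat, lon, size_deg=1.0):
    """Snap a coordinate to the south-west corner of its grid cell."""
    return (floor(lat / size_deg) * size_deg, floor(lon / size_deg) * size_deg)


def harmonize(records, synonyms=SYNONYMS, size_deg=1.0):
    """Return counts per hour of effort, keyed by (accepted name, cell, year)."""
    totals = defaultdict(lambda: [0, 0.0])  # key -> [summed count, summed effort]
    for rec in records:
        if not valid(rec):
            continue  # drop records that fail quality control
        name = synonyms.get(rec["name"], rec["name"])  # map synonyms to accepted name
        key = (name, grid_cell(rec["lat"], rec["lon"], size_deg), rec["year"])
        totals[key][0] += rec["count"]
        totals[key][1] += rec["effort_hours"]
    return {key: count / effort for key, (count, effort) in totals.items()}


if __name__ == "__main__":
    demo = [
        {"name": "Parus caeruleus", "lat": 52.3, "lon": 4.9, "year": 2015,
         "count": 6, "effort_hours": 2.0},
        {"name": "Cyanistes caeruleus", "lat": 52.7, "lon": 4.1, "year": 2015,
         "count": 4, "effort_hours": 3.0},
    ]
    print(harmonize(demo))
```

Records from two sources that use different names for the same species end up in one grid cell and year, so their counts and effort are pooled before the rate is computed. Summing effort before dividing, rather than averaging per-record rates, keeps short surveys from dominating the result.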
Funding Information
- Australian Research Council (FT0991640)
- Vetenskapsrådet (829‐2009‐6278)
- General Secretariat for Research and Technology
- European Commission (654003)