Enhancing discovery in spatial data infrastructures using a search engine
Open Access
- 21 May 2018
- journal article
- research article
- Published by PeerJ in PeerJ Computer Science
- Vol. 4, e152
- https://doi.org/10.7717/peerj-cs.152
Abstract
A spatial data infrastructure (SDI) is a framework of geospatial data, metadata, users and tools intended to provide an efficient and flexible way to use spatial information. One of the key software components of an SDI is the catalogue service, which is needed to discover, query and manage the metadata. Catalogue services in an SDI are typically based on the Open Geospatial Consortium (OGC) Catalogue Service for the Web (CSW) standard, which defines common interfaces for accessing the metadata information. A search engine is a software system capable of supporting fast and reliable search, which may use ‘any means necessary’ to get users to the resources they need quickly and efficiently. These techniques may include full text search, natural language processing, weighted results, fuzzy tolerance results, faceting, hit highlighting, recommendations and many others. In this paper we present an example of a search engine being added to an SDI to improve search against large collections of geospatial datasets. The Centre for Geographic Analysis (CGA) at Harvard University re-engineered the search component of its public domain SDI (Harvard WorldMap), which is based on the GeoNode platform. A search engine was added to the SDI stack to enhance the CSW catalogue discovery abilities. It is now possible to discover spatial datasets from metadata by using the standard search operations of the catalogue and to take advantage of the new abilities of the search engine to return relevant and reliable content to SDI users.
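Three of the search-engine techniques named in the abstract (weighted results, fuzzy tolerance and faceting) can be illustrated with a minimal Python sketch. It is a toy, in-memory illustration only and is not the implementation used in Harvard WorldMap. The sample records, the field names, the `WEIGHTS` values, the `search` function and the similarity cutoff are all assumptions made for this example.

```python
from difflib import get_close_matches

# Hypothetical metadata records; ids, fields and values are illustrative only.
RECORDS = [
    {"id": 1, "title": "Boston road network",
     "abstract": "Streets and highways of Boston", "category": "transportation"},
    {"id": 2, "title": "Global rivers",
     "abstract": "Major rivers and road crossings", "category": "hydrology"},
    {"id": 3, "title": "Historic roads of China",
     "abstract": "Digitized trade routes", "category": "transportation"},
]

# Assumed field weights: a match in the title counts more than one in the abstract.
WEIGHTS = {"title": 3.0, "abstract": 1.0}


def tokenize(text):
    """Lowercase a string and split it on whitespace."""
    return text.lower().split()


def search(query, records=RECORDS, category=None, cutoff=0.8):
    """Return (ranked record ids, facet counts by category) for a free-text query."""
    scores = {}
    for rec in records:
        score = 0.0
        for field, weight in WEIGHTS.items():
            tokens = tokenize(rec[field])
            for term in tokenize(query):
                # Fuzzy tolerance: count tokens that are similar to the term,
                # so that "roads" also matches "road".
                matches = get_close_matches(term, tokens, n=len(tokens), cutoff=cutoff)
                score += weight * len(matches)
        if score > 0:
            scores[rec["id"]] = score

    by_id = {rec["id"]: rec for rec in records}

    # Faceting: count matching records per category before any filter is applied.
    facets = {}
    for rid in scores:
        cat = by_id[rid]["category"]
        facets[cat] = facets.get(cat, 0) + 1

    # Weighted ranking: highest score first, ties broken by record id.
    ranked = sorted(scores, key=lambda rid: (-scores[rid], rid))
    if category is not None:
        ranked = [rid for rid in ranked if by_id[rid]["category"] == category]
    return ranked, facets
```

With these sample records, `search("roads")` ranks the two records whose titles match ahead of the one that matches only in its abstract, and reports two hits under `transportation` and one under `hydrology`. Passing `category="hydrology"` narrows the result to that facet. A production SDI would delegate this work to a dedicated search engine rather than to hand-written scoring.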
Funding Information
- U.S. National Endowment for the Humanities, Digital Humanities Implementation (#HK5009113)
- U.S. National Science Foundation Industry-University Cooperative Research Centers Program (IUCRC)
- Spatiotemporal Thinking, Computing, and Applications Center (STC) (#1338914)
- Harvard University
- Harvard’s Institute for Quantitative Social Science