Towards geospatial semantic search: exploiting latent semantic relations in geospatial data
- 10 April 2012
- journal article
- research article
- Published by Informa UK Limited in International Journal of Digital Earth
- Vol. 7 (1), 17-37
- https://doi.org/10.1080/17538947.2012.674561
Abstract
This paper reports our efforts to address the grand challenge of the Digital Earth vision in terms of intelligent data discovery from vast quantities of geo-referenced data. We propose an algorithm combining LSA and a Two-Tier Ranking (LSATTR) algorithm based on revised cosine similarity to build a more efficient search engine – Semantic Indexing and Ranking (SIR) – for a semantic-enabled, more effective data discovery. In addition to its ability to handle subject-based search, we propose a mechanism to combine geospatial taxonomy and Yahoo! GeoPlanet for automatic identification of location information from a spatial query and automatic filtering of datasets that are not spatially related. The metadata set, in the format of ISO19115, from NASA's SEDAC (Socio-Economic Data Application Center) is used as the corpus of SIR. Results show that our semantic search engine SIR built on LSATTR methods outperforms existing keyword-matching techniques, such as Lucene, in terms of both recall and precision. Moreover, the semantic associations among all existing words in the corpus are discovered. These associations provide substantial support for automating the population of spatial ontologies. We expect this work to support the operationalization of the Digital Earth vision by advancing the semantic-based geospatial data discovery.Keywords
This publication has 18 references indexed in Scilit:
- Semantic-based web service discovery and chaining for building an Arctic spatial data infrastructureComputers & Geosciences, 2011
- An active crawler for discovering geospatial Web services and their distribution pattern – A case study of OGC Web Map ServiceInternational Journal of Geographical Information Science, 2010
- Mining geophysical parameters through decision-tree analysis to determine correlation with tropical cyclone developmentComputers & Geosciences, 2009
- Ontology-supported scientific data frameworks: The Virtual Solar-Terrestrial Observatory experienceComputers & Geosciences, 2009
- Distributed geospatial information processing: sharing distributed geospatial resources to support Digital EarthInternational Journal of Digital Earth, 2008
- A REGIONAL ECONOMY, LAND USE, AND TRANSPORTATION MODEL (RELU‐TRAN©): FORMULATION, ALGORITHM DESIGN, AND TESTING*Journal of Regional Science, 2007
- Scientific data management in the coming decadeACM SIGMOD Record, 2005
- Service-oriented environments for dynamically interacting with mesoscale weatherComputing in Science & Engineering, 2005
- Indexing by latent semantic analysisJournal of the American Society for Information Science, 1990
- A Computer Movie Simulating Urban Growth in the Detroit RegionEconomic Geography, 1970