Assigning Geographical Scopes To Web Pages
- 1 January 2005
- book chapter
- conference paper
- Published by Springer Science and Business Media LLC in Lecture Notes in Computer Science
Abstract
Finding automatic ways of attaching geographical scopes to on-line resources, also called “geo-referencing” documents, is a challenging problem that is receiving increasing attention [1,5,3]. Here we present a system architecture and a process for identifying the geographical scope of Web pages, defining a scope as the region where more people than average would find that page relevant. We rely on typical Web IR heuristics (i.e. feature weighting, hypertext topic locality, anchor description) and on assumptions about how people use geographical references in documents. The method involves three major steps. First, geographical named entities are identified in the text. Next, the named entities found are propagated through the Web linkage graph. Finally, a geographical ontology is used to disambiguate among the named entities associated with a document, thereby selecting the most likely scope. In the future, we plan to use scopes in new location-aware search tools.
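The three steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy gazetteer, the child-to-parent ontology, the damping factor, the direction of propagation (from a page to the pages it links to), and the majority-coverage rule for picking a scope are all assumptions made for illustration, and every function name is hypothetical.

```python
import re
from collections import defaultdict

# Hypothetical toy gazetteer: lower-cased place name -> ontology node.
GAZETTEER = {"lisbon": "Lisbon", "porto": "Porto", "portugal": "Portugal",
             "madrid": "Madrid", "spain": "Spain"}

# Hypothetical toy ontology: child -> parent ("part of") relation.
PARENT = {"Lisbon": "Portugal", "Porto": "Portugal", "Madrid": "Spain",
          "Portugal": "Europe", "Spain": "Europe"}


def extract_places(text):
    """Step 1 (assumed form): count gazetteer matches among single-word tokens."""
    weights = defaultdict(float)
    for token in re.findall(r"[a-z]+", text.lower()):
        if token in GAZETTEER:
            weights[GAZETTEER[token]] += 1.0
    return dict(weights)


def propagate(page_places, links, damping=0.5):
    """Step 2 (assumed form): one round in which each page passes a damped
    copy of its place weights to the pages it links to."""
    result = {page: dict(places) for page, places in page_places.items()}
    for src, targets in links.items():
        for dst in targets:
            bucket = result.setdefault(dst, {})
            for place, weight in page_places.get(src, {}).items():
                bucket[place] = bucket.get(place, 0.0) + damping * weight
    return result


def ancestors(node):
    """Return the node followed by its chain of parents up to the root."""
    chain = [node]
    while node in PARENT:
        node = PARENT[node]
        chain.append(node)
    return chain


def select_scope(places, threshold=0.5):
    """Step 3 (assumed form): choose the most specific ontology node whose
    subtree holds more than `threshold` of the page's total place weight."""
    total = sum(places.values())
    if total == 0:
        return None
    covered = defaultdict(float)
    for place, weight in places.items():
        for node in ancestors(place):
            covered[node] += weight
    candidates = [n for n, w in covered.items() if w / total > threshold]
    if not candidates:
        return None
    # A longer ancestor chain means a deeper, more specific node.
    return max(candidates, key=lambda n: len(ancestors(n)))


# Usage: page "b" mentions no places but is linked from page "a".
pages = {"a": "Hotels in Lisbon and Porto, Portugal",
         "b": "Welcome to our homepage"}
links = {"a": ["b"]}
found = {page: extract_places(text) for page, text in pages.items()}
scopes = {page: select_scope(places)
          for page, places in propagate(found, links).items()}
print(scopes)
```

In this toy run both pages receive the scope "Portugal": page "a" names two Portuguese cities and the country itself, and page "b" inherits those references through the link. A realistic system would need multi-word place names, ambiguous names shared by several places, and a much larger ontology, none of which this sketch attempts.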
This publication has 4 references indexed in Scilit:
- Web-a-where. Published by Association for Computing Machinery (ACM), 2004
- Spatial information retrieval and geographical ontologies: an overview of the SPIRIT project. Published by Association for Computing Machinery (ACM), 2002
- Named Entity recognition without gazetteers. Published by Association for Computational Linguistics (ACL), 1999
- Geographic Names. D-Lib Magazine, 1999