Home Location Identification of Twitter Users
- 28 July 2014
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Intelligent Systems and Technology
- Vol. 5 (3), 1-21
- https://doi.org/10.1145/2528548
Abstract
We present a new algorithm for inferring the home location of Twitter users at different granularities, including city, state, time zone, or geographic region, using the content of users’ tweets and their tweeting behavior. Unlike existing approaches, our algorithm uses an ensemble of statistical and heuristic classifiers to predict locations and makes use of a geographic gazetteer dictionary to identify place-name entities. We find that a hierarchical classification approach, where time zone, state, or geographic region is predicted first and city is predicted next, can improve prediction accuracy. We have also analyzed movement variations of Twitter users, built a classifier to predict whether a user was travelling in a certain period of time, and use that to further improve the location detection accuracy. Experimental evidence suggests that our algorithm works well in practice and outperforms the best existing algorithms for predicting the home location of Twitter users.Keywords
This publication has 19 references indexed in Scilit:
- Finding your friends and following them to where you arePublished by Association for Computing Machinery (ACM) ,2012
- Object matching in tweets with spatial modelsPublished by Association for Computing Machinery (ACM) ,2012
- "I'm eating a sandwich in Glasgow"Published by Association for Computing Machinery (ACM) ,2011
- The where in the tweetPublished by Association for Computing Machinery (ACM) ,2011
- Friendship and mobilityPublished by Association for Computing Machinery (ACM) ,2011
- Tweets from Justin Bieber's heartPublished by Association for Computing Machinery (ACM) ,2011
- Earthquake shakes Twitter usersPublished by Association for Computing Machinery (ACM) ,2010
- Multi-class AdaBoostStatistics and Its Interface, 2009
- Dynamically weighted ensemble neural networks for classificationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Hierarchical classification of Web contentPublished by Association for Computing Machinery (ACM) ,2000