Estimating Twitter User Location Using Social Interactions--A Content Based Approach
- 1 October 2011
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE) in 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing
Abstract
Microblogging services such as Twitter allow users to interact with each other by forming a social network. The interaction between users in a social network group forms a dialogue or discussion. A typical dialogue between users involves a set of topics. We make the assumption that this set of topics remains constant throughout the conversation. Using this model of social interaction between users in the Twitter social network, along with content-derived location information, we employ a probabilistic framework to estimate the city-level location of a Twitter user, based on the content of the tweets in their dialogues, using reply-tweet messages. We estimate the city-level user location based purely on the content of the tweets, which may include reply-tweet information, without the use of any external information, such as a gazetteer, IP information etc. The current framework for estimating user location does not consider the underlying social interaction, i.e. the structure of interactions between the users. In this paper, we calculate a baseline probability estimate of the distribution of words used by a user. This distribution is formed by using the fact that terms used in the tweets of a certain discussion may be related to the location information of the user initiating the discussion. We also estimate the top K probable cities for a given user and measure the accuracy. We find that our baseline estimation yields an accuracy higher that the 10% accuracy of the current state of the art estimation.Keywords
This publication has 7 references indexed in Scilit:
- You are where you tweetPublished by Association for Computing Machinery (ACM) ,2010
- Find me if you canPublished by Association for Computing Machinery (ACM) ,2010
- Social network classification incorporating link type valuesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2009
- Tag-geotag correlation in social networksPublished by Association for Computing Machinery (ACM) ,2008
- Spatial variation in search engine queriesPublished by Association for Computing Machinery (ACM) ,2008
- Why we twitterPublished by Association for Computing Machinery (ACM) ,2007
- Web-a-wherePublished by Association for Computing Machinery (ACM) ,2004