Empath
- 7 May 2016
- conference paper
- conference paper
- Published by Association for Computing Machinery (ACM)
- p. 4647-4657
- https://doi.org/10.1145/2858036.2858535
Abstract
Human language is colored by a broad range of topics, but existing text analysis tools only focus on a small number of them. We present Empath, a tool that can generate and validate new lexical categories on demand from a small set of seed terms (like \"bleed\" and \"punch\" to generate the category violence). Empath draws connotations between words and phrases by deep learning a neural embedding across more than 1.8 billion words of modern fiction. Given a small set of seed words that characterize a category, Empath uses its neural embedding to discover new related terms, then validates the category with a crowd-powered filter. Empath also analyzes text across 200 built-in, pre-validated categories we have generated from common topics in our web dataset, like neglect, government, and social media. We show that Empath's data-driven, human validated categories are highly correlated (r=0.906) with similar categories in LIWC.Keywords
This publication has 23 references indexed in Scilit:
- Sentiment, emotion, purpose, and style in electoral tweetsInformation Processing & Management, 2015
- We Are DynamoPublished by Association for Computing Machinery (ACM) ,2015
- Comparing Person- and Process-centric Strategies for Obtaining Quality Data on Amazon Mechanical TurkPublished by Association for Computing Machinery (ACM) ,2015
- Experimental evidence of massive-scale emotional contagion through social networksProceedings of the National Academy of Sciences of the United States of America, 2014
- CROWDSOURCING A WORD–EMOTION ASSOCIATION LEXICONComputational Intelligence, 2012
- "Discovering emotion influence patterns in online social network conversations" by Suin Kim, JinYeong Bak, and Alice Oh, with Ching-man Au Yeung as coordinatorACM SIGWEB Newsletter, 2012
- Diurnal and Seasonal Mood Vary with Work, Sleep, and Daylength Across Diverse CulturesScience, 2011
- Separating Fact From Fiction: An Examination of Deceptive Self-Presentation in Online Dating ProfilesPersonality and Social Psychology Bulletin, 2008
- ConceptNet — A Practical Commonsense Reasoning Tool-KitBT Technology Journal, 2004
- WordNetCommunications of the ACM, 1995