A Statistical Language Modeling Approach to Online Deception Detection
- 27 June 2008
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Knowledge and Data Engineering
- Vol. 20 (8), 1077-1081
- https://doi.org/10.1109/tkde.2007.190624
Abstract
Online deception is disrupting our daily life, organizational processes, and even national security. Existing approaches to online deception detection follow a traditional paradigm that uses a set of cues as antecedents for deception detection, which may be hindered by ineffective cue identification. Motivated by the strength of statistical language models (SLMs) in capturing the dependency of words in text without explicit feature extraction, we developed SLMs to detect online deception. We also addressed the data sparsity problem in building SLMs, both in general and in deception detection in particular, using smoothing and vocabulary pruning techniques. The developed SLMs were evaluated empirically with diverse datasets. The results showed that the proposed SLM approach to deception detection outperformed a state-of-the-art text categorization method as well as traditional feature-based methods.
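The general idea described in the abstract (one language model per class, with smoothing and vocabulary pruning to cope with sparse data) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not state the n-gram order, the smoothing method, or the pruning rule the authors used, so this sketch substitutes a bigram model, add-k smoothing, and a simple minimum-count cutoff. The names `build_vocab`, `BigramLM`, and `classify`, and the toy messages, are hypothetical.

```python
import math
from collections import Counter

UNK, BOS, EOS = "<unk>", "<s>", "</s>"

def build_vocab(docs, min_count=2):
    """Vocabulary pruning (assumed rule): keep words seen at least min_count times."""
    counts = Counter(w for d in docs for w in d.lower().split())
    return {w for w, c in counts.items() if c >= min_count}

class BigramLM:
    """Bigram language model with add-k smoothing (an illustrative choice)."""

    def __init__(self, docs, vocab, k=1.0):
        self.vocab = set(vocab) | {UNK, BOS, EOS}
        self.k = k
        self.bigrams = Counter()   # counts of (previous word, current word)
        self.contexts = Counter()  # counts of the previous word alone
        for d in docs:
            toks = self._tokens(d)
            for prev, cur in zip(toks, toks[1:]):
                self.bigrams[(prev, cur)] += 1
                self.contexts[prev] += 1

    def _tokens(self, doc):
        # Words outside the pruned vocabulary are mapped to a single <unk> token.
        words = [w if w in self.vocab else UNK for w in doc.lower().split()]
        return [BOS] + words + [EOS]

    def log_prob(self, doc):
        """Smoothed log-probability of a document under this model."""
        toks = self._tokens(doc)
        v = len(self.vocab)
        total = 0.0
        for prev, cur in zip(toks, toks[1:]):
            num = self.bigrams[(prev, cur)] + self.k
            den = self.contexts[prev] + self.k * v
            total += math.log(num / den)
        return total

def classify(doc, lm_deceptive, lm_truthful):
    """Assign the class whose language model gives the document the higher likelihood."""
    if lm_deceptive.log_prob(doc) > lm_truthful.log_prob(doc):
        return "deceptive"
    return "truthful"

# Toy, made-up training messages (not from the paper's datasets).
deceptive_docs = ["i swear i never saw the money", "i swear i never took the money"]
truthful_docs = ["we met at noon and signed the report", "we met at noon and filed the report"]

# A shared vocabulary keeps the two models' probabilities comparable.
vocab = build_vocab(deceptive_docs + truthful_docs, min_count=2)
lm_d = BigramLM(deceptive_docs, vocab)
lm_t = BigramLM(truthful_docs, vocab)

label = classify("i swear i never touched the money", lm_d, lm_t)
```

Sharing one pruned vocabulary across both class models is a design choice of this sketch: it ensures both models normalize over the same event space, so their log-likelihoods for the same message can be compared directly.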