Uncovering Crowdsourced Manipulation of Online Reviews

conference paper
conference paper
Published by Association for Computing Machinery (ACM) in Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval

https://doi.org/10.1145/2766462.2767742

Abstract

Online reviews are a cornerstone of consumer decision making. However, their authenticity and quality has proven hard to control, especially as polluters target these reviews toward promoting products or in degrading competitors. In a troubling direction, the widespread growth of crowdsourcing platforms like Mechanical Turk has created a large-scale, potentially difficult-to-detect workforce of malicious review writers. Hence, this paper tackles the challenge of uncovering crowdsourced manipulation of online reviews through a three-part effort: (i) First, we propose a novel sampling method for identifying products that have been targeted for manipulation and a seed set of deceptive reviewers who have been enlisted through crowdsourcing platforms. (ii) Second, we augment this base set of deceptive reviewers through a reviewer-reviewer graph clustering approach based on a Markov Random Field where we define individual potentials (of single reviewers) and pair potentials (between two reviewers). (iii) Finally, we embed the results of this probabilistic model into a classification framework for detecting crowd-manipulated reviews. We find that the proposed approach achieves up to 0.96 AUC, outperforming both traditional detection methods and a SimRank-based alternative clustering approach.

Keywords

Funding Information

Google Faculty Research Award
AFOSR (FA9550-12-1-0363)
Army Research Office (W911NF-13-1-0271)

This publication has 17 references indexed in Scilit:

CrowdDB
Published by Association for Computing Machinery (ACM) ,2011
Soylent
Published by Association for Computing Machinery (ACM) ,2010
How opinions are received by online communities
Published by Association for Computing Machinery (ACM) ,2009
Examining the Relationship Between Reviews and Sales: The Role of Reviewer Identity Disclosure in Electronic Markets
Information Systems Research, 2008
Opinion spam and analysis
Published by Association for Computing Machinery (ACM) ,2008
An introduction to ROC analysis
Pattern Recognition Letters, 2005
The Effect of Word of Mouth on Sales: Online Book Reviews
Published by National Bureau of Economic Research ,2003
Approximation algorithms for classification problems with pairwise relationships
Journal of the ACM, 2002
SimRank
Published by Association for Computing Machinery (ACM) ,2002
Maximum Likelihood from Incomplete Data Via the EM Algorithm
Journal of the Royal Statistical Society: Series B (Methodological), 1977

Cited by 57 articles