Arbitrarily distributed data-based recommendations with privacy
- 1 February 2012
- journal article
- research article
- Published by Elsevier BV in Data & Knowledge Engineering
- Vol. 72, 239-256
- https://doi.org/10.1016/j.datak.2011.11.002
Abstract
Collaborative filtering (CF) systems use customers' preferences about various products to offer recommendations. Providing accurate and reliable predictions is vital for both e-commerce companies and their customers. To offer such referrals, CF systems should have sufficient data. When data collected for CF purposes held by a central server, it is an easy task to provide recommendations. However, customers' preferences represented as ratings might be partitioned between two vendors. To supply trustworthy and correct predictions, such companies might desire to collaborate. Due to privacy concerns, financial fears, and legal issues; however, the parties may not want to disclose their data to each other. In this study, we scrutinize how to estimate item-based predictions on arbitrarily distributed data (ADD) between two e-commerce sites without deeply jeopardizing their privacy. We analyze our proposed scheme in terms of privacy; and demonstrate that the method does not intensely violate data owners' confidentiality. We conduct experiments using real data sets to show how coverage and quality of the predictions improve due to collaboration. We also investigate our scheme in terms of online performance; and demonstrate that supplementary online costs caused by privacy measures are negligible. Moreover, we perform trials to show how privacy concerns affect accuracy. Our results show that accuracy and coverage improve due to collaboration; and the proposed scheme is still able to offer truthful predictions with privacy concerns. (C) 2011 Elsevier B.V. All rights reserved.Keywords
This publication has 23 references indexed in Scilit:
- I trust not therefore it must be risky: Determinants of the perceived risks of disclosing personal data for e-government transactionsComputers in Human Behavior, 2011
- Reliable representations for association rulesData & Knowledge Engineering, 2011
- Recommendation-based editor for business process modelingData & Knowledge Engineering, 2011
- Frequent Item Computation on a ChipIEEE Transactions on Knowledge and Data Engineering, 2010
- A Survey of Collaborative Filtering TechniquesAdvances in Artificial Intelligence, 2009
- Privacy-preserving naive Bayes classification on distributed data via semi-trusted mixersInformation Systems, 2009
- Using error-correcting dependencies for collaborative filteringData & Knowledge Engineering, 2008
- Recent advances in preserving privacy when mining dataData & Knowledge Engineering, 2008
- Providing predictions on distributed HMMs with privacyArtificial Intelligence Review, 2007
- A privacy-preserving clustering approach toward secure and effective data analysis for business collaborationComputers & Security, 2007