StoreSim: Optimizing Information Leakage in Multicloud Storage Services
- 1 November 2015
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE) in 2015 IEEE 7th International Conference on Cloud Computing Technology and Science (CloudCom)
- p. 379-386
- https://doi.org/10.1109/cloudcom.2015.26
Abstract
Many schemes have been recently advanced for storing data on multiple clouds. Distributing data over different cloud storage providers (CSPs) automatically provides users with a certain degree of information leakage control, as no single point of attack can leak all user's information. However, unplanned distribution of data chunks can lead to high information disclosure even while using multiple clouds. In this paper, to address this problem we present StoreSim, an information leakage aware storage system in multicloud. StoreSim aims to store syntactically similar data on the same cloud, thus minimizing the user's information leakage across multiple clouds. We design an approximate algorithm to efficiently generate similarity-preserving signatures for data chunks based on MinHash and Bloom filter, and also design a function to compute the information leakage based on these signatures. Next, we present an effective storage plan generation algorithm based on clustering for distributing data chunks with minimal information leakage across multiple clouds. Finally, we evaluate our scheme using two real datasets from Wikipedia and GitHub. We show that our scheme can reduce the information leakage by up to 60% compared to unplanned placement.Keywords
This publication has 18 references indexed in Scilit:
- Is the Same Instance Type Created Equal? Exploiting Heterogeneity of Public CloudsIEEE Transactions on Cloud Computing, 2013
- DepSkyACM Transactions on Storage, 2013
- Scalia: An adaptive scheme for efficient multi-cloud storagePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2012
- b-Bit minwise hashingPublished by Association for Computing Machinery (ACM) ,2010
- Finding near-duplicate web pagesPublished by Association for Computing Machinery (ACM) ,2006
- A Survey of Clustering Data Mining TechniquesPublished by Springer Science and Business Media LLC ,2006
- Network Applications of Bloom Filters: A SurveyInternet Mathematics, 2004
- Similarity estimation techniques from rounding algorithmsPublished by Association for Computing Machinery (ACM) ,2002
- Bitmap index design and evaluationPublished by Association for Computing Machinery (ACM) ,1998
- Syntactic clustering of the WebComputer Networks and ISDN Systems, 1997