A Big Data Modeling Methodology for Apache Cassandra
- 1 June 2015
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Apache Cassandra is a leading distributed database of choice when it comes to big data management with zero downtime, linear scalability, and seamless multiple data center deployment. With increasingly wider adoption of Cassandra for online transaction processing by hundreds of Web-scale companies, there is a growing need for a rigorous and practical data modeling approach that ensures sound and efficient schema design. This work i) proposes the first query-driven big data modeling methodology for Apache Cassandra, ii) defines important data modeling principles, mapping rules, and mapping patterns to guide logical data modeling, iii) presents visual diagrams for Cassandra logical and physical data models, and iv) demonstrates a data modeling tool that automates the entire data modeling process.