Introduction to Apache Cassandra
Tuesday, July 13, 2010 at 6:01AM |
Derek Stainer So in case you missed it, Cassandra has been in the news the last couple of days. So I thought this would be a good opportunity to provide an introduction to Cassandra via Gary Dusbabek from Rackspace. This presentation was actually given at Silicon Valley Cloud Computing Group back in June of this year.
Couple of key points about Cassandra (not from the presentation):
- Initially created by Facebook for search functionality for users inbox mail on the site.
- The source code was open sourced and released to the Apache Software Foundation.
- Its design was inspired by both Google's BigTable and Amazon's Dynamo.
- It's considered to be a column data store, similar to a Google BigTable or Apache HBase.
So why Cassandra at all? As Dusbabek mentions from his presentation "vertical scaling is hard". So as the amount of data we create and analyze increases, our strategies for dealing with that data change. Dusbabek walks us through a number of topics in his discussion including scaling, replication model, data model and practical considerations.
So without any further interruptions...

