Hadoop, Pig and HBase at Twitter
Tuesday, May 25, 2010 at 12:04AM |
Derek Stainer Again the folks at Twitter provide us the material for this next post. Specifically, Dimitriy Ryaboy a member of the Analytics team at Twitter, discusses Twitter's usage of Hadoop, Pig and HBase. Now technically both Hadoop and Pig are not really pure NoSQL, really they are ancilary components that interact with a NoSQL data store HBase.
However, HBase is a big part of the Hadoop ecosystem and Pig provides a simplified query mechanism for HBase, so in my opinion they are worth the discussion.
There are obviously several important points made throughout the discussion but one slide I found particularily interesting in which Dimitriy explains how they use Cassandra and HBase and what they use them for.
Rough Analogy: Cassandra is OLTP and HBase is OLAP
Dimitriy Ryaboy,
HBase,
Hadoop,
Pig,
Presentation,
Twitter 
