Search
Follow Us

Follow nosqldatabases on Twitter Follow nosqldatabases on Facebook Follow nosqldatabases on Google Buzz Follow nosqldatabases on LinkedIn Follow nosqldatabases on FeedBurner NoSQL presentations on slideshare

Sponsors

Become a sponsor of NoSQLDatabases.com. Contact us to find out how.

Featured Jobs

 

Follow On Facebook
Recent NoSQL News

Advertisments

Entries in Hive (4)

Thursday
Nov112010

Hadoop and Hive at Orbitz

More slides and video from this years Hadoop World. In this presentation Jonathan Seidman and Ramesh Venkataramaiah discuss how Orbitz is using Hadoop and Hive. Specifically the presentation discusses how Orbitz uses Hadoop, Hive and machine learning to improve hotel ranking. In addition to the hotel ranking application, Orbitz is starting to use Hadoop for other projects such as: measuring page download performance, raw data log search and cache analysis.

Click to read more ...

Wednesday
Sep012010

How Facebook Scales with Open Source

ReadWriteWeb has a piece on how Facebook scales to accomidate their 500 million users by using open source. The article itself is relevant to NoSQL in a couple of ways. First it's a known fact that Facebook uses multiple NoSQL databases, and let's not forget they actually created one as well.

Click to read more ...

Monday
Jun142010

Hive - A Petabyte Scale Data Warehouse Using Hadoop

Now technically, Hadoop itself does not belong in the NoSQL discussion.  However, components that make up the Hadoop ecosystem such as HBase and Hive are definitely candidates for discussion. In this presentation by the Facebook Data Team, they discuss their usage of Hive in combination with Hadoop to solve Facebook's data warehousing and analytical needs.

I found it interesting how Facebook's blended the use of both traditional SQL data stores such as Oracle and MySQL and NoSQL solutions such as Hive as part of their overall solution.

Other interesting statistics:

  • 10 TB of compressed new data added per day
  • 135 TB of compressed data scanned per day
  • 7500+ Hive jobs per day
  • 80K compute hours per day

Tuesday
Jun012010

Cloudera's CEO talks us through big data trends

Really good interview by Robert Scoble with Mike Olsen the CEO of Cloudera. It touches on a number of topics including Hadoop, Big Data and NoSQL. There are some mentions of NoSQL technologies such as MongoDB, CouchDB and Cassandra. Regardless, Mike Olsen is a very engaging guy and is very interesting to listen to. Despite being focused on Hadoop and Cloudera it's worth checking it out.