Taming the elephant

Learning Had00p

SQL for Hadoop


A listing of all the SQL solutions built on top of Hadoop (or at least attaching themselves to the Hadoop name):

SQL is what’s next for Hadoop: Here’s who’s doing it — Tech News and Analysis.

The key to SQL on Hadoop is tight integration with HDFS, to take advantage of data locality, combined with a good query optimizer. As of now, systems based on HBase or using in-memory caches seem to be the best bet.

I recently worked on a Teradata offload to Hadoop, and the current attempts at “SQL on Hadoop” hint at the model Teradata uses. The only *big* difference is the cost. SQL on Hadoop solutions can mature and replace such costly systems.

Challenge:

  • Keep the storage format open or else you are creating a new MPP database.

Written by rawatra

February 23, 2013 at 5:33 pm

Posted in bigdata, hadoop

