Taming the elephant

Learning Had00p

Archive for February 2013

SQL for Hadoop

leave a comment »

Listing of all the SQL solutions on top of Hadoop (or just linking to the name Hadoop).

SQL is what’s next for Hadoop: Here’s who’s doing it — Tech News and Analysis.

Key to SQL on Hadoop is to integrate with HDFS to take advantage of data locality and a good query optimizer. As of now systems based on HBase or using in memory caches seem to be the best bet.

I have recently work on Teradata offload to Hadoop and the current attempts of “SQL on Hadoop” hint at the model Teradata uses. Only *big* difference is the cost though. SQL on Hadoop solutions can mature and replace such costly systems


  • Keep the storage format open or else you are creating a new MPP database.

Written by rawatra

February 23, 2013 at 5:33 pm

Posted in bigdata, hadoop

Tagged with ,