Tuesday, August 26, 2014

Hadoop - Hive and Pig and Impala

Hive and Pig are both high-level query layers for Hadoop.
Hive has a more familiar SQL-style language (HiveQL) and is a framework for running analytic queries.
Pig uses its own scripting language, Pig Latin.


http://vision.cloudera.com/impala-v-hive/ 
http://www-01.ibm.com/software/data/infosphere/hadoop/hive/
http://www-01.ibm.com/software/data/infosphere/hadoop/pig/




Other notes:

Hadoop is basically two things: a distributed file system (HDFS) and a computation/processing framework (MapReduce).
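
To make the MapReduce half more concrete, here is a minimal single-process sketch of the map, shuffle, and reduce steps in plain Python. It does not use Hadoop at all; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are made up for illustration, and a real job would run these steps in parallel across the cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group emitted values by key (the step between map and reduce)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Collapse all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run the three steps in sequence on an in-memory list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    print(word_count(["hive and pig", "pig and impala"]))
```

Tools like Hive and Pig exist so that you can describe this kind of aggregation as a query or script instead of writing the map and reduce functions by hand.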

HBase sits on top of HDFS and provides random read/write access.
HBase is similar to Google's Bigtable.
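
As a rough picture of what "random read/write access" means here, the toy class below models a table as rows addressed by key, each holding `family:qualifier` columns, with range scans in sorted key order. This is an in-memory sketch only, not the HBase client API; `ToyTable` and its methods are hypothetical names, and things like versions, regions, and persistence are left out.

```python
class ToyTable:
    """In-memory stand-in for a Bigtable-style table (illustration only)."""

    def __init__(self):
        # row key -> {"family:qualifier": value}
        self._rows = {}

    def put(self, row_key, column, value):
        """Write (or overwrite) one cell, addressed directly by row key."""
        self._rows.setdefault(row_key, {})[column] = value

    def get(self, row_key, column=None):
        """Read one cell, or the whole row when no column is given."""
        row = self._rows.get(row_key, {})
        return dict(row) if column is None else row.get(column)

    def scan(self, start_row, stop_row):
        """Yield rows in sorted key order, start inclusive, stop exclusive."""
        for key in sorted(self._rows):
            if start_row <= key < stop_row:
                yield key, dict(self._rows[key])


if __name__ == "__main__":
    table = ToyTable()
    table.put("user1", "info:name", "Ann")
    table.put("user2", "info:name", "Bob")
    table.put("user1", "info:name", "Anna")  # random write: update in place
    print(table.get("user1", "info:name"))
    print(list(table.scan("user1", "user2")))
```

The contrast with plain HDFS is that files there are written in bulk and read sequentially, while this kind of table lets you look up or update a single row by key.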

Kafka: eventing and queuing, somewhat similar to RabbitMQ.
