Building Apps on Hadoop with Pig, Hive, and ZooKeeper

Over at Data Informed, David Loshin writes that designing, developing, and deploying analytic applications takes more than downloading open source software; it still requires skill and expertise.

This article examines the prototypical big data platform using Hadoop, and how Hadoop-related projects address these pieces of the puzzle:

  • Synchronization and coordination of process and object namespace across different applications and assets.
  • Data management, which has a number of alternatives. This article focuses on Hive and HBase; a future article will discuss a variety of big data management schemes.
  • Programming ease-of-use, and methods to simplify MapReduce programming.
  • Analytics, or rather some implementations of algorithms that can be used to develop analytical models.

Read the Full Story.
