Components: Apache Hive

Description: The Apache Hive™ data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. Structure can be projected onto data already in storage. A command-line tool and a JDBC driver are provided to connect users to Hive.

Provision script (Puppet manifest): hadoop.pp

Additional info: Hive site

Table of contents:

Home
Prerequisites
Development stand provisioning
Components
Monitoring Links
Development