[ 
https://issues.apache.org/jira/browse/HADOOP-3601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12623094#action_12623094
 ] 

Ashish Thusoo commented on HADOOP-3601:
---------------------------------------

We ran into some issues while porting this to trunk. We are actively working to 
resolve those issues.

While we solve the compatibility issues with hadoop trunk, interested users can 
get a source tar ball and a jar distribution which compiles and works with 
hadoop 0.17 from the following location

http://mirror.facebook.com/facebook/hive/hadoop-0.17/

Please follow the instructions in the README file  on how to compile the src 
tar ball and how to use the jar distribution. Not all the features mentioned in 
the tutorial on this JIRA have made it to this distribution, but a bulk of 
these are already there. The README in the jar distribution has a summary of 
what is working and what is not.

Feel free to try it out and send us feedback.

[EMAIL PROTECTED]




> Hive as a contrib project
> -------------------------
>
>                 Key: HADOOP-3601
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3601
>             Project: Hadoop Core
>          Issue Type: Wish
>    Affects Versions: 0.17.0
>         Environment: N/A
>            Reporter: Joydeep Sen Sarma
>            Priority: Minor
>         Attachments: HiveTutorial.pdf
>
>   Original Estimate: 1080h
>  Remaining Estimate: 1080h
>
> Hive is a data warehouse built on top of flat files (stored primarily in 
> HDFS). It includes:
> - Data Organization into Tables with logical and hash partitioning
> - A Metastore to store metadata about Tables/Partitions etc
> - A SQL like query language over object data stored in Tables
> - DDL commands to define and load external data into tables
> Hive's query language is executed using Hadoop map-reduce as the execution 
> engine. Queries can use either single stage or multi-stage map-reduce. Hive 
> has a native format for tables - but can handle any data set (for example 
> json/thrift/xml) using an IO library framework.
> Hive uses Antlr for query parsing, Apache JEXL for expression evaluation and 
> may use Apache Derby as an embedded database for MetaStore. Antlr has a BSD 
> license and should be compatible with Apache license.
> We are currently thinking of contributing to the 0.17 branch as a contrib 
> project (since that is the version under which it will get tested internally) 
> - but looking for advice on the best release path.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to