[jira] [Commented] (HIVE-5783) Native Parquet Support in Hive

Lefty Leverenz (JIRA) Tue, 21 Jan 2014 01:05:10 -0800

    [ https://issues.apache.org/jira/browse/HIVE-5783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13877345#comment-13877345 ]


Lefty Leverenz commented on HIVE-5783:
--------------------------------------

What documentation will this need?  Is anything already written up that can be 
added to the wiki?

Here's where the wiki documents file formats and serdes:

* "Row Format, Storage Format, and SerDe" section in DDL doc, with links to 
other serde docs: 
[https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL#LanguageManualDDL-RowFormat,StorageFormat,andSerDe]
* "File Formats" section in the Language Manual (includes ORC doc):  
[https://cwiki.apache.org/confluence/display/Hive/LanguageManual]
* Avro SerDe doc:  [https://cwiki.apache.org/confluence/display/Hive/AvroSerDe]

> Native Parquet Support in Hive
> ------------------------------
>
>                 Key: HIVE-5783
>                 URL: https://issues.apache.org/jira/browse/HIVE-5783
>             Project: Hive
>          Issue Type: New Feature
>          Components: Serializers/Deserializers
>            Reporter: Justin Coffey
>            Assignee: Justin Coffey
>            Priority: Minor
>             Fix For: 0.13.0
>
>         Attachments: HIVE-5783.patch, HIVE-5783.patch, HIVE-5783.patch, 
> HIVE-5783.patch, HIVE-5783.patch
>
>
> Problem Statement:
> Hive would be easier to use if it had native Parquet support. Our 
> organization, Criteo, uses Hive extensively. Therefore we built the Parquet 
> Hive integration and would like to now contribute that integration to Hive.
>
> About Parquet:
> Parquet is a columnar storage format for Hadoop and integrates with many 
> Hadoop ecosystem tools such as Thrift, Avro, Hadoop MapReduce, Cascading, 
> Pig, Drill, Crunch, and Hive. Pig, Crunch, and Drill all contain native 
> Parquet integration.
>
> Changes Details:
> Parquet was built with dependency management in mind and therefore only a 
> single Parquet jar will be added as a dependency.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
