[ https://issues.apache.org/jira/browse/HIVE-5783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13877345#comment-13877345 ]
Lefty Leverenz commented on HIVE-5783: -------------------------------------- What documentation will this need? Is anything already written up that can be added to the wiki? Here's where the wiki documents file formats and serdes: * "Row Format, Storage Format, and SerDe" section in DDL doc, with links to other serde docs: [https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL#LanguageManualDDL-RowFormat,StorageFormat,andSerDe] * "File Formats" section in the Language Manual (includes ORC doc): [https://cwiki.apache.org/confluence/display/Hive/LanguageManual] * Avro SerDe doc: [https://cwiki.apache.org/confluence/display/Hive/AvroSerDe] > Native Parquet Support in Hive > ------------------------------ > > Key: HIVE-5783 > URL: https://issues.apache.org/jira/browse/HIVE-5783 > Project: Hive > Issue Type: New Feature > Components: Serializers/Deserializers > Reporter: Justin Coffey > Assignee: Justin Coffey > Priority: Minor > Fix For: 0.13.0 > > Attachments: HIVE-5783.patch, HIVE-5783.patch, HIVE-5783.patch, > HIVE-5783.patch, HIVE-5783.patch > > > Problem Statement: > Hive would be easier to use if it had native Parquet support. Our > organization, Criteo, uses Hive extensively. Therefore we built the Parquet > Hive integration and would like to now contribute that integration to Hive. > About Parquet: > Parquet is a columnar storage format for Hadoop and integrates with many > Hadoop ecosystem tools such as Thrift, Avro, Hadoop MapReduce, Cascading, > Pig, Drill, Crunch, and Hive. Pig, Crunch, and Drill all contain native > Parquet integration. > Changes Details: > Parquet was built with dependency management in mind and therefore only a > single Parquet jar will be added as a dependency. -- This message was sent by Atlassian JIRA (v6.1.5#6160)