[
https://issues.apache.org/jira/browse/HIVE-10114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14627582#comment-14627582
]
Lefty Leverenz commented on HIVE-10114:
---------------------------------------
Very nice doc, [~gopalv]. I removed the TODOC1.2 label.
Here's a link to the doc:
* [Configuration Properties -- hive.exec.orc.split.strategy |
https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=27842758#ConfigurationProperties-hive.exec.orc.split.strategy]
> Split strategies for ORC
> ------------------------
>
> Key: HIVE-10114
> URL: https://issues.apache.org/jira/browse/HIVE-10114
> Project: Hive
> Issue Type: Improvement
> Affects Versions: 1.2.0
> Reporter: Prasanth Jayachandran
> Assignee: Prasanth Jayachandran
> Fix For: 1.2.0
>
> Attachments: HIVE-10114.1.patch, HIVE-10114.2.patch,
> HIVE-10114.3.patch, HIVE-10114.4.patch, HIVE-10114.5.patch
>
>
> ORC split generation does not have clearly defined strategies for different
> scenarios (many small orc files, few small orc files, many large files etc.).
> Few strategies like storing the file footer in orc split, making entire file
> as a orc split already exists. This JIRA to make the split generation
> simpler, support different strategies for various use cases (BI, ETL, ACID
> etc.) and to lay the foundation for HIVE-7428.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)