[
https://issues.apache.org/jira/browse/ORC-210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16244946#comment-16244946
]
ASF GitHub Bot commented on ORC-210:
------------------------------------
Github user omalley commented on the issue:
https://github.com/apache/orc/pull/189
Here is a spread sheet that is sorted by data set and then by read+hdfs
time. The read+hdfs time assumes 15mb/sec from hdfs.
https://docs.google.com/spreadsheets/d/1bE1j-AaUY7Xq_uh1nqX1Jf7qXvfL1crp2Y8Hl2M4Y-E/edit?usp=sharing
> Add new ORC 2.0 encoding for Double, Float.
> -------------------------------------------
>
> Key: ORC-210
> URL: https://issues.apache.org/jira/browse/ORC-210
> Project: ORC
> Issue Type: Improvement
> Components: encoding, Java
> Affects Versions: 2.0.0
> Reporter: Dapeng Sun
> Assignee: Teddy Choi
> Attachments: ORC-210.1.patch, ORC-210.2.patch, patch.txt
>
>
> Currently, Double and Float are using PLAIN encoding, it is better to support
> encoding such as Dictionary or BitPacking to reduce the storage cost.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)