[ 
https://issues.apache.org/jira/browse/ORC-210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16307686#comment-16307686
 ] 

ASF GitHub Bot commented on ORC-210:
------------------------------------

Github user winningsix commented on the issue:

    https://github.com/apache/orc/pull/189
  
    @omalley Thank you for the patch and the initial performance results.
    They look very promising. From the current results, it looks like the
    different encodings (FPC V1, FPC V2, and split) each suit different data
    sets, while split encoding is the most compression-friendly. I wonder
    whether we could do what RLE does and dynamically choose the encoding
    based on the characteristics of the data set. Any thoughts on this?
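
    To make the question concrete, here is a rough sketch of what per-batch
    selection could look like. It is not taken from the patch: the class
    name, the candidate names, and both size estimators are hypothetical
    stand-ins, and a real writer would plug in the actual FPC and split
    encoders. The idea is only to estimate the encoded size of a sample
    under each candidate and keep the smallest.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.ToIntFunction;

// Hypothetical sketch: pick an encoding per batch by estimating encoded size.
final class EncodingChooser {

    // Assumed cost of plain encoding: 8 bytes per double.
    static int plainSize(double[] values) {
        return 8 * values.length;
    }

    // Assumed cost of a simple XOR-with-previous scheme: one header byte per
    // value plus the bytes that remain after dropping leading zero bytes.
    static int xorSize(double[] values) {
        long prev = 0L;
        int size = 0;
        for (double d : values) {
            long bits = Double.doubleToLongBits(d);
            long xor = bits ^ prev;
            int leadingZeroBytes = Long.numberOfLeadingZeros(xor) / 8; // 0..8
            size += 1 + (8 - leadingZeroBytes);
            prev = bits;
        }
        return size;
    }

    // Illustrative candidate set; insertion order decides ties.
    static Map<String, ToIntFunction<double[]>> defaultCandidates() {
        Map<String, ToIntFunction<double[]>> m = new LinkedHashMap<>();
        m.put("plain", EncodingChooser::plainSize);
        m.put("xor", EncodingChooser::xorSize);
        return m;
    }

    // Returns the name of the candidate with the smallest estimated size.
    static String choose(double[] sample,
                         Map<String, ToIntFunction<double[]>> candidates) {
        String best = null;
        int bestSize = Integer.MAX_VALUE;
        for (Map.Entry<String, ToIntFunction<double[]>> e : candidates.entrySet()) {
            int size = e.getValue().applyAsInt(sample);
            if (size < bestSize) {
                bestSize = size;
                best = e.getKey();
            }
        }
        return best;
    }
}
```

    With these stand-in estimators, a batch of repeated values favors the
    XOR candidate, while values whose sign and exponent change on every
    element fall back to plain. Whether sampling cost is acceptable on the
    write path is a separate question.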


> Add new ORC 2.0 encoding for Double, Float.
> -------------------------------------------
>
>                 Key: ORC-210
>                 URL: https://issues.apache.org/jira/browse/ORC-210
>             Project: ORC
>          Issue Type: Improvement
>          Components: encoding, Java
>    Affects Versions: 2.0.0
>            Reporter: Dapeng Sun
>            Assignee: Teddy Choi
>         Attachments: ORC-210.1.patch, ORC-210.2.patch, patch.txt
>
>
> Currently, Double and Float use PLAIN encoding; it would be better to support
> encodings such as Dictionary or BitPacking to reduce the storage cost.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
