[ 
https://issues.apache.org/jira/browse/HIVE-11676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sergey Shelukhin updated HIVE-11676:
------------------------------------
    Attachment: HIVE-11676.patch

Implements the API in a simplest possible way, SARG evaluation is done in 
metastore but still with full deserialization. Phase 2 (separate JIRA) will 
include storing only necessary data in separate CF in HBase to avoid 
deserialization and pave the way for phase 3 where we can push this to HBase 
filter. Given that filters are right now min-max only we can even get rid of 
ORC dependency and use standard HBase filters, but it will make pushdown 
difficult for future improvement, need to talk to Gopal and Owen/Prasanth 
before doing that.

This is a patch on top of umpteen other patches that are not committed, in 
particular HIVE-11553 and HIVE-11705, maybe others. Patch management is 
becoming PITA.

[~alangates] can you review? :) This particular patch is not super urgent.

> implement metastore API to do file footer PPD
> ---------------------------------------------
>
>                 Key: HIVE-11676
>                 URL: https://issues.apache.org/jira/browse/HIVE-11676
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Sergey Shelukhin
>         Attachments: HIVE-11676.patch
>
>
> Need to pass on the expression/sarg, extract column stats from footer (at 
> write time?) and then apply one to the other. I may file a separate JIRA for 
> ORC changes cause that is usually PITA



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to