[
https://issues.apache.org/jira/browse/HIVE-11705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Prasanth Jayachandran updated HIVE-11705:
-----------------------------------------
Affects Version/s: 2.0.0
> refactor SARG stripe filtering for ORC into a separate method
> -------------------------------------------------------------
>
> Key: HIVE-11705
> URL: https://issues.apache.org/jira/browse/HIVE-11705
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.0.0
> Reporter: Sergey Shelukhin
> Assignee: Sergey Shelukhin
> Fix For: 2.0.0
>
> Attachments: HIVE-11705.01.patch, HIVE-11705.02.patch,
> HIVE-11705.03.patch, HIVE-11705.patch
>
>
> For footer cache PPD to metastore, we'd need a method to do the PPD. Tiny
> item to create it on OrcInputFormat.
> For metastore path, these methods will be called from expression proxy
> similar to current objectstore expr filtering; it will change to have
> serialized sarg and column list to come from request instead of conf;
> includedCols/etc. will also come from request instead of assorted java
> objects.
> -The types and stripe stats will need to be extracted from HBase. This is a
> little bit of a problem, since ideally we want to be inside HBase
> filter/coprocessor/.... I'd need to take a look to see if this is possible...
> since that filter would need to either deserialize orc, or we would need to
> store types and stats information in some other, non-ORC manner on write. The
> latter is probably a better idea, although it's dangerous because there's no
> sync between this code and ORC itself.-
> Meanwhile minimize dependencies for stripe picking to essentials (and conf
> which is easy to remove).
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)