[ https://issues.apache.org/jira/browse/HIVE-5632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Prasanth J updated HIVE-5632:
-----------------------------
    Attachment: HIVE-5632.2.patch.txt

Added unit tests to check split elimination and q file tests to test functionality. The q file tests need the orc_split_elim.orc file to be located in the "data/files" directory. The orc_split_elim.orc file contains 25000 rows. I didn't want to create a patch with 25000 rows of text data, so I am attaching orc_split_elim.orc as a separate file.

> Eliminate splits based on SARGs using stripe statistics in ORC
> --------------------------------------------------------------
>
>                 Key: HIVE-5632
>                 URL: https://issues.apache.org/jira/browse/HIVE-5632
>             Project: Hive
>          Issue Type: Improvement
>    Affects Versions: 0.13.0
>            Reporter: Prasanth J
>            Assignee: Prasanth J
>              Labels: orcfile
>         Attachments: HIVE-5632.1.patch.txt, HIVE-5632.2.patch.txt,
> orc_split_elim.orc
>
>
> HIVE-5562 provides stripe level statistics in ORC. Stripe level statistics
> combined with predicate pushdown in ORC (HIVE-4246) can be used to eliminate
> the stripes (and thereby the splits) that do not satisfy the predicate condition.
> This can greatly reduce unnecessary reads.

--
This message was sent by Atlassian JIRA
(v6.1#6144)
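
For readers unfamiliar with the idea in the issue description, the following is a minimal sketch of how per-stripe min/max statistics could be used to skip stripes for a simple comparison predicate. It is not the code in the patch and does not use Hive's or ORC's APIs: the names StripeStats, may_match and surviving_stripes are hypothetical, and it assumes a single integer column and a single-leaf predicate.

```python
from dataclasses import dataclass


@dataclass
class StripeStats:
    """Hypothetical per-stripe statistics for one integer column."""
    index: int
    min_value: int
    max_value: int


def may_match(stats: StripeStats, op: str, literal: int) -> bool:
    """Return False only when no row in the stripe can satisfy the predicate."""
    if op == "=":
        return stats.min_value <= literal <= stats.max_value
    if op == "<":
        return stats.min_value < literal
    if op == "<=":
        return stats.min_value <= literal
    if op == ">":
        return stats.max_value > literal
    if op == ">=":
        return stats.max_value >= literal
    # Unknown operator: be conservative and keep the stripe.
    return True


def surviving_stripes(stripes, op, literal):
    """Indexes of the stripes that still have to be read."""
    return [s.index for s in stripes if may_match(s, op, literal)]


# Three stripes with non-overlapping value ranges (made-up numbers).
stripes = [
    StripeStats(0, 1, 100),
    StripeStats(1, 101, 200),
    StripeStats(2, 201, 300),
]
print(surviving_stripes(stripes, "=", 150))
```

In this sketch a stripe is dropped only when its statistics prove that no row can match, so the check can produce false positives (stripes read unnecessarily) but never false negatives.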