[
https://issues.apache.org/jira/browse/HIVE-24566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17254140#comment-17254140
]
David Mollitor commented on HIVE-24566:
---------------------------------------
[~jcamacho] [~pvary] What do you think about this?
> Add Parquet Stats Optimization
> -------------------------------
>
> Key: HIVE-24566
> URL: https://issues.apache.org/jira/browse/HIVE-24566
> Project: Hive
> Issue Type: Improvement
> Reporter: David Mollitor
> Priority: Major
>
> Parquet files store min/max/count data in foot metadata.
> When a query is submitted to a Parquet table, and stats are not available,
> Hive should launch a single multi-threaded processor that simply reads the
> meta data of each Parquet file instead of walking through every single record
> in the table.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)