Colin Ma created HIVE-16969:
-------------------------------
Summary: Improve performance of MapOperator for Parquet
Key: HIVE-16969
URL: https://issues.apache.org/jira/browse/HIVE-16969
Project: Hive
Issue Type: Improvement
Affects Versions: 3.0.0
Reporter: Colin Ma
Assignee: Colin Ma
Fix For: 3.0.0
For a table with many partition files,
MapOperator.cloneConfsForNestedColPruning() updates
hive.io.file.readNestedColumn.paths many times. The resulting large value of
hive.io.file.readNestedColumn.paths causes poor performance in
ParquetHiveSerDe.processRawPrunedPaths().
So, unnecessary paths should not be appended to
hive.io.file.readNestedColumn.paths.
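
The idea can be illustrated with a minimal, self-contained sketch. This is not
the actual patch: the class NestedPathMerger and the method appendMissing are
hypothetical names, and the property value is assumed to be a comma-separated
list of nested column paths. The sketch appends a path only if it is not
already present, so the value stops growing once every distinct path has been
seen.

```java
import java.util.LinkedHashSet;
import java.util.Set;

// Hypothetical helper for illustration only; not part of Hive.
final class NestedPathMerger {
    private NestedPathMerger() {}

    /**
     * Merges newPaths into the current property value, skipping paths that
     * are already present. Both arguments are assumed to be comma-separated
     * lists; insertion order of first occurrence is preserved.
     */
    static String appendMissing(String current, String newPaths) {
        Set<String> merged = new LinkedHashSet<>();
        addAll(merged, current);
        addAll(merged, newPaths);
        return String.join(",", merged);
    }

    // Splits a comma-separated value and adds each non-empty, trimmed entry.
    private static void addAll(Set<String> target, String csv) {
        if (csv == null || csv.isEmpty()) {
            return;
        }
        for (String path : csv.split(",")) {
            String trimmed = path.trim();
            if (!trimmed.isEmpty()) {
                target.add(trimmed);
            }
        }
    }
}
```

With this kind of merge, applying the same set of paths for each additional
partition file leaves the value unchanged, so the cost of any later pass over
the value depends on the number of distinct paths rather than the number of
files.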