[
https://issues.apache.org/jira/browse/HIVE-11882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Illya Yalovyy updated HIVE-11882:
---------------------------------
Attachment: HIVE-11882.1.patch
> Fetch optimizer should stop source files traversal once it exceeds the
> hive.fetch.task.conversion.threshold
> -----------------------------------------------------------------------------------------------------------
>
> Key: HIVE-11882
> URL: https://issues.apache.org/jira/browse/HIVE-11882
> Project: Hive
> Issue Type: Improvement
> Components: Physical Optimizer
> Affects Versions: 1.0.0
> Reporter: Illya Yalovyy
> Assignee: Illya Yalovyy
> Attachments: HIVE-11882.1.patch
>
>
> Hive 1.0's fetch optimizer tries to optimize queries of the form "select <C>
> from <T> where <F> limit <L>" to a fetch task (see the
> hive.fetch.task.conversion property). This optimization gets the lengths of
> all the files in the specified partition and does some comparison against a
> threshold value to determine whether it should use a fetch task or not (see
> the hive.fetch.task.conversion.threshold property). This process of getting
> the length of all files. One of the main problems in this optimization is the
> fetch optimizer doesn't seem to stop once it exceeds the
> hive.fetch.task.conversion.threshold. It works fine on HDFS, but could cause
> a significant performance degradation on other supported file systems.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)