[
https://issues.apache.org/jira/browse/HIVE-15065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607127#comment-15607127
]
Prasanth Jayachandran commented on HIVE-15065:
----------------------------------------------
[~ashutoshc] can you please review?
> SimpleFetchOptimizer should decide based on metastore stats when available
> --------------------------------------------------------------------------
>
> Key: HIVE-15065
> URL: https://issues.apache.org/jira/browse/HIVE-15065
> Project: Hive
> Issue Type: Bug
> Components: Logical Optimizer
> Affects Versions: 2.2.0
> Reporter: Prasanth Jayachandran
> Assignee: Prasanth Jayachandran
> Attachments: HIVE-15065.1.patch
>
>
> Currently the decision to use fetch optimizer or not is based on scanning the
> filesystem for file lengths and see if the aggregated size is less the fetch
> task threshold. This can be very expensive for cloud environment. This issue
> is mitigated to some extent by HIVE-14920 but still that requires file system
> scan. We can make decision based on the stats from metastore and falling back
> when stats is not available. Since fast stats (numRows and fileSize) is
> always available this should work most of the time.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)