Prasanth Jayachandran created HIVE-15065:
--------------------------------------------

             Summary: SimpleFetchOptimizer should decide based on metastore 
stats when available
                 Key: HIVE-15065
                 URL: https://issues.apache.org/jira/browse/HIVE-15065
             Project: Hive
          Issue Type: Bug
          Components: Logical Optimizer
    Affects Versions: 2.2.0
            Reporter: Prasanth Jayachandran
            Assignee: Prasanth Jayachandran


Currently the decision to use fetch optimizer or not is based on scanning the 
filesystem for file lengths and see if the aggregated size is less the fetch 
task threshold. This can be very expensive for cloud environment. This issue is 
mitigated to some extent by HIVE-14920 but still that requires file system 
scan. We can make decision based on the stats from metastore and falling back 
when stats is not available. Since fast stats (numRows and fileSize) is always 
available this should work most of the time. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to