[jira] [Commented] (HIVE-10596) Further optimize metadata only query

Ashutosh Chauhan (JIRA) Wed, 30 Sep 2015 15:38:57 -0700

    [ 
https://issues.apache.org/jira/browse/HIVE-10596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14938987#comment-14938987
 ]


Ashutosh Chauhan commented on HIVE-10596:
-----------------------------------------

We can go farther than that by not launching job at all. Since query result can 
entirely be computed without launching a job, once we obtain result set we can 
attach that to 1) TS operator for next job or 2) to fetchtask if there are no 
other jobs. 
StatsOptimizer already does 2)

> Further optimize metadata only query 
> -------------------------------------
>
>                 Key: HIVE-10596
>                 URL: https://issues.apache.org/jira/browse/HIVE-10596
>             Project: Hive
>          Issue Type: Improvement
>          Components: Query Planning
>            Reporter: Selina Zhang
>            Assignee: Selina Zhang
>
> Currently with NullScan optimizer we skip the full table scan for metadata 
> only query. However, further step can be taken: writing metadata info to a 
> HDFS temp file and replace the original table info with the temp table info. 
> In this way we will not calculate the mapper number according the original 
> files. Users like to run metadata only query for facts table; calculates 
> splits and allocates resources for this type of query seems unnecessary.  
> We had this patch and it has been running for a while. Will upload it soon.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-10596) Further optimize metadata only query

Reply via email to