[ 
https://issues.apache.org/jira/browse/HIVE-8208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xuefu Zhang resolved HIVE-8208.
-------------------------------
       Resolution: Won't Fix
    Fix Version/s: spark-branch

With HIVE-8118, this is no longer needed.

> Multi-table insertion optimization #1: don't always break operator tree. 
> [Spark Branch]
> ---------------------------------------------------------------------------------------
>
>                 Key: HIVE-8208
>                 URL: https://issues.apache.org/jira/browse/HIVE-8208
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Chao
>             Fix For: spark-branch
>
>
> Currently, with the current patch of multi-table insertion, it will break 
> whenever there exists one TableScanOperator that can leads to multiple 
> FileSinkOperators. Then, it identifies the lowest common ancestor (LCA), and 
> breaks the tree there, creating same number of child SparkTasks as the number 
> of FileSinkOperators.
> However, in the following situation it's better not to break the operator 
> tree:
> Of all the paths from these FileSinkOperators to the LCA, if 
> ReduceSinkOperator only exist in 0 or 1 path of them.
> In this case, we can do it in one spark job, and no need to break the 
> operator tree.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to