[ https://issues.apache.org/jira/browse/HIVE-10989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rui Li updated HIVE-10989:
--------------------------
    Attachment: HIVE-10989.1-spark.patch

The flags are already set correctly in the MapWork. We just need to create 
the RDD accordingly.

> HoS can't control number of map tasks for runtime skew join [Spark Branch]
> --------------------------------------------------------------------------
>
>                 Key: HIVE-10989
>                 URL: https://issues.apache.org/jira/browse/HIVE-10989
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Spark
>            Reporter: Rui Li
>            Assignee: Rui Li
>         Attachments: HIVE-10989.1-spark.patch
>
>
> Flags {{hive.skewjoin.mapjoin.map.tasks}} and 
> {{hive.skewjoin.mapjoin.min.split}} are used to control the number of map 
> tasks for the map join of runtime skew join. They work well for MR but have 
> no effect on Spark.
> This makes runtime skew join less useful, i.e. we just end up with slow 
> mappers instead of slow reducers.
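
To illustrate why the two flags matter, here is a minimal sketch of the split-size arithmetic, assuming the rule used by Hadoop's old-API FileInputFormat (goalSize = totalSize / numSplits, splitSize = max(minSize, min(goalSize, blockSize))). The class name, method names, input size and block size are hypothetical and chosen only for illustration; the flag values are believed to be the Hive defaults. This is not the code from the attached patch, and the task count is only an estimate because the real input format also applies a slop factor to the last split.

```java
// Hypothetical sketch: estimates how the requested map-task count and the
// minimum split size bound the number of map tasks for a skewed input.
class SkewJoinSplitSketch {

    // Split-size rule assumed from Hadoop's old-API FileInputFormat.
    static long splitSize(long totalSize, int numMapTasks, long minSplit, long blockSize) {
        long goalSize = totalSize / Math.max(numMapTasks, 1);
        return Math.max(minSplit, Math.min(goalSize, blockSize));
    }

    // Rough task count: ceiling of totalSize / splitSize (ignores split slop).
    static long estimatedMapTasks(long totalSize, int numMapTasks, long minSplit, long blockSize) {
        long size = splitSize(totalSize, numMapTasks, minSplit, blockSize);
        return (totalSize + size - 1) / size;
    }

    public static void main(String[] args) {
        long totalSize = 10L * 1024 * 1024 * 1024; // 10 GiB of skewed rows (made-up figure)
        long blockSize = 128L * 1024 * 1024;       // 128 MiB block size (assumption)
        long minSplit = 33554432L;                 // hive.skewjoin.mapjoin.min.split
        int mapTasks = 10000;                      // hive.skewjoin.mapjoin.map.tasks

        // Hint honoured: the goal size is tiny, so the min split size decides.
        System.out.println(estimatedMapTasks(totalSize, mapTasks, minSplit, blockSize));
        // Hint ignored (a single requested split): the block size decides.
        System.out.println(estimatedMapTasks(totalSize, 1, minSplit, blockSize));
    }
}
```

Under these assumptions, honouring the hint yields noticeably more (and smaller) map tasks than ignoring it, which is the effect the issue reports as missing on Spark.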



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
