Mridul Muralidharan created SPARK-6170:
------------------------------------------
Summary: Add support for skew join
Key: SPARK-6170
URL: https://issues.apache.org/jira/browse/SPARK-6170
Project: Spark
Issue Type: Improvement
Components: Spark Core
Reporter: Mridul Muralidharan
Priority: Minor
Do an efficient skew join - current spark join does fairly horribly when there
are bad skews in the data. There is sufficient literature on how to solve this
(including impl in pig and hive which have tackled this IIRC).
I wont be able to work/contribute on this unfortunately, would be great if
someone else can pick this up - wanted to ensure this task is not missed out.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]