Mridul Muralidharan created SPARK-6170:
------------------------------------------

             Summary: Add support for skew join 
                 Key: SPARK-6170
                 URL: https://issues.apache.org/jira/browse/SPARK-6170
             Project: Spark
          Issue Type: Improvement
          Components: Spark Core
            Reporter: Mridul Muralidharan
            Priority: Minor



Add an efficient skew join. The current Spark join performs very poorly when
the data is badly skewed. There is ample literature on how to solve this
(including implementations in Pig and Hive, which have tackled this, IIRC).
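
For illustration only, here is a minimal sketch of one well-known technique
from that literature, key salting. It is plain Python on in-memory lists, not
Spark code, and it is not necessarily how Pig or Hive implement their skew
joins. The function name salted_join and the parameters hot_keys, num_salts
and seed are hypothetical; the sketch also assumes the set of skewed keys is
already known (in practice it would have to come from sampling).

```python
import random
from collections import defaultdict


def salted_join(left, right, hot_keys, num_salts=4, seed=0):
    """Inner-join two lists of (key, value) pairs.

    Rows whose key is in hot_keys are spread over num_salts buckets so that
    no single (key, salt) bucket has to handle every row of a skewed key.
    """
    rng = random.Random(seed)

    # Build side: rows of a hot key are replicated once per salt value,
    # rows of any other key are stored once under salt 0.
    buckets = defaultdict(list)
    for key, value in right:
        salts = range(num_salts) if key in hot_keys else (0,)
        for salt in salts:
            buckets[(key, salt)].append(value)

    # Probe side: each row of a hot key picks one random salt, so the rows
    # of that key are divided among num_salts buckets instead of one.
    joined = []
    for key, value in left:
        salt = rng.randrange(num_salts) if key in hot_keys else 0
        for other in buckets.get((key, salt), ()):
            joined.append((key, value, other))
    return joined


if __name__ == "__main__":
    # Key "a" is heavily skewed on the left side.
    left = [("a", i) for i in range(100)] + [("b", 1)]
    right = [("a", "x"), ("a", "y"), ("b", "z"), ("c", "w")]
    print(len(salted_join(left, right, hot_keys={"a"})))
```

In a distributed setting the (key, salt) pair would serve as the partitioning
key, so the work for a skewed key is shared by up to num_salts tasks at the
cost of replicating the matching rows on the other side.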

I won't be able to work on or contribute to this, unfortunately. It would be
great if someone else could pick this up; I wanted to make sure this task does
not get missed.



