Matei Zaharia created SPARK-2043:
------------------------------------

             Summary: ExternalAppendOnlyMap doesn't always find matching keys
                 Key: SPARK-2043
                 URL: https://issues.apache.org/jira/browse/SPARK-2043
             Project: Spark
          Issue Type: Bug
          Components: Spark Core
    Affects Versions: 1.0.0, 0.9.1, 0.9.0
            Reporter: Matei Zaharia
            Assignee: Matei Zaharia
            Priority: Blocker


The current implementation reads one key with the next hash code as it finishes 
reading the keys with the current hash code, which may cause it to miss some 
matches of the next key. This can cause operations like join to give the wrong 
result when reduce tasks spill to disk and there are hash collisions, as values 
won't be matched together.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to