[ https://issues.apache.org/jira/browse/SPARK-2043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14019482#comment-14019482 ]
Matei Zaharia commented on SPARK-2043:
--------------------------------------

https://github.com/apache/spark/pull/986

> ExternalAppendOnlyMap doesn't always find matching keys
> -------------------------------------------------------
>
> Key: SPARK-2043
> URL: https://issues.apache.org/jira/browse/SPARK-2043
> Project: Spark
> Issue Type: Bug
> Components: Spark Core
> Affects Versions: 0.9.0, 0.9.1, 1.0.0
> Reporter: Matei Zaharia
> Assignee: Matei Zaharia
> Priority: Blocker
>
> The current implementation reads one key with the next hash code as it
> finishes reading the keys with the current hash code, which may cause it to
> miss some matches of the next key. This can cause operations like join to
> give the wrong result when reduce tasks spill to disk and there are hash
> collisions, as values won't be matched together.

--
This message was sent by Atlassian JIRA
(v6.2#6252)
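
To illustrate the failure mode in the issue description, here is a minimal sketch in Python of merging spilled streams that are each sorted by key hash code. It is a simplified model, not Spark's ExternalAppendOnlyMap code and not necessarily the approach taken in the linked pull request. The names merge_sorted_by_hash, key_hash, and combine are hypothetical. The point it tries to show is that every pair sharing the current hash code has to be consumed from every stream before keys are matched, so that colliding keys are still combined correctly.

```python
from itertools import groupby


def merge_sorted_by_hash(streams, key_hash, combine):
    """Merge streams of (key, value) pairs, each sorted by key_hash(key).

    Hypothetical helper for illustration only. Values of equal keys are
    combined with combine(a, b), even when distinct keys share a hash code.
    """
    # Split each stream into batches of pairs that share one hash code.
    batched = [
        [(h, list(group))
         for h, group in groupby(s, key=lambda kv: key_hash(kv[0]))]
        for s in streams
    ]
    positions = [0] * len(batched)
    result = []
    while True:
        # Hash codes at the head of every stream that still has data.
        heads = [b[p][0] for b, p in zip(batched, positions) if p < len(b)]
        if not heads:
            return result
        min_hash = min(heads)
        merged = {}  # key -> combined value, for this hash code only
        for i, b in enumerate(batched):
            p = positions[i]
            if p < len(b) and b[p][0] == min_hash:
                # Consume the whole batch for this hash code, not just
                # the first key, so no colliding key is left behind.
                for key, value in b[p][1]:
                    if key in merged:
                        merged[key] = combine(merged[key], value)
                    else:
                        merged[key] = value
                positions[i] += 1
        result.extend(merged.items())
```

For example, with key_hash=len the keys "ab" and "cd" collide. Merging [("ab", 1), ("cd", 2)] with [("cd", 10), ("ab", 20)] using addition as combine still pairs each key with its own values, even though the two streams list the colliding keys in different orders.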