Matei Zaharia created SPARK-2043:
------------------------------------
Summary: ExternalAppendOnlyMap doesn't always find matching keys
Key: SPARK-2043
URL: https://issues.apache.org/jira/browse/SPARK-2043
Project: Spark
Issue Type: Bug
Components: Spark Core
Affects Versions: 1.0.0, 0.9.1, 0.9.0
Reporter: Matei Zaharia
Assignee: Matei Zaharia
Priority: Blocker
When merging spilled data, the current implementation reads one key with the
next hash code as soon as it finishes reading the keys with the current hash
code. Because that key is read ahead of the other keys that share its hash
code, some of its matches may be missed. As a result, operations like join can
return wrong results when reduce tasks spill to disk and there are hash
collisions, because values for the same key are not matched together.
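
For illustration only, here is a minimal sketch of the behavior a correct merge
needs: gather every pair that shares a hash code across all spilled streams
before matching keys by equality. This is not Spark's code. It is written in
Python for brevity, and the names merge_spilled_streams, hash_fn and combine
are hypothetical. It assumes each stream is already sorted by hash_fn(key).

```python
import heapq
from itertools import groupby
from operator import add


def merge_spilled_streams(streams, hash_fn, combine):
    """Merge streams of (key, value) pairs, each sorted by hash_fn(key).

    Hypothetical helper: yields one (key, combined_value) pair per distinct key.
    """
    # Tag every pair with its hash code so the streams can be merged by hash.
    tagged = [((hash_fn(k), k, v) for k, v in stream) for stream in streams]
    merged = heapq.merge(*tagged, key=lambda t: t[0])

    # Consume ALL pairs with the same hash code before moving to the next one.
    for _, group in groupby(merged, key=lambda t: t[0]):
        combined = []  # [key, value] entries, matched by equality only
        for _, key, value in group:
            for entry in combined:
                if entry[0] == key:
                    entry[1] = combine(entry[1], value)
                    break
            else:
                combined.append([key, value])
        for key, value in combined:
            yield key, value


# Toy hash (string length) that forces "a" and "b" to collide.
stream1 = [("a", 1), ("b", 2), ("cc", 3)]
stream2 = [("b", 10), ("a", 20), ("cc", 30)]
result = dict(merge_spilled_streams([stream1, stream2], len, add))
```

In this toy input the two streams hold the colliding keys "a" and "b" in
different orders, which is the situation where reading only one key ahead could
leave a value unmatched.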