Kevin Mader created SPARK-4698:
----------------------------------
Summary: Data-locality aware Partitioners
Key: SPARK-4698
URL: https://issues.apache.org/jira/browse/SPARK-4698
Project: Spark
Issue Type: Improvement
Components: Spark Core
Reporter: Kevin Mader
Priority: Minor
The current hash and range partitioner tools do not seem to respect the
existing data-locality. A 'dictionary' driven partitioner that calculated the
partitions based on the existing key locations instead of re-calculating them
would be ideal.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]