Xiangrui Meng created SPARK-2944:
------------------------------------
Summary: sc.makeRDD doesn't distribute partitions evenly
Key: SPARK-2944
URL: https://issues.apache.org/jira/browse/SPARK-2944
Project: Spark
Issue Type: Bug
Components: Spark Core
Affects Versions: 1.1.0
Reporter: Xiangrui Meng
Assignee: Xiangrui Meng
Priority: Critical
16 nodes EC2 cluster:
{code}
val rdd = sc.makeRDD(0 until 1e9, 1000).cache()
rdd.count()
{code}
Saw 156 partitions on one node while only 8 partitions on another.
--
This message was sent by Atlassian JIRA
(v6.2#6252)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]