GitHub user bijaybisht opened a pull request:
https://github.com/apache/spark/pull/329
Added validation check for parallelizing a seq
This fixes a bug where in a Seq can be converted into a RDD with partitions
more than the number of elements it has.
Also fixes the bug in the handling of the NumericRange.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/Guavus/spark
hotfix-spark-0.9.1/parallelize_validation
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/329.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #329
----
commit b54236ab9e7dae60bd43f576338cd64879546db0
Author: Bijay Bisht <[email protected]>
Date: 2014-04-04T19:25:04Z
Added validation check for parallelizing a seq
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---