Github user Agent007 commented on the pull request:
https://github.com/apache/spark/pull/8431#issuecomment-136424430
Thanks for the feedback @brennonyork The Scala example was done in a
previous commit on this branch. When I tried to do the same thing in Python, I
got the error message: "It appears that you are attempting to broadcast an RDD
or reference an RDD from an action or transformation. RDD transformations and
actions can only be invoked by the driver, not inside of other transformations;
for example, rdd1.map(lambda x: rdd2.values.count() * x) is invalid because the
values transformation and count action cannot be performed inside of the
rdd1.map transformation. For more information, see SPARK-5063."
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]