Chandan Kumar created SPARK-1662:
------------------------------------
Summary: PySpark fails if python class is used as a data container
Key: SPARK-1662
URL: https://issues.apache.org/jira/browse/SPARK-1662
Project: Spark
Issue Type: Bug
Components: PySpark
Affects Versions: 1.0.0
Environment: Ubuntu 14, Python 2.7.6
Reporter: Chandan Kumar
Priority: Minor
PySpark fails if RDD operations are performed on data encapsulated in Python
objects (rare use case where plain python objects are used as data containers
instead of regular dict or tuples).
I have written a small piece of code to reproduce the bug:
https://gist.github.com/nrchandan/11394440
<script src="https://gist.github.com/nrchandan/11394440.js"></script>
--
This message was sent by Atlassian JIRA
(v6.2#6252)