Chandan Kumar created SPARK-1662:
------------------------------------

             Summary: PySpark fails if python class is used as a data container
                 Key: SPARK-1662
                 URL: https://issues.apache.org/jira/browse/SPARK-1662
             Project: Spark
          Issue Type: Bug
          Components: PySpark
    Affects Versions: 1.0.0
         Environment: Ubuntu 14, Python 2.7.6
            Reporter: Chandan Kumar
            Priority: Minor


PySpark fails if RDD operations are performed on data encapsulated in Python 
objects (rare use case where plain python objects are used as data containers 
instead of regular dict or tuples).

I have written a small piece of code to reproduce the bug:
https://gist.github.com/nrchandan/11394440

<script src="https://gist.github.com/nrchandan/11394440.js";></script>





--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to