Github user silentsnooc commented on the issue: https://github.com/apache/spark/pull/8707 Hi! I am not sure if this is related but is I look for this issue everything points me here basically. I'm getting ``` f(self, obj) # Call unbound method with explicit self File "/usr/lib/python2.7/pickle.py", line 649, in save_dict self._batch_setitems(obj.iteritems()) File "/usr/lib/python2.7/pickle.py", line 681, in _batch_setitems save(v) File "/usr/lib/python2.7/pickle.py", line 286, in save f(self, obj) # Call unbound method with explicit self File "/opt/spark/python/lib/pyspark.zip/pyspark/cloudpickle.py", line 315, in save_builtin_function return self.save_function(obj) File "/opt/spark/python/lib/pyspark.zip/pyspark/cloudpickle.py", line 191, in save_function if islambda(obj) or obj.__code__.co_filename == '<stdin>' or themodule is None: AttributeError: 'builtin_function_or_method' object has no attribute '__code__' ``` When trying to create a data frame from an RDD: ```python rdd = self.sc.textFile(self.input_file_path).map(lambda line: self.process_line(line)) schema = StructType([StructField(u'Variable', StringType(), nullable=False), StructField(u'Time', TimestampType(), nullable=False), StructField(u'Value', FloatType(), nullable=False)]) return sql_context.createDataFrame(rdd, schema) ``` I am on PySpark 1.6.0 - any ideas what I'm doing wrong here?
--- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org