Hi, You can't invoke any RDD actions/transformations inside another transformations. They must be invoked by the driver.
If I understand your purpose correctly, you can partition your data (i.e., `partitionBy`) when writing out to parquet files. ----- Liang-Chi Hsieh | @viirya Spark Technology Center http://www.spark.tc/ -- View this message in context: http://apache-spark-developers-list.1001551.n3.nabble.com/Null-pointer-exception-with-RDD-while-computing-a-method-creating-dataframe-tp20308p20309.html Sent from the Apache Spark Developers List mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe e-mail: dev-unsubscr...@spark.apache.org