This is something we are hoping to support in Spark 1.4. We'll post more information to JIRA when there is a design.
On Thu, Mar 26, 2015 at 11:22 PM, Jianshi Huang <jianshi.hu...@gmail.com> wrote: > Hi, > > Anyone has similar request? > > https://issues.apache.org/jira/browse/SPARK-6561 > > When we save a DataFrame into Parquet files, we also want to have it > partitioned. > > The proposed API looks like this: > > def saveAsParquet(path: String, partitionColumns: Seq[String]) > > > > -- > Jianshi Huang > > LinkedIn: jianshi > Twitter: @jshuang > Github & Blog: http://huangjs.github.com/ >