Agree that toDF is not very useful. In fact it was removed from the namespace in a recent change https://github.com/apache/spark/commit/4e930420c19ae7773b138dfc7db8fc03b4660251
Thanks Shivaram On Fri, May 8, 2015 at 1:10 AM, Sun, Rui <rui....@intel.com> wrote: > toDF() is defined to convert an RDD to a DataFrame. But it is just a very > thin wrapper of createDataFrame() by help the caller avoid input of > SQLContext. > > Since Scala/pySpark does not have toDF(), and we'd better keep API as > narrow and simple as possible. Is toDF() really necessary? Could we > eliminate it? > > >