Thanks!

From: Herman van Hövell tot Westerflier [mailto:hvanhov...@questtec.nl]
Sent: Fri, Jun 03, 2016 10:05
To: Gerhard Fiedler <gfied...@algebraixdata.com>
Cc: dev@spark.apache.org
Subject: Re: Where is DataFrame.scala in 2.0?

Hi Gerhard,

DataFrame and DataSet have been merged in Spark 2.0. A DataFrame is now a 
DataSet that contains Row objects. We still maintain a type alias for 
DataFrame: 
https://github.com/apache/spark/blob/master/sql/core/src/main/scala/org/apache/spark/sql/package.scala#L45

HTH

Kind regards,

Herman van Hövell tot Westerflier

2016-06-03 17:01 GMT+02:00 Gerhard Fiedler 
<gfied...@algebraixdata.com<mailto:gfied...@algebraixdata.com>>:
When I look at the sources in Github, I see DataFrame.scala at 
https://github.com/apache/spark/blob/branch-1.6/sql/core/src/main/scala/org/apache/spark/sql/DataFrame.scala
 in the 1.6 branch. But when I change the branch to branch-2.0 or master, I get 
a 404 error. I also can’t find the file in the directory listings, for example 
https://github.com/apache/spark/tree/branch-2.0/sql/core/src/main/scala/org/apache/spark/sql
 (for branch-2.0).

It seems that quite a few APIs use the DataFrame class, even in 2.0. Can 
someone please point me to its location, or otherwise explain why it is not 
there?

Thanks,
Gerhard


Reply via email to