A DataFrame is essentially an RDD with a schema associated with it. If the data you are transforming has a defined schema, it is generally advisable to use a DataFrame, since it comes with efficient APIs for working with structured data. This is explained neatly in the official docs.
Thanks and regards
Vinay Kashyap

On Wed, Mar 23, 2016 at 7:56 AM Jeff Zhang <[email protected]> wrote:

> Please check the official doc
>
> http://spark.apache.org/docs/latest/
>
> On Wed, Mar 23, 2016 at 10:08 AM, asethia <[email protected]> wrote:
>
>> Hi,
>>
>> I am new to Spark, would like to know any guidelines when to use
>> DataFrame vs. RDD.
>>
>> Thanks,
>> As
>>
>> --
>> View this message in context:
>> http://apache-spark-user-list.1001560.n3.nabble.com/DataFrame-vs-RDD-tp26570.html
>> Sent from the Apache Spark User List mailing list archive at Nabble.com.
>
> --
> Best Regards
>
> Jeff Zhang
