In Spark 2.0, DataFrames and Datasets are unified: a DataFrame is simply an
alias for a Dataset of type Row (i.e. Dataset[Row]). I have a few questions.
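
For reference, my understanding is that in the Scala API this is literally a
type alias, declared roughly like this in the org.apache.spark.sql package
object:

type DataFrame = Dataset[Row]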

1) What does this really mean for an application developer?
2) Why was this unification needed in Spark 2.0?
3) What changes can be observed in Spark 2.0 vs Spark 1.6?
4) Will compile-time safety be available for DataFrames too? (See the sketch
   after this list for the kind of thing I mean.)
5) Is the Python API supported for Datasets in 2.0?
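
To make question 4 concrete, here is a minimal sketch, assuming a spark-shell
session (so `spark` and its implicits are available) and a hypothetical
people.json file:

case class Person(name: String, age: Long)
import spark.implicits._

// Read as an untyped DataFrame, then convert to a typed Dataset[Person].
val ds = spark.read.json("people.json").as[Person]

ds.map(_.age + 1)          // a field typo like _.agee is a compile-time error
ds.toDF().select("agee")   // the untyped equivalent only fails at runtime
                           // (AnalysisException during analysis)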

Thanks
Arun
