Re: Spark 2.0 Dataset Documentation

2016-06-18 Thread Pedro Rodriguez
Going to go ahead and starting working on the docs assuming this gets merged https://github.com/apache/spark/pull/13592. Opened a JIRA https://issues.apache.org/jira/browse/SPARK-16046 Having some issues building docs. The Java docs fail to build. Output when it fails is here:

Re: Spark 2.0 Dataset Documentation

2016-06-18 Thread Jacek Laskowski
On Sat, Jun 18, 2016 at 6:13 AM, Pedro Rodriguez wrote: > using Datasets (eg using $ to select columns). Or even my favourite one - the tick ` :-) Jacek - To unsubscribe, e-mail:

Re: Spark 2.0 Dataset Documentation

2016-06-18 Thread Reynold Xin
Please go for it! On Friday, June 17, 2016, Pedro Rodriguez wrote: > I would be open to working on Dataset documentation if no one else isn't > already working on it. Thoughts? > > On Fri, Jun 17, 2016 at 11:44 PM, Cheng Lian

Re: Spark 2.0 Dataset Documentation

2016-06-17 Thread Pedro Rodriguez
I would be open to working on Dataset documentation if no one else isn't already working on it. Thoughts? On Fri, Jun 17, 2016 at 11:44 PM, Cheng Lian wrote: > As mentioned in the PR description, this is just an initial PR to bring > existing contents up to date, so that

Re: Spark 2.0 Dataset Documentation

2016-06-17 Thread Cheng Lian
As mentioned in the PR description, this is just an initial PR to bring existing contents up to date, so that people can add more contents incrementally. We should definitely cover more about Dataset. Cheng On 6/17/16 10:28 PM, Pedro Rodriguez wrote: The updates look great! Looks like

Re: Spark 2.0 Dataset Documentation

2016-06-17 Thread Pedro Rodriguez
The updates look great! Looks like many places are updated to the new APIs, but there still isn't a section for working with Datasets (most of the docs work with Dataframes). Are you planning on adding more? I am thinking something that would address common questions like the one I posted on the

Re: Spark 2.0 Dataset Documentation

2016-06-17 Thread Cheng Lian
Hey Pedro, SQL programming guide is being updated. Here's the PR, but not merged yet: https://github.com/apache/spark/pull/13592 Cheng On 6/17/16 9:13 PM, Pedro Rodriguez wrote: Hi All, At my workplace we are starting to use Datasets in 1.6.1 and even more with Spark 2.0 in place of

Spark 2.0 Dataset Documentation

2016-06-17 Thread Pedro Rodriguez
Hi All, At my workplace we are starting to use Datasets in 1.6.1 and even more with Spark 2.0 in place of Dataframes. I looked at the 1.6.1 documentation then the 2.0 documentation and it looks like not much time has been spent writing a Dataset guide/tutorial. Preview Docs: