Re: [VOTE] Apache Spark 2.2.0 (RC2)

2017-05-30 Thread Michael Armbrust
Last call, anything else important in-flight for 2.2? On Thu, May 25, 2017 at 10:56 AM, Michael Allman wrote: > PR is here: https://github.com/apache/spark/pull/18112 > > > On May 25, 2017, at 10:28 AM, Michael Allman wrote: > > Michael, > > If you

dev-unsubscr...@spark.apache.org

2017-05-30 Thread williamtellme123
From: Georg Heiler [mailto:georg.kf.hei...@gmail.com] Sent: Monday, May 29, 2017 2:23 PM To: Spark Dev List Subject: Generic datasets implicit encoder missing Hi, Anyone knows what is wrong with using a generic https://stackoverflow.com/q/44247874/2587904 to

Re: SQL TIMESTAMP semantics vs. SPARK-18350

2017-05-30 Thread Zoltan Ivanfi
Hi, If I remember correctly, the TIMESTAMP type had UTC-normalized local time semantics even before Spark 2, so I can understand that Spark considers it to be the "established" behavior that must not be broken. Unfortunately, this behavior does not provide interoperability with other SQL engines

Re: RDD MLLib Deprecation Question

2017-05-30 Thread Nick Pentreath
The short answer is those distributed linalg parts will not go away. In the medium term, it's much less likely that the distributed matrix classes will be ported over to DataFrames (though the ideal would be to have DataFrame-backed distributed matrix classes) - given the time and effort it's

RDD MLLib Deprecation Question

2017-05-30 Thread John Compitello
Hey all, I see on the MLLib website that there are plans to deprecate the RDD based API for MLLib once the new ML API reaches feature parity with RDD based one. Are there currently plans to reimplement all the distributed linear algebra / matrices operations as part of this new API, or are