Hey all,
I see on the MLLib website that there are plans to deprecate the RDD based API
for MLLib once the new ML API reaches feature parity with RDD based one. Are
there currently plans to reimplement all the distributed linear algebra /
matrices operations as part of this new API, or are the
The short answer is those distributed linalg parts will not go away.
In the medium term, it's much less likely that the distributed matrix
classes will be ported over to DataFrames (though the ideal would be to
have DataFrame-backed distributed matrix classes) - given the time and
effort it's take
Hi,
If I remember correctly, the TIMESTAMP type had UTC-normalized local time
semantics even before Spark 2, so I can understand that Spark considers it
to be the "established" behavior that must not be broken. Unfortunately,
this behavior does not provide interoperability with other SQL engines o
From: Georg Heiler [mailto:georg.kf.hei...@gmail.com]
Sent: Monday, May 29, 2017 2:23 PM
To: Spark Dev List
Subject: Generic datasets implicit encoder missing
Hi,
Anyone knows what is wrong with using a generic
https://stackoverflow.com/q/44247874/2587904 to construct a dataset? Ev
Last call, anything else important in-flight for 2.2?
On Thu, May 25, 2017 at 10:56 AM, Michael Allman
wrote:
> PR is here: https://github.com/apache/spark/pull/18112
>
>
> On May 25, 2017, at 10:28 AM, Michael Allman wrote:
>
> Michael,
>
> If you haven't started cutting the new RC, I'm workin