Hi Toni.
Given that there is more than one measure per (user, hour), which
measure do you want to keep? The sum, the mean, or the most recent
measure? For the sum or the mean you don't need to care about the
timing. And if you want the most recent, then you can include the
timestamp in the redu
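
To make the three options concrete, here is a minimal sketch on plain
Scala collections (2.13 or later, for groupMapReduce). The Measure case
class, its field names, and the epoch-millisecond timestamp are
assumptions for illustration, not something taken from your data. The
reduce functions are associative, so the same functions should carry
over to reduceByKey on a keyed DStream.

```scala
// Hypothetical record shape: field names and units are assumptions.
final case class Measure(user: String, timestamp: Long, value: Double)

object ReduceByUserHour {
  private val MillisPerHour = 3600000L

  // Key each measure by (user, hour since the epoch).
  def key(m: Measure): (String, Long) = (m.user, m.timestamp / MillisPerHour)

  // Sum: arrival order does not matter.
  def sums(ms: Seq[Measure]): Map[(String, Long), Double] =
    ms.groupMapReduce(key)(_.value)(_ + _)

  // Mean: reduce (sum, count) pairs, then divide at the end.
  def means(ms: Seq[Measure]): Map[(String, Long), Double] =
    ms.groupMapReduce(key)(m => (m.value, 1L)) {
      case ((s1, c1), (s2, c2)) => (s1 + s2, c1 + c2)
    }.map { case (k, (s, c)) => k -> (s / c) }

  // Most recent: carry the timestamp in the reduced value and compare on it.
  def latest(ms: Seq[Measure]): Map[(String, Long), Double] =
    ms.groupMapReduce(key)(m => (m.timestamp, m.value)) { (a, b) =>
      if (a._1 >= b._1) a else b
    }.map { case (k, (_, v)) => k -> v }
}
```

Only the "most recent" variant needs the timestamp inside the value
being reduced; the sum and the mean give the same result whatever order
the measures arrive in.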
Hi,
I'm new to Spark and need some help framing the problem I have. A
problem well stated is half solved, as the saying goes :)
Let's say that I have a DStream[String] containing JSON with some
measurements from IoT devices. To keep it simple, say that
after unmars