Re: Hi / Aggregation support

viktor.rosenfeld Tue, 04 Nov 2014 06:13:53 -0800

Hi Fabian,


Fabian Hueske wrote
> DataSet<Tuple2<String, Integer>> ds = ...
> DataSet<Tuple4<String, Integer, Integer, Long>> result =
> ds.groupBy(0).key(0).andMin(1).andMax(1).andCnt();
> 
> The second example explicitly extracts the key
> from the original input data.

Wouldn't it make sense to use the call to groupBy() to also extract the key
fields? Then the call to key(0) in your example would be redundant. If
multiple fields are specified in groupBy(), all of them would be used as the
key. If the user wants only some of them as the key, they can specify those
fields by explicitly calling the key() method. Specifying a field in key()
that is not used in groupBy() would be an error. This is close to (proper)
SQL semantics.
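
To make the rule concrete, here is a minimal sketch in plain Java. It is not
Flink code: GroupKeySketch and resolveKeyFields are hypothetical names I made
up for illustration, and the sketch only models which fields end up in the
key, not the aggregation itself.

```java
import java.util.Arrays;

// Hypothetical helper, not part of the Flink API.
// It models only the key-resolution rule proposed in this message.
final class GroupKeySketch {

    // groupByFields: field positions passed to groupBy().
    // keyFields: field positions passed to key(); empty if key() was not called.
    // Returns the field positions that form the key of the result.
    static int[] resolveKeyFields(int[] groupByFields, int... keyFields) {
        if (keyFields.length == 0) {
            // No explicit key() call: all grouping fields are used as the key.
            return groupByFields.clone();
        }
        for (int key : keyFields) {
            boolean grouped = false;
            for (int g : groupByFields) {
                if (g == key) {
                    grouped = true;
                    break;
                }
            }
            if (!grouped) {
                // A key field that is not a grouping field is an error.
                throw new IllegalArgumentException(
                        "Field " + key + " is used in key() but not in groupBy()");
            }
        }
        return keyFields.clone();
    }

    public static void main(String[] args) {
        // Models groupBy(0) without key(): the key defaults to field 0.
        System.out.println(Arrays.toString(resolveKeyFields(new int[] {0})));
        // Models groupBy(0, 2).key(2): only field 2 is kept as the key.
        System.out.println(Arrays.toString(resolveKeyFields(new int[] {0, 2}, 2)));
        try {
            // Models groupBy(0).key(1): field 1 is not a grouping field.
            resolveKeyFields(new int[] {0}, 1);
        } catch (IllegalArgumentException e) {
            System.out.println("rejected: " + e.getMessage());
        }
    }
}
```

Under this rule, groupBy(0).key(0) and plain groupBy(0) resolve to the same
key, which is why the explicit key(0) call looks redundant to me.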

What do you think?

I'm not a big fan of how MySQL lets you select attributes that are neither
grouped nor aggregated and returns an arbitrary value for them. (I think
that's a bug in MySQL, although there's probably a reason for the behavior.)

Best,
Viktor



