Re: groupBy RDD does not have grouping column ?

2014-03-31 Thread Michael Armbrust
This is similar to how SQL works, items in the GROUP BY clause are not
included in the output by default.  You will need to include 'a in the
second parameter list (which is similar to the SELECT clause) as well if
you want it included in the output.


On Sun, Mar 30, 2014 at 9:52 PM, Manoj Samel manojsamelt...@gmail.comwrote:

 Hi,

 If I create a groupBy('a)(Sum('b) as 'foo, Sum('c) as 'bar), then the
 resulting RDD should have 'a, 'foo and 'bar.

 The result RDD just shows 'foo and 'bar and is missing 'a

 Thoughts?

 Thanks,

 Manoj



groupBy RDD does not have grouping column ?

2014-03-30 Thread Manoj Samel
Hi,

If I create a groupBy('a)(Sum('b) as 'foo, Sum('c) as 'bar), then the
resulting RDD should have 'a, 'foo and 'bar.

The result RDD just shows 'foo and 'bar and is missing 'a

Thoughts?

Thanks,

Manoj