many terms in group by

tim robertson Fri, 03 Jul 2009 14:12:29 -0700

Hi all,

I have several MapReduce jobs that are basically doing counts with
group by on tab delimited files.
Getting tired of writing the same thing over again for each report I
am thinking of trying Hive for this.


Does Hive work ok with 9 or so terms in the group by?
(e.g. it is happy concatenating the fields to make the key to emit
from the map so it can do the count is a reduce and complete in one
mapreduce job)

I'm meaning the equivalent of:
  select a,b,c,d,e,f,g,h,i,count(*) from table x group by a,b,c,d,e,f,g,h,i;

Many thanks,
Tim

many terms in group by

Reply via email to