Wondering about performance and COUNT...

A = load 'test.csv' as (a1,a2,a3);
B = GROUP A by a1;
-- which is preferred?
C = FOREACH B GENERATE COUNT(A);
-- or would this send only a single field through COUNT and be more performant?
C = FOREACH B GENERATE COUNT(A.a2);

- COUNT(A.field1) Corbin Hoenes