I set default_parallel=15 but when I did a
y = group z ALL; x = foreach y generate SIZE(z); the 2 lines generate a MR job with only 1 reducer. I guess it's because SIZE() needs to count all the groups. but don't we have the sort of cumulative/additive UDFs ? it would be faster if we could parallelize SIZE() thanks Yang
