Hi Chao,

> Your approach is tricky. I agree that this kind of MR logic is pretty
> common. So it would be nice to add such feature to crunch. At the first
> glance, I think the problem in PTable#collectValues is that it return a
> PTable rather than a PGroupedTable (I haven't check the internal logic yet).
>
>
I think that PTable#collectValues is for a different kind of use case --
internally it just does a groupByKey and then puts all the values in a
single collection for each key, so I'm not sure how it would apply here. Or
did you mean the combineValues method?

- Gabriel

Reply via email to