It would be nice if there was an interface compliant way. Perhaps it becomes available in the 0.20 and beyond api's.
On Sat, Jun 13, 2009 at 3:40 PM, Rares Vernica <rvern...@gmail.com> wrote: > Hello, > > In Reduce, can I get the number of values for the current key without > iterating over them? Does Hadoop has this number? > > Or, at least the total number of pairs that will be processed by the > current Reduce instance. I am pretty sure that Hadoop already knows > this number because it sorted them. > > BTW, the iterators given to Reduce are one-time use iterators, right? > > Thanks! > Rares > -- Pro Hadoop, a book to guide you from beginner to hadoop mastery, http://www.apress.com/book/view/9781430219422 www.prohadoopbook.com a community for Hadoop Professionals