The cheapest way would be to check the counters as you write them in
the first place and keep a running score. :)

Sent from my mobile. Please excuse the typos.

On 2011-05-20, at 10:35 AM, "W.P. McNeill" <[email protected]> wrote:

> I've got a directory with a bunch of MapReduce data in it.  I want to know
> how many <Key, Value> pairs it contains.  I could write a mapper-only
> process that takes <Writeable, Writeable> pairs as input and updates a
> counter, but it seems like this utility should already exist.  Does it, or
> do I have to roll my own?
>
> Bonus question, is there a way to count the number of <Key, Value> pairs
> without deserializing the values?  This can be expensive for the data I'm
> working with.

Reply via email to