Hi Amit, I know this thread is a bit old now, however it is also something which bugged me when I was looking into something else (InjectorJob counters).
On Tue, Mar 5, 2013 at 3:16 AM, Amit Sela <[email protected]> wrote: > > And summing all counters does not equal the total map input... > > Summing all of theses numbers does equal the total map input. I also note > that the total hit by time limit here is 50927 but the job counters show > 7493. > > Basically, the easiest way to see and generally understand counters is to run the Nutch application within your Hadoop cluster (if no cluster available then use psudo mode) and use the web application interface to Hadoop. You will clearly see all counters associated with the job and you can take it from there. I like the notion of creating custom counters to obtain specific metrics but this is solely driven by user requirements. Do you want to learn more about counters? Look into the code. Do you want to know more about Nutch counters, or make the counters more explicit? Then consider opening a Jira issue and we can discuss this in more detail. With regards to the Fetcher, there are many possible areas where counters are (and could be) really useful... as I said though this s only driven by user requirements.

