See the code snippet below from org.apache.nutch.crawl.Crawl. I think parsing happens opposite to what the nutch-site.xml config file indicates.
public static void main(...) {
...
if (!Fetcher.isParsing(job)) {
parseSegment.parse(segment); // parse it, if needed
}
...
}
Kevin
