See the code snippet below from org.apache.nutch.crawl.Crawl. I think the
separate parse step runs under the opposite condition from what the
nutch-site.xml config file indicates.

public static void main(...) {
  ...

  if (!Fetcher.isParsing(job)) {
    parseSegment.parse(segment);    // parse it, if needed
  }

  ...
}
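
To make the condition easier to check against the config, here is a minimal standalone sketch of what that branch does. It is not Nutch code: ParseDecisionSketch and runsSeparateParse are hypothetical names, and isParsing here is only a stand-in that assumes Fetcher.isParsing(job) reports whether the fetcher is configured to parse while fetching (presumably the fetcher.parse property; that name is an assumption, not something shown in the snippet).

```java
public class ParseDecisionSketch {

    // Hypothetical stand-in for Fetcher.isParsing(job). Assumption: the real
    // method returns true when the fetcher is configured to parse pages
    // itself during the fetch step.
    static boolean isParsing(boolean fetcherParseSetting) {
        return fetcherParseSetting;
    }

    // Mirrors the condition in the Crawl snippet: the separate parse step
    // runs only when the fetcher did NOT already parse.
    static boolean runsSeparateParse(boolean fetcherParseSetting) {
        return !isParsing(fetcherParseSetting);
    }

    public static void main(String[] args) {
        // Print the decision for both possible values of the setting.
        for (boolean setting : new boolean[] {true, false}) {
            System.out.println("fetcher parses = " + setting
                + " -> separate parse step runs = " + runsSeparateParse(setting));
        }
    }
}
```

Under that assumption, the separate parse runs only when the fetcher is set not to parse, so the thing to compare against nutch-site.xml is which of those two cases the config value is supposed to select.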


Kevin
