I see you already found the filtering code. If you don't want any filters to run you can also remove all filter plugins from the config at the parsing stage.
Anyway, it seems you can use such a configuration directive but other may also like to disable normalizing at that stage for whatever reason. Can you provide a patch for trunk? The 1.4 code is already slightly outdated but it should be easy to modify. https://issues.apache.org/jira/browse/NUTCH-1242 On Wednesday 04 January 2012 23:11:31 Eddie Drapkin wrote: > Is there any way to disable the URL filter plugins in the parse step? I > want to only filter at fetch generate time, but it seems that the parse > process runs the segment through the URL filters I have setup. I can't > even find where this happens in the code, or I'd just change it myself. > Where is this behavior happening and how can I change it? > > Thanks, > Eddie -- Markus Jelsma - CTO - Openindex

