I see you already found the filtering code. If you don't want any filters to 
run you can also remove all filter plugins from the config at the parsing 
stage.
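
A rough sketch of what I mean, assuming you keep a separate nutch-site.xml 
for the parse job only (generate still needs the filters). The plugin list 
below is illustrative, not your actual setup; the point is just that no 
urlfilter-* entry appears in plugin.includes, so no URL filter plugins 
should get loaded:

```xml
<!-- nutch-site.xml used only for the parse job (illustrative values) -->
<property>
  <name>plugin.includes</name>
  <!-- no urlfilter-* entry here, so no URL filter plugins are activated -->
  <value>protocol-http|parse-(html|tika)|index-(basic|anchor)|scoring-opic|urlnormalizer-(pass|regex|basic)</value>
</property>
```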

Anyway, it seems you could use such a configuration directive, and others may 
also want to disable normalizing at that stage for whatever reason. Can you 
provide a patch for trunk? The 1.4 code is already slightly outdated, but it 
should be easy to modify.

https://issues.apache.org/jira/browse/NUTCH-1242

On Wednesday 04 January 2012 23:11:31 Eddie Drapkin wrote:
> Is there any way to disable the URL filter plugins in the parse step?  I
> want to only filter at fetch generate time, but it seems that the parse
> process runs the segment through the URL filters I have setup.  I can't
> even find where this happens in the code, or I'd just change it myself.
> Where is this behavior happening and how can I change it?
> 
> Thanks,
> Eddie

-- 
Markus Jelsma - CTO - Openindex
