Sebastian,

Does the order of the normalize and filter calls matter? Currently, Nutch normalizes first and then filters. What if we reversed that and filtered first, then normalized? If we have very long URLs, could they be what kills the normalize step?

Thanks,
Erol Akarsu
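To make the question concrete, here is a minimal sketch of the two orderings. It does not use the Nutch API: `normalize`, `filter`, and `MAX_LENGTH` are hypothetical stand-ins (a normalizer that strips the fragment, and a filter that rejects long URLs by returning null), chosen only to show that the order can change both the result and how much work the normalizer sees.

```java
public class UrlPipelineOrder {
    // Hypothetical length limit, used only in this sketch.
    static final int MAX_LENGTH = 40;

    // Stand-in normalizer (not Nutch's): drops everything from '#' onward.
    static String normalize(String url) {
        int hash = url.indexOf('#');
        return hash >= 0 ? url.substring(0, hash) : url;
    }

    // Stand-in filter (not Nutch's): returns null to reject an overlong URL.
    static String filter(String url) {
        return url.length() <= MAX_LENGTH ? url : null;
    }

    // Order described in the post as the current one: normalize, then filter.
    static String normalizeThenFilter(String url) {
        return filter(normalize(url));
    }

    // Reversed order: the normalizer never runs on a rejected URL.
    static String filterThenNormalize(String url) {
        String kept = filter(url);
        return kept == null ? null : normalize(kept);
    }

    public static void main(String[] args) {
        // Build a URL whose fragment makes it far longer than MAX_LENGTH.
        StringBuilder sb = new StringBuilder("http://example.com/page#");
        for (int i = 0; i < 100; i++) {
            sb.append('x');
        }
        String url = sb.toString();

        System.out.println("normalize then filter: " + normalizeThenFilter(url));
        System.out.println("filter then normalize: " + filterThenNormalize(url));
    }
}
```

With these stand-ins, filtering first means the normalizer never touches an overlong URL, but it also rejects URLs that normalization would have shortened into acceptable ones. Whether that trade-off is acceptable depends on the filters and normalizers actually configured.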

