Sebastian,

Does it matter if we reverse the order of the normalize and filter calls?
Currently, Nutch normalizes first and then filters.

What if we do the reverse: filter first and then normalize? Suppose we have very
long URLs; could they be what is killing the normalize step?
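
To make the question concrete, here is a rough sketch of the two orderings in
plain Java. It does not use the actual Nutch plugin APIs; the class name, the
512-character limit, and the fragment-stripping normalizer are all made up for
illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.Predicate;
import java.util.function.UnaryOperator;

// Illustrative only: a toy model of the two call orders, not Nutch code.
class UrlOrderSketch {
    // Hypothetical length limit, not a Nutch default.
    static final int MAX_URL_LENGTH = 512;

    // Hypothetical cheap filter: accept only URLs up to MAX_URL_LENGTH characters.
    static final Predicate<String> LENGTH_FILTER =
            url -> url.length() <= MAX_URL_LENGTH;

    // Hypothetical normalizer: drop the fragment ("#..." and everything after it).
    static final UnaryOperator<String> NORMALIZER = url -> {
        int hash = url.indexOf('#');
        return hash >= 0 ? url.substring(0, hash) : url;
    };

    // Current order as described above: normalize every URL, then filter.
    public static List<String> normalizeThenFilter(List<String> urls) {
        List<String> kept = new ArrayList<>();
        for (String url : urls) {
            String normalized = NORMALIZER.apply(url);
            if (LENGTH_FILTER.test(normalized)) {
                kept.add(normalized);
            }
        }
        return kept;
    }

    // Proposed order: filter first, so rejected URLs never reach the normalizer.
    public static List<String> filterThenNormalize(List<String> urls) {
        List<String> kept = new ArrayList<>();
        for (String url : urls) {
            if (LENGTH_FILTER.test(url)) {
                kept.add(NORMALIZER.apply(url));
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Second URL is only "too long" because of its 600-character fragment.
        String longFragment = new String(new char[600]).replace('\0', 'x');
        List<String> urls = Arrays.asList(
                "http://example.com/a#frag",
                "http://example.com/b#" + longFragment);

        System.out.println(normalizeThenFilter(urls));
        System.out.println(filterThenNormalize(urls));
    }
}
```

In this toy case the two orders do not give the same result: the second URL is
kept when normalizing first, because the normalizer shortens it, but dropped
when filtering first. On the other hand, filtering first means the normalizer
never sees the very long URL at all.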

Thanks

Erol Akarsu



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Parse-reduce-stage-take-forver-tp4072755p4072834.html
Sent from the Nutch - User mailing list archive at Nabble.com.
