We have once combined the two filters to one. Have a look at: http://nutch.eventax.com/ PrefixB4URLFilter
This filter was developed for the old nutch version. So you might have to modify it to the new apache code. Matthias Zhou LiBing schrieb: > nutch0.7dev accepts only one filter?? > if I should use one more filter ,what should I do? > thank you ! > > > On 5/11/05, Matthias Jaekle <[EMAIL PROTECTED]> wrote: > >>>+\.(pdf|rtf|xls|doc|txt|htm|html)$ >> >>This line accepts any url with one of this endings. >>No further testing is done. So you should remove this line. >> >> >>>+^http://([a-z0-9]*\.)*linux62.org/ <http://linux62.org/> >> >>Try : >>+^http\:\/\/([a-z0-9]*\.)*linux62.org\/ >> >>Matthias >> >>-- >>http://www.eventax.com - eventax GmbH >>http://www.umkreisfinder.de - Die Suchmaschine für Lokales und Events >> >>------------------------------------------------------- >>This SF.Net <http://SF.Net> email is sponsored by Oracle Space Sweepstakes >>Want to be the first software developer in space? >>Enter now for the Oracle Space Sweepstakes! >>http://ads.osdn.com/?ad_id=7393&alloc_id=16281&op=click >>_______________________________________________ >>Nutch-developers mailing list >>[email protected] >>https://lists.sourceforge.net/lists/listinfo/nutch-developers >> > > > > -- http://www.eventax.com - eventax GmbH http://www.umkreisfinder.de - Die Suchmaschine für Lokales und Events
