We have once combined the two filters to one.

Have a look at: http://nutch.eventax.com/
PrefixB4URLFilter

This filter was developed for the old nutch version.
So you might have to modify it to the new apache code.

Matthias


Zhou LiBing schrieb:
> nutch0.7dev accepts only one filter??
> if I should use one more filter ,what should I do?
> thank you !
> 
> 
>  On 5/11/05, Matthias Jaekle <[EMAIL PROTECTED]> wrote: 
> 
>>>+\.(pdf|rtf|xls|doc|txt|htm|html)$
>>
>>This line accepts any url with one of this endings.
>>No further testing is done. So you should remove this line.
>>
>>
>>>+^http://([a-z0-9]*\.)*linux62.org/ <http://linux62.org/>
>>
>>Try :
>>+^http\:\/\/([a-z0-9]*\.)*linux62.org\/
>>
>>Matthias
>>
>>--
>>http://www.eventax.com - eventax GmbH
>>http://www.umkreisfinder.de - Die Suchmaschine für Lokales und Events
>>
>>-------------------------------------------------------
>>This SF.Net <http://SF.Net> email is sponsored by Oracle Space Sweepstakes
>>Want to be the first software developer in space?
>>Enter now for the Oracle Space Sweepstakes!
>>http://ads.osdn.com/?ad_id=7393&alloc_id=16281&op=click
>>_______________________________________________
>>Nutch-developers mailing list
>>[email protected]
>>https://lists.sourceforge.net/lists/listinfo/nutch-developers
>>
> 
> 
> 
> 

-- 
http://www.eventax.com - eventax GmbH
http://www.umkreisfinder.de - Die Suchmaschine für Lokales und Events

Reply via email to