Hi all. Tejas. Im tring changing nutch to 1.5.1 and not use 1.4 anymore for images, i need a explanation about how function url filters in nutch and how avoid colisions betwen rules in regex urlfilter files.
----- Mensaje original ----- De: "Eyeris Rodriguez Rueda" <[email protected]> Para: [email protected] Enviados: Jueves, 7 de Marzo 2013 9:31:22 Asunto: Re: image crawling with nutch Thanks tejas for yor reply, last month i was asking about a similar topic and you anwer me a recomendation that i implemented in regex-urlfilter.txt as you can see, i have tried to crawl only image(+\.(gif|GIF|jpg|JPG|png|PNG|ico|ICO|bmp|BMP)$) but nutch is telling me that no url to fetch and I don“t understand why is hapenning

