I think, as of 1.12, there is a parameter to disable the robots check, i am not 
sure. Check nutch-default, it might be there.
M.

 
 
-----Original message-----
> From:Nestor <[email protected]>
> Sent: Wednesday 5th October 2016 0:05
> To: [email protected]
> Subject: Re: crawling a subfolder
> 
> OK, Thanks for your help
> I found out that part of my problem was that there was a robots.txt that
> would not allow me to  crawl my site.
> The lessons and gotchas of learning nutch
> 
> 
> 
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/crawling-a-subfolder-tp4299300p4299593.html
> Sent from the Nutch - User mailing list archive at Nabble.com.
> 

Reply via email to