If you use the crawl command, you need to set up crawl-urlfilter.txt for your
site (not regex-urlfilter.txt, which the crawl command does not read).
A depth of 200 is huge. Be careful: depth actually means the number of
fetch/parse/index cycles, not how deep the link structure goes.
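
For illustration, here is a minimal sketch of what conf/crawl-urlfilter.txt
could look like. It assumes the site lives under example.com, which is only a
placeholder for your own domain; the other rules follow the stock file shipped
with Nutch as far as I remember, so compare them with your own copy.

```
# skip file:, ftp:, and mailto: URLs
-^(file|ftp|mailto):

# skip URLs that look like queries (this also drops links with "?" or "=")
-[?*!@=]

# accept everything on the site being crawled (example.com is a placeholder)
+^http://([a-z0-9]*\.)*example.com/

# skip everything else
-.
```

Rules are applied top to bottom and the first match wins. If the "+" line does
not match your seed URL's host, or an earlier "-" rule catches the links on
your start page, the crawl will stop at depth=1 with no more URLs to fetch.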

2010/1/23, Lyndon Maydwell <maydw...@gmail.com>:
> Have you set up your regex-urlfilter.txt correctly? I've been caught
> out by this before.
>
> On Sat, Jan 23, 2010 at 4:31 PM, zud <praveenmotur...@gmail.com> wrote:
>>
>> hi
>>
>>   i am running nutch crawl and i have specified the depth as 200 but in the
>> console it is showing  Stopping at depth=1 - no more URLs to fetch.
>>
>> it is not crawling the website completely
>>  even i cant find the errors in log file
>>
>> please help me out
>>
>> --
>> View this message in context:
>> http://old.nabble.com/Crawl-depth-problem-tp27284306p27284306.html
>> Sent from the Nutch - User mailing list archive at Nabble.com.
>>
>>
>


-- 
-MilleBii-
