Michael,

Thanks for the reply.  I guess what I'm really asking is: how do I
crawl more than just the home page of a site?  Looking at
nutch-default.xml, there is a property named db.ignore.internal.links, so
do I just set it to false to get a deeper crawl?

Thanks for any advice.
Paul.

-----Original Message-----
From: Michael Nebel [mailto:[EMAIL PROTECTED] 
Sent: 14 September 2005 10:05
To: [email protected]
Subject: Re: Whole web search depth

Hi Paul,

Just call the "generate - fetch - updatedb" loop as often as you want.
:-)

Perhaps the parameter "depth" is the wrong name and causes the
confusion. Depth does not mean that the crawler follows one link to a
depth of x and then takes the next link. Depth means the number of
times the loop "generate - fetch - updatedb" is run. Just take a look
at the output of the crawl.  The result of running the loop x times is
(or should be) the same as if you followed one link to a depth of x!
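
To make the loop idea concrete, here is a small toy model in Python. It is
only a sketch and not Nutch code: the link graph LINKS, the page names and
the crawl() function are all made up for illustration. Each pass fetches
every known page that has not been fetched yet, so pass n reaches the pages
that are n-1 links away from the seed.

```python
# Toy model of the "generate - fetch - updatedb" loop (NOT Nutch code).
# LINKS is a hypothetical link graph: page -> pages it links to.
LINKS = {
    "home": ["about", "products"],
    "about": ["team"],
    "products": ["widget", "home"],
    "team": [],
    "widget": ["manual"],
    "manual": [],
}


def crawl(seeds, rounds):
    """Run the loop `rounds` times; return the pages fetched in each round."""
    db = set(seeds)      # every URL known so far
    fetched = set()      # URLs that have already been fetched
    history = []
    for _ in range(rounds):
        # generate: select every known URL that has not been fetched yet
        segment = sorted(db - fetched)
        # fetch: "download" the segment and collect its outlinks
        outlinks = [link for page in segment for link in LINKS.get(page, [])]
        fetched.update(segment)
        # updatedb: add the newly discovered URLs to the database
        db.update(outlinks)
        history.append(segment)
    return history


if __name__ == "__main__":
    for number, segment in enumerate(crawl(["home"], 3), start=1):
        print(number, segment)
```

With the seed "home" and three passes, the first pass fetches only the home
page, the second fetches the pages it links to, and the third fetches the
pages those link to. So a crawl that stops at the home page has most likely
run the loop only once.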

Regards

        Michael

Paul Williams wrote:

> Hi,
>
> I'm fairly new to using Nutch and so this is probably a newbie question
> (I've already looked in the mailing lists and can't see an answer).
>
> I'm trying to do a web search (limited to around 10 sites at the moment)
> but I'm unsure on how to set the depth of searching.  How is this done?
>
> Cheers.


-- 
Michael Nebel
http://www.nebel.de/
http://www.netluchs.de/




