I would recommend to use the domain-urlfilter, it is the most straightforward 
method of controlling the list of hosts in the crawldb.
M

 
 
-----Original message-----
> From:Shadi Saleh <[email protected]>
> Sent: Sunday 4th January 2015 16:23
> To: user <[email protected]>
> Subject: Depth option
> 
> Hello,
> 
> I want to check this point please.
> 
> I am using crawl to crawl www.example.com with depth =1 option, So if that
> website contains url to other website e.g. www.example2.com nutch will not
> crawl it , is it enogh to use depth option or should I use url filer?
> 
> 
> Best
> 
> 
> -- 
> 
> 
> 
> 
> *Shadi SalehPh.D StudentInstitute of Formal and Applied LinguisticsFaculty
> of Mathematics and Physics*
> *-Charles University in Prague*
> 
> *16017 Prague 6 - Czech Republic Mob +420773515578*
> 

Reply via email to