RE: fetch depth

Arkadi.Kosmynin Mon, 19 Apr 2010 15:00:15 -0700

Hi Fernando

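Crawling is done in iterations. At each iteration, the next batch of URLs 
selected for fetching is fetched. It is normal that only your seed URLs are 
fetched in the first iteration. To reach deeper pages, repeat the generate, 
fetch and updatedb steps; each pass fetches the links discovered by the 
previous one. See an example of a crawling script here: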


http://wiki.apache.org/nutch/Crawl
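
As a rough illustration of the iterations described above, here is a minimal 
sketch, not the script from the wiki page. It assumes a Nutch 1.x command-line 
layout; the function name crawl_rounds, the NUTCH variable and the crawl/ and 
urls directory names are made up for the example. It also assumes the fetcher 
parses pages while fetching; if yours does not, a parse step would be needed 
before updatedb.

```shell
#!/bin/sh
# Sketch only: repeat generate/fetch/updatedb "depth" times.
# NUTCH and the directory layout are assumptions; adjust to your install.
NUTCH="${NUTCH:-bin/nutch}"

crawl_rounds() {
    crawl_dir="$1"   # e.g. crawl
    seed_dir="$2"    # e.g. urls (directory holding the seed list)
    depth="$3"       # number of generate/fetch/updatedb passes

    # Seed the crawldb once.
    "$NUTCH" inject "$crawl_dir/crawldb" "$seed_dir" || return 1

    i=1
    while [ "$i" -le "$depth" ]; do
        # Select the next batch of URLs into a new segment;
        # leave the loop if generate exits non-zero.
        "$NUTCH" generate "$crawl_dir/crawldb" "$crawl_dir/segments" || break

        # Segment names are timestamps, so the newest one sorts last.
        segment=$(ls -d "$crawl_dir"/segments/* | tail -1)

        "$NUTCH" fetch "$segment" || return 1

        # Feed newly discovered links back into the crawldb so the
        # next pass can select them.
        "$NUTCH" updatedb "$crawl_dir/crawldb" "$segment" || return 1

        i=$((i + 1))
    done

    # Build the link database over all segments, once, at the end.
    "$NUTCH" invertlinks "$crawl_dir/linkdb" -dir "$crawl_dir/segments"
}
```

Calling crawl_rounds crawl urls 3 would then correspond roughly to a depth of 
3. The index step is left out here and would run afterwards, as in your 
current process.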

Regards,

Arkadi

> -----Original Message-----
> From: Fernando Navarro [mailto:f.navarrolope...@gmail.com]
> Sent: Monday, 19 April 2010 6:38 PM
> To: nutch-user@lucene.apache.org
> Subject: fetch depth
> 
> Hello,
> 
> When I try to create a new segment and fetch it, only the front page is
> fetched. Everything else is unfetched.
> However, if I execute bin/nutch crawl, everything runs OK. I don't
> know how
> to set the depth value for a segment in an inject > generate >
> fetch > updatedb > invertlinks > index process.
> 
> Thank you for your help,
> 
> Fernando
