There is an open issue
(NUTCH-817 <https://issues.apache.org/jira/browse/NUTCH-817>)
that may be related to your problem.

2010/7/16 jeff-4 [via Lucene] <[email protected]>

> I did check. Nutch 1.0 crawled over 300 links, while Nutch 1.1 crawled only 2.
>
> On Fri, 2010-07-16 at 14:21 +0800, xiao yang wrote:
>
> > You can use "bin/nutch readdb crawl/crawldb -stats" to check the
> > number of pages each version crawled.
> >
> > On Fri, Jul 16, 2010 at 2:07 PM, jeff <[hidden email]> wrote:
> > > Hi,
> > >
> > > I am testing Nutch 1.1 with exactly the same configuration that I
> > > tested on Nutch 1.0. It took 1.0 a few hours to crawl the bestbuy
> > > site, while 1.1 takes only 2-3 minutes. Has anyone had a similar
> > > experience, or does anyone know why?
> > >
> > > Thanks.
> > >

-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Nutch-1-1-crawls-fewer-links-than-1-0-tp971589p976259.html
Sent from the Nutch - User mailing list archive at Nabble.com.
