The only difference from the previous configuration was that I enabled the
"js" parser. However, one crash happened on a PDF file, and I don't know
about the other one. Unfortunately, the URLs were not saved at the time.

-----Original Message-----
From: Byron Miller [mailto:[EMAIL PROTECTED] 
Sent: Friday, August 19, 2005 5:21 AM
To: [email protected]
Subject: RE: Nutch 0.7 released

Could be the language identifier process and such. There is a lot more
going on in 0.7 than in 0.6.

What do you have enabled in your plugins & parsers?

-----Original Message-----
From: "EM" <[EMAIL PROTECTED]>
To: <[email protected]>
Date: Fri, 19 Aug 2005 04:28:34 -0400
Subject: RE: Nutch 0.7 released

> I installed 0.7 and started spidering sites.
> 
> However, some fetching processes would block for a really long time
> (hours); this wasn't the case with 0.6 on the same set of sites.
> 
> I've saved a couple of ctrl-breaks so far (two stalled processes in
> 3 hours, which is a record). Would somebody be interested in looking
> into them and any new ones?
> 
> 
> Regards,
> EM
> 
> 
> 
> 
> -----Original Message-----
> From: Piotr Kosiorowski [mailto:[EMAIL PROTECTED] 
> Sent: Wednesday, August 17, 2005 8:14 AM
> To: [email protected]
> Subject: Nutch 0.7 released
> 
> Hi,
> A new Nutch release was prepared today. This is the first Nutch release
> as an Apache Lucene sub-project. You can download it from
> http://lucene.apache.org/nutch/release/nutch-0.7.tar.gz.
> 
> There was a package name change from net.nutch.* to org.apache.nutch.* 
> for this release, so local modifications and configuration files 
> containing class names may require an update.
> 
> Release numbers were created in JIRA too, so please use them when
> reporting a bug.
> 
> Regards,
> Piotr
> 




