The only difference from the previous configuration was that I enabled the "js" parser. However, one crash happened on a PDF file, and I don't know about the other one. Unfortunately, the URLs were not saved at the time.
-----Original Message-----
From: Byron Miller [mailto:[EMAIL PROTECTED]
Sent: Friday, August 19, 2005 5:21 AM
To: [email protected]
Subject: RE: Nutch 0.7 released

Could be the language identifier process and such. There is a lot more going on in .7 than .6. What do you have enabled in your plugins & parsers?

-----Original Message-----
From: "EM" <[EMAIL PROTECTED]>
To: <[email protected]>
Date: Fri, 19 Aug 2005 04:28:34 -0400
Subject: RE: Nutch 0.7 released

> I installed 0.7 and started spidering sites.
>
> However, some fetching processes would block for a really long time
> (hours), this wasn't the case with 0.6 on the same set of sites.
>
> I've saved a couple ctrl-breaks so far (two stalled processes in 3
> hours, this is a record). Would somebody be interested looking into
> them and any new ones?
>
> Regards,
> EM
>
> -----Original Message-----
> From: Piotr Kosiorowski [mailto:[EMAIL PROTECTED]
> Sent: Wednesday, August 17, 2005 8:14 AM
> To: [email protected]
> Subject: Nutch 0.7 released
>
> Hi,
> New Nutch release was prepared today. This is the first Nutch release
> as an Apache Lucene sub-project. You can download it from
> http://lucene.apache.org/nutch/release/nutch-0.7.tar.gz.
>
> There was a package name change from net.nutch.* to org.apache.nutch.*
> for this release, so local modifications and configuration files
> containing class names may require an update.
>
> Release numbers were created in JIRA too, so please use them while
> reporting a bug.
>
> Regards,
> Piotr
