Hi,Olive.
It is more stable. I spared one week on learning 0.8's conception. But, unfortunately rolled back to 0.7.1 version. The only thing I needed in 0.8 is SWF Parser. > Thank you so much for your reply! > I just sent another message - because I am having other issues with 0.8 and > somehow the > TOTAL urls is always 1 when I search big sites such as www.yahoo.com. I > thought 0.7.1 might > be more stable? > THe stats: > 060308 064418 Client connection to 9.2.13.8:8010 : starting > 060308 064418 Client connection to 9.2.13.8:8009: starting > 060308 064418 parsing file:/root/nutch/conf/nutch-default.xml > 060308 064418 parsing file:/root/nutch/conf/nutch- site.xml > 060308 064419 Running job: job_ljydgp > 060308 064420 map 0% > 060308 064427 map 100% > 060308 064433 reduce 100% > 060308 064433 Job complete: job_ljydgp > 060308 064434 parsing file:/root/nutch/conf/nutch- default.xml > 060308 064434 parsing file:/root/nutch/conf/nutch-site.xml > 060308 064436 Statistics for CrawlDb: > /user/root/crawl-20060307224144/crawldb > 060308 064436 TOTAL urls: 1 > 060308 064436 avg score: 1.0 > 060308 064436 max score: 1.0 > 060308 064436 min score: 1.0 > 060308 064436 retry 0: 1 > 060308 064436 status 2 (DB_fetched): 1 > 060308 064437 CrawlDb statistics: done >>From: Stefan Groschupf <[EMAIL PROTECTED]> >>Reply-To: [email protected] >>To: [email protected] >>Subject: Re: help - distributed crawl in 0.7.1 >>Date: Wed, 8 Mar 2006 17:51:11 +0100 >>MIME-Version: 1.0 (Apple Message framework v746.2) >>Received: from mail.apache.org ([209.237.227.199]) by >>bay0-mc7-f18.bay0.hotmail.com with Microsoft SMTPSVC(6.0.3790.211); Wed, 8 >>Mar 2006 08:51:36 -0800 >>Received: (qmail 65663 invoked by uid 500); 8 Mar 2006 16:51:35 -0000 >>Received: (qmail 65652 invoked by uid 99); 8 Mar 2006 16:51:35 -0000 >>Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by >>apache.org (qpsmtpd/0.29) with ESMTP; Wed, 08 Mar 2006 08:51:35 -0800 >>Received: pass (asf.osuosl.org: local policy) >>Received: from [212.122.60.61] (HELO mslinux.media-style.com) >>(212.122.60.61) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 08 Mar >>2006 08:51:32 -0800 >>Received: from localhost (localhost [127.0.0.1])by mslinux.media-style.com >>(Postfix) with ESMTP id 21540144450for >><[email protected]>; Wed, >> 8 Mar 2006 17:43:21 +0100 (CET) >>Received: from mslinux.media-style.com ([127.0.0.1])by localhost >>(mslinux.media-style.com [127.0.0.1]) (amavisd-new, port 10024)with ESMTP >>id 18258-01 for <[email protected]>;Wed, 8 Mar 2006 17:43:20 >>+0100 (CET) >>Received: from [192.168.200.39] (unknown [212.122.60.61])by >>mslinux.media-style.com (Postfix) with ESMTP id D81A1144417for >><[email protected]>; Wed, 8 Mar 2006 17:43:20 +0100 (CET) >>X-Message-Info: JGTYoYF78jEHjJx36Oi8+Z3TmmkSEdPtfpLB7P/ybN8= >>Mailing-List: contact [EMAIL PROTECTED]; run by ezmlm >>Precedence: bulk >>List-Help: <mailto:[EMAIL PROTECTED]> >>List-Unsubscribe: <mailto:[EMAIL PROTECTED]> >>List-Post: <mailto:[email protected]> >>List-Id: <nutch-user.lucene.apache.org> >>Delivered-To: mailing list [email protected] >>X-ASF-Spam-Status: No, hits=0.0 required=10.0tests=HTML_MESSAGE >>X-Spam-Check-By: apache.org >>References: <[EMAIL PROTECTED]> >>X-Mailer: Apple Mail (2.746.2) >>X-Virus-Scanned: by amavisd-new-20030616-p10 (Debian) at media-style.com >>X-Virus-Checked: Checked by ClamAV on apache.org >>Return-Path: >>[EMAIL PROTECTED] >>X-OriginalArrivalTime: 08 Mar 2006 16:51:36.0503 (UTC) >>FILETIME=[901C1C70:01C642D0] >> >>Better you use nutch .8 to run a crawl using several machines. >>There is some documentation in the wiki now. >> >>Am 08.03.2006 um 17:49 schrieb Olive g: >> >>>Hi I am new here. >>>Could someone please let me know the step-by-step instructions to set up >>>distributed crawl in 0.7.1? >>>Thank you. >>> >>>_________________________________________________________________ >>>Is your PC infected? Get a FREE online computer virus scan from McAfee® >>>Security. http://clinic.mcafee.com/clinic/ibuy/campaign.asp? cid=3963 >>> >>> >> >>--------------------------------------------------------------- >>company: http://www.media-style.com >>forum: http://www.text-mining.org >>blog: http://www.find23.net >> >> > _________________________________________________________________ > On the road to retirement? Check out MSN Life Events for advice on how to > get there! http://lifeevents.msn.com/category.aspx?cid=Retirement > __________ NOD32 1.1434 (20060308) Information __________ > This message was checked by NOD32 antivirus system. > http://www.eset.com -- Regards, Dima mailto:[EMAIL PROTECTED] ------------------------------------------------------- This SF.Net email is sponsored by xPML, a groundbreaking scripting language that extends applications into web and mobile media. Attend the live webcast and join the prime developer group breaking into this new coding territory! http://sel.as-us.falkag.net/sel?cmd=lnk&kid0944&bid$1720&dat1642 _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
