Hi, Olive.

Yes, 0.7.1 is more stable.
I spent a week learning the concepts behind 0.8,
but unfortunately had to roll back to version 0.7.1.
The only thing I needed from 0.8 was the SWF parser.


> Thank you so much for your reply!
> I just sent another message because I am having other issues with 0.8, and
> somehow the TOTAL urls is always 1 when I search big sites such as
> www.yahoo.com. I thought 0.7.1 might be more stable?

> The stats:
> 060308 064418 Client connection to 9.2.13.8:8010 : starting
> 060308 064418 Client connection to 9.2.13.8:8009: starting
> 060308 064418 parsing file:/root/nutch/conf/nutch-default.xml
> 060308 064418 parsing file:/root/nutch/conf/nutch-site.xml
> 060308 064419 Running job: job_ljydgp
> 060308 064420  map 0%
> 060308 064427  map 100%
> 060308 064433  reduce 100%
> 060308 064433 Job complete: job_ljydgp
> 060308 064434 parsing file:/root/nutch/conf/nutch-default.xml
> 060308 064434 parsing file:/root/nutch/conf/nutch-site.xml
> 060308 064436 Statistics for CrawlDb: /user/root/crawl-20060307224144/crawldb
> 060308 064436 TOTAL urls:       1
> 060308 064436 avg score:        1.0
> 060308 064436 max score:        1.0
> 060308 064436 min score:        1.0
> 060308 064436 retry 0:  1
> 060308 064436 status 2 (DB_fetched):    1
> 060308 064437 CrawlDb statistics: done





>>From: Stefan Groschupf <[EMAIL PROTECTED]>
>>Reply-To: [email protected]
>>To: [email protected]
>>Subject: Re: help - distributed crawl in 0.7.1
>>Date: Wed, 8 Mar 2006 17:51:11 +0100
>>
>>You would be better off using Nutch 0.8 to run a crawl across several machines.
>>There is some documentation in the wiki now.
>>
>>Am 08.03.2006 um 17:49 schrieb Olive g:
>>
>>>Hi, I am new here.
>>>Could someone please give me step-by-step instructions for setting up a
>>>distributed crawl in 0.7.1?
>>>Thank you.
>>>
>>
>>---------------------------------------------------------------
>>company:        http://www.media-style.com
>>forum:        http://www.text-mining.org
>>blog:            http://www.find23.net
>>
>>





-- 
Regards,
 Dima                          mailto:[EMAIL PROTECTED]



_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
