Hi,
I don't think its a problem of disc capacity since I am working on huge Disk
and only 10% is used,
What I decide to do is to split the seed into two part and see if I still
get this problem so one half ended succesfuly but the second had the same
problem so I continue with teh spliting,
I tart with group of 80,000 URL and now I have a group of 5000 that when I
Run them have this problem, I am continuing with this problem till I'll find
the smallest group that has this problem and let you know about the seed.
Thanks,
Rafit
From: Ken Krugler <[EMAIL PROTECTED]>
Reply-To: [email protected]
To: [email protected]
Subject: Re: Problems with MapRed-
Date: Sun, 29 Jan 2006 16:42:15 -0800
This looks like the namenode has lost connection to one of the datanodes.
The default number of replications in ndfs is 3 and it seems like the
namenode has only 2 in its list so it logs this warning. As Stefan
suggested, you should check the diskspace on your machines. If I recall
correctly datanodes crash when they run out of diskspace.
This could also explain your problem with the fetching. One datanode runs
out of diskspace and crashes while one of the tasktrackers is writing data
to it. You should also check if the partition with /tmp has enough free
space.
Yes, that also can happen.
Especially if, like us, you accidentally configure Nutch to use a directory
on the root volume, but your servers have been configured with a separate
filesystem for /data, and that's where all the disk capacity is located.
-- Ken
Stefan Groschupf schrieb:
may the hdds are full?
try:
bin/nutch ndfs -report
Nutch generates some temporarily data until processing.
Am 30.01.2006 um 00:54 schrieb Mike Smith:
I forgot to mention the namenode log file gives me thousands of these:
060129 155553 Zero targets found,
forbidden1.size=2allowSameHostTargets=false
forbidden2.size()=0
060129 155553 Zero targets found,
forbidden1.size=2allowSameHostTargets=false
forbidden2.size()=0
--
Ken Krugler
Krugle, Inc.
+1 530-470-9200
_________________________________________________________________
Express yourself instantly with MSN Messenger! Download today it's FREE!
http://messenger.msn.click-url.com/go/onm00200471ave/direct/01/
-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems? Stop! Download the new AJAX search engine that makes
searching your log files as easy as surfing the web. DOWNLOAD SPLUNK!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general