160 MB list.

~~~~~~~~~~~~~~~~~~~~~
Daniel Clark, President
DAC Systems, Inc.
(703) 403-0340
~~~~~~~~~~~~~~~~~~~~~


-----Original Message-----
From: Ian Holsman [mailto:[EMAIL PROTECTED] 
Sent: Tuesday, October 02, 2007 7:51 PM
To: [email protected]
Subject: Re: Nutch Timeout

What do you consider 'huge'?


Daniel Clark wrote:
> I'm trying to index a huge URL list using nutch with hadoop on a two machine
> cluster.  It appears to be timing out at the injector.  I get the following
> error.  I run a smaller URL list with no problems.  Any help would be
> appreciated.
>
> Injector: starting
> Injector: crawlDb: crawl7/crawldb
> Injector: urlDir: urls/big_list.txt
> Injector: Converting injected urls to crawl db entries.
> task_0009_m_000001_0: log4j:ERROR setFile(null,true) call failed.
> task_0009_m_000001_0: java.io.FileNotFoundException: /home/d/daclark/nutch/search/nutch-0.9/logs (Is a directory)
> task_0009_m_000001_0:   at java.io.FileOutputStream.openAppend(Native Method)
> task_0009_m_000001_0:   at java.io.FileOutputStream.<init>(FileOutputStream.java:177)
> task_0009_m_000001_0:   at java.io.FileOutputStream.<init>(FileOutputStream.java:102)
> task_0009_m_000001_0:   at org.apache.log4j.FileAppender.setFile(FileAppender.java:289)
> task_0009_m_000001_0:   at org.apache.log4j.FileAppender.activateOptions(FileAppender.java:163)
> task_0009_m_000001_0:   at org.apache.log4j.DailyRollingFileAppender.activateOptions(DailyRollingFileAppender.java:215)
> task_0009_m_000001_0:   at org.apache.log4j.config.PropertySetter.activate(PropertySetter.java:256)
> task_0009_m_000001_0:   at org.apache.log4j.config.PropertySetter.setProperties(PropertySetter.java:132)
> task_0009_m_000001_0:   at org.apache.log4j.config.PropertySetter.setProperties(PropertySetter.java:96)
> task_0009_m_000001_0:   at org.apache.log4j.PropertyConfigurator.parseAppender(PropertyConfigurator.java:654)
> task_0009_m_000001_0:   at org.apache.log4j.PropertyConfigurator.parseCategory(PropertyConfigurator.java:612)

