Thanks lewis by your answer. My doubt is why /tmp is increasing while crawl process is doing, and why nutch use that folder. Im using nutch 1.5.1 in single mode and my nutch site not have properties hadoop.tmp.dir. I need reduce the space used for that folder because I only have 40 GB for nutch machine and 50 GB for solr machine. Please some advice or explanation will be accepted. Thanks for your time.
----- Mensaje original ----- De: "Lewis John Mcgibbney" <[email protected]> Para: [email protected] Enviados: Jueves, 7 de Febrero 2013 13:06:11 Asunto: Re: Could not find any valid local directory for output/file.out Hi, https://wiki.apache.org/nutch/NutchGotchas#DiskErrorException_while_fetching On Thursday, February 7, 2013, Eyeris Rodriguez Rueda <[email protected]> wrote: > Hi all. > I have a problem when i do a crawl for few hour or days, im using nutch 1.5.1 and solr 3.6, but the crawl process fails and i dont know how to fix this problem, im intersted in make a crawl process without limit with 10 cicles or more but i have problem with space on hard disk, i have detected that /etc/tmp have 29 GB used and is not good for me, any body can help me or give some advices for configure nutch to make at least one crawl process without problems ? > > here some features of my environment > Ram 2 GB > CPU:QuadCore(but im using only 2 cores) > Hard Disk:40 GB > Threads:50 > db.fetch.interval.default=2 days > > > > this is a part of my log file when nutch fails: > > **************************************************************** > 2013-02-06 18:45:25,961 INFO fetcher.Fetcher - fetching http://bibliodoc.uci.cu/TD/TD_03349_10.pdf > 2013-02-06 18:45:25,964 INFO fetcher.Fetcher - fetching http://bibliodoc.uci.cu/TD/TD_0442_07.pdf > 2013-02-06 18:45:25,977 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=49 > 2013-02-06 18:45:26,109 INFO fetcher.Fetcher - -activeThreads=49, spinWaiting=39, fetchQueues.totalSize=0 > 2013-02-06 18:45:26,180 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=48 > 2013-02-06 18:45:26,331 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=47 > 2013-02-06 18:45:26,331 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=46 > 2013-02-06 18:45:26,331 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=44 > 2013-02-06 18:45:26,331 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=45 > 2013-02-06 18:45:26,331 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=40 > 2013-02-06 18:45:26,331 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=39 > 2013-02-06 18:45:26,332 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=38 > 2013-02-06 18:45:26,332 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=37 > 2013-02-06 18:45:26,332 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=36 > 2013-02-06 18:45:26,332 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=35 > 2013-02-06 18:45:26,332 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=34 > 2013-02-06 18:45:26,333 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=33 > 2013-02-06 18:45:26,333 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=32 > 2013-02-06 18:45:26,333 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=31 > 2013-02-06 18:45:26,333 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=30 > 2013-02-06 18:45:26,333 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=29 > 2013-02-06 18:45:26,333 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=28 > 2013-02-06 18:45:26,334 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=27 > 2013-02-06 18:45:26,334 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=26 > 2013-02-06 18:45:26,334 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=25 > 2013-02-06 18:45:26,334 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=24 > 2013-02-06 18:45:26,334 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=23 > 2013-02-06 18:45:26,335 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=22 > 2013-02-06 18:45:26,335 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=21 > 2013-02-06 18:45:26,335 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=20 > 2013-02-06 18:45:26,335 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=19 > 2013-02-06 18:45:26,335 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=18 > 2013-02-06 18:45:26,331 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=41 > 2013-02-06 18:45:26,336 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=17 > 2013-02-06 18:45:26,336 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=15 > 2013-02-06 18:45:26,336 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=13 > 2013-02-06 18:45:26,336 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=12 > 2013-02-06 18:45:26,331 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=42 > 2013-02-06 18:45:26,331 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=43 > 2013-02-06 18:45:26,336 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=9 > 2013-02-06 18:45:26,336 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=10 > 2013-02-06 18:45:26,336 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=11 > 2013-02-06 18:45:26,336 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=14 > 2013-02-06 18:45:26,336 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=16 > 2013-02-06 18:45:26,404 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=8 > 2013-02-06 18:45:26,630 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=7 > 2013-02-06 18:45:27,069 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=6 > 2013-02-06 18:45:27,110 INFO fetcher.Fetcher - -activeThreads=6, spinWaiting=0, fetchQueues.totalSize=0 > 2013-02-06 18:45:27,129 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=5 > 2013-02-06 18:45:28,110 INFO fetcher.Fetcher - -activeThreads=5, spinWaiting=0, fetchQueues.totalSize=0 > 2013-02-06 18:45:28,502 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=4 > 2013-02-06 18:45:29,111 INFO fetcher.Fetcher - -activeThreads=4, spinWaiting=0, fetchQueues.totalSize=0 > 2013-02-06 18:45:30,123 INFO fetcher.Fetcher - -activeThreads=4, spinWaiting=0, fetchQueues.totalSize=0 > 2013-02-06 18:45:31,127 INFO fetcher.Fetcher - -activeThreads=4, spinWaiting=0, fetchQueues.totalSize=0 > 2013-02-06 18:45:31,187 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=3 > 2013-02-06 18:45:32,171 INFO fetcher.Fetcher - -activeThreads=3, spinWaiting=0, fetchQueues.totalSize=0 > 2013-02-06 18:45:32,206 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=2 > 2013-02-06 18:45:33,173 INFO fetcher.Fetcher - -activeThreads=2, spinWaiting=0, fetchQueues.totalSize=0 > 2013-02-06 18:45:34,173 INFO fetcher.Fetcher - -activeThreads=2, spinWaiting=0, fetchQueues.totalSize=0 > 2013-02-06 18:45:34,205 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=1 > 2013-02-06 18:45:34,457 INFO fetcher.Fetcher - -finishing thread FetcherThread, activeThreads=0 > 2013-02-06 18:45:35,174 INFO fetcher.Fetcher - -activeThreads=0, spinWaiting=0, fetchQueues.totalSize=0 > 2013-02-06 18:45:35,174 INFO fetcher.Fetcher - -activeThreads=0 > 2013-02-06 18:45:35,742 WARN mapred.LocalJobRunner - job_local_0015 > org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any valid local directory for output/file.out > at org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:381) > at org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:146) > at org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:127) > at org.apache.hadoop.mapred.MapOutputFile.getOutputFileForWrite(MapOutputFile.java:69) > at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.mergeParts(MapTask.java:1640) > at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.flush(MapTask.java:1323) > at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:437) > at org.apache.hadoop.mapred.MapTask.run(MapTask.java:372) > at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212) > -- *Lewis*

