Re: tmp folder problem

2012-09-20 Thread Matteo Simoncini
Any advice? Matteo 2012/9/19 Matteo Simoncini sicc...@gmail.com Hi, I'm running Nutch 1.5.1 on a virtual machine to crawl a large number of URLs. I gave enough space to the crawl folder, the one where linkDB and crawlDB go, and to the Solr folder. It worked fine until 200,000 URLs, but now

Re: tmp folder problem

2012-09-20 Thread Sebastian Nagel
Hi Matteo, have a look at the property hadoop.tmp.dir, which lets you point the temp folder at another volume with more free space. For local crawls: - do not share this folder between two simultaneously running Nutch jobs - you have to clean up the temp folder, esp. after failed jobs (if
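
The clean-up step mentioned above could be sketched roughly as follows. This is only an illustration, not part of Nutch or Hadoop: the helper names, the 24-hour age threshold, and the example path are all assumptions, and it should only be run when no Nutch job is using the folder.

```python
import os
import shutil
import time


def free_gib(path):
    """Return the free space, in GiB, on the volume that holds `path`."""
    return shutil.disk_usage(path).free / 2**30


def clean_stale_entries(tmp_dir, max_age_hours=24.0, dry_run=True):
    """Remove top-level entries in tmp_dir not modified for max_age_hours.

    With dry_run=True nothing is deleted; the stale paths are only reported.
    Returns the sorted list of paths that were (or would be) removed.
    The age threshold is an assumption; pick one longer than your longest job.
    """
    cutoff = time.time() - max_age_hours * 3600
    stale = []
    for entry in os.scandir(tmp_dir):
        # Judge each entry by its own mtime, without following symlinks.
        if entry.stat(follow_symlinks=False).st_mtime < cutoff:
            stale.append(entry.path)
            if not dry_run:
                if entry.is_dir(follow_symlinks=False):
                    shutil.rmtree(entry.path)
                else:
                    os.remove(entry.path)
    return sorted(stale)
```

For example, with a hypothetical temp folder at /data/nutch-tmp, `clean_stale_entries("/data/nutch-tmp")` lists the leftovers first, and `clean_stale_entries("/data/nutch-tmp", dry_run=False)` deletes them; `free_gib("/data/nutch-tmp")` shows how much room the volume has before starting the next crawl.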

Re: tmp folder problem

2012-09-20 Thread Matteo Simoncini
Thanks, you really helped a lot. Matteo 2012/9/20 Sebastian Nagel wastl.na...@googlemail.com Hi Matteo, have a look at the property hadoop.tmp.dir which allows you to direct the temp folder to another volume with more space on it. For local crawls: - do not share this folder for two