You can override default settings (nutch-default.xml) in nutch-site.xml; but
it won't help with spacing; empty file is Ok.

"merge" may generate temporary files, but 50Gb against 2Gb looks extremely
strange; try to empty recycle bin for instance... check disk swap... OS may
report 50G available but you may be out of space... for instance heavy disk
swap during merge due to low RAM...



-Fuad
http://www.linkedin.com/in/liferay
http://www.tokenizer.org


-----Original Message-----
From: [email protected] [mailto:[email protected]] 
Sent: August-26-09 5:33 PM
To: [email protected]
Subject: content of hadoop-site.xml

Hello,

?I have run merge script? to merge two crawl dirs, one 1.6G another 120MB.
But my MacPro with 50G free space did not start, after merge crashed with no
space error. I have been told that OSX got corrupted. 
I looked inside my nutch-1.0/conf/hadoop-site.xml file and it is empty. Can
anyone let me know what must be put inside this file in order for merge not
to take too much space.

Thanks in advance.
Alex.


Reply via email to