Hi, When running in local mode, the 'mapred child opts' property is not used, since this is only for spawned map and reduce tasks on a deployed cluster.
You want to set the NUTCH_HEAPSIZE enviromental variable. (i.e. "export NUTCH_HEAPSIZE=2000") Please see the 'nutch' script for more details. It is the Nutch equivalent to Hadoop's HADOOP_HEAPSIZE. Ferdy On Mon, Feb 20, 2012 at 11:10 AM, Michael Kazekin < [email protected]> wrote: > Hi! > > I've had a > > 2012-02-18 13:11:08,347 WARN mapred.LocalJobRunner - job_local_0001 > java.lang.OutOfMemoryError: Java heap space > at org.apache.hadoop.mapred.**IFile$Reader.readNextBlock(** > IFile.java:342) > at org.apache.hadoop.mapred.**IFile$Reader.next(IFile.java:**404) > at org.apache.hadoop.mapred.**Merger$Segment.next(Merger.**java:220) > at org.apache.hadoop.mapred.**Merger$MergeQueue.** > adjustPriorityQueue(Merger.**java:330) > at org.apache.hadoop.mapred.**Merger$MergeQueue.next(Merger.**java:350) > at org.apache.hadoop.mapred.**Merger.writeFile(Merger.java:**156) > at org.apache.hadoop.mapred.**MapTask$MapOutputBuffer.** > mergeParts(MapTask.java:1535) > at org.apache.hadoop.mapred.**MapTask$MapOutputBuffer.flush(** > MapTask.java:1154) > at org.apache.hadoop.mapred.**MapTask.runOldMapper(MapTask.**java:359) > at org.apache.hadoop.mapred.**MapTask.run(MapTask.java:307) > at org.apache.hadoop.mapred.**LocalJobRunner$Job.run(** > LocalJobRunner.java:177) > 2012-02-18 13:11:08,603 ERROR fetcher.Fetcher - Fetcher: > java.io.IOException: Job failed! > at org.apache.hadoop.mapred.**JobClient.runJob(JobClient.**java:1252) > at org.apache.nutch.fetcher.**Fetcher.fetch(Fetcher.java:**1204) > at org.apache.nutch.fetcher.**Fetcher.run(Fetcher.java:1240) > at org.apache.hadoop.util.**ToolRunner.run(ToolRunner.**java:65) > at org.apache.nutch.fetcher.**Fetcher.main(Fetcher.java:**1213) > > > on ~175K links. > > AFAIK, I have two options: 1) increase a number of segments through > bin/nutch generate tool, and 2) increase heap memory through > "mapred.map.child.java.opts" parameter > > Can I do both of them in 'local' mode? > > If not, could you please tell me how could I do it? > >

