Sebastian Nagel created NUTCH-2781:
--------------------------------------

             Summary: Increase default Java heap size
                 Key: NUTCH-2781
                 URL: https://issues.apache.org/jira/browse/NUTCH-2781
             Project: Nutch
          Issue Type: Improvement
          Components: runtime
    Affects Versions: 1.16
            Reporter: Sebastian Nagel
             Fix For: 1.17


The Nutch run script (bin/nutch) sets a "conservative" Java heap size of 1000 MB.
This default was defined [15 years ago|https://github.com/apache/nutch/blame/dcbb0f2bf450c6bec6f45125c68f5c7a0f061474/src/bin/nutch#L24].
It's probably safe to increase the heap size to a value suitable for processing
more pages or larger documents. What about 4096 MB?
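
For illustration, a minimal sketch of how a run script could apply a larger
default while still allowing an override from the environment. This is not
the actual bin/nutch code: DEFAULT_HEAP_MB and resolve_heap_opt are
hypothetical names, and NUTCH_HEAPSIZE (in MB) is assumed to be the override
variable, so check it against your copy of the script.

```shell
#!/bin/sh
# Sketch only, not the real bin/nutch logic.

# Hypothetical name: the proposed default heap size in MB (currently 1000).
DEFAULT_HEAP_MB=4096

# Hypothetical helper: prints the JVM -Xmx option, preferring the value of
# NUTCH_HEAPSIZE (assumed override variable, in MB) when it is set and non-empty.
resolve_heap_opt() {
  heap_mb="${NUTCH_HEAPSIZE:-$DEFAULT_HEAP_MB}"
  printf '%s\n' "-Xmx${heap_mb}m"
}

JAVA_HEAP_MAX="$(resolve_heap_opt)"
echo "Java heap option: $JAVA_HEAP_MAX"
```

Under these assumptions, running with NUTCH_HEAPSIZE=2000 in the environment
would yield -Xmx2000m, while leaving it unset would fall back to the new
default.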

Note this overlaps with NUTCH-2501 (Java heap size defined via 
mapred.child.java.opts in distributed mode).



