Sebastian Nagel created NUTCH-2781:
--------------------------------------
Summary: Increase default Java heap size
Key: NUTCH-2781
URL: https://issues.apache.org/jira/browse/NUTCH-2781
Project: Nutch
Issue Type: Improvement
Components: runtime
Affects Versions: 1.16
Reporter: Sebastian Nagel
Fix For: 1.17
The Nutch run script (bin/nutch) sets a "conservative" Java heap size of 1000
MB. This default was defined [15 years
ago|https://github.com/apache/nutch/blame/dcbb0f2bf450c6bec6f45125c68f5c7a0f061474/src/bin/nutch#L24].
It's probably safe to increase the heap size to a value suitable to process
more pages or larger documents. What about 4096 MB?
Note this overlaps with NUTCH-2501 (Java heap size defined via
mapred.child.java.opts in distributed mode).
--
This message was sent by Atlassian Jira
(v8.3.4#803005)