Hi, It is more likely to be the https://issues.apache.org/jira/browse/NUTCH-1640 which is fixed in the trunk. Note that the all in one crawl command has been deprecated and won't be found in the trunk. Either apply the patch to the 1.7 code or call the commands separately. We planned to backport the REST API from Nutch 2.x to 1.x in https://issues.apache.org/jira/browse/NUTCH-1040 , the code in 2.x could be a good starting point.
Julien On 16 December 2013 20:30, yann <[email protected]> wrote: > Hi Tejas, > > the PermGen space eventually fills out, but I don't have any other clue at > this point. > > Wondering if it might be related to this: > https://issues.apache.org/jira/browse/NUTCH-356 ? > > Yann > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Memory-leak-when-crawling-repeatedly-tp4106960p4106996.html > Sent from the Nutch - User mailing list archive at Nabble.com. > -- Open Source Solutions for Text Engineering http://digitalpebble.blogspot.com/ http://www.digitalpebble.com http://twitter.com/digitalpebble

