[ https://issues.apache.org/jira/browse/NUTCH-709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Julien Nioche updated NUTCH-709: -------------------------------- Attachment: JSParseFilter.error.patch This patch catches errors in the walk method of JSParser and returns instead of breaking the whole job > JSParseFilter gets into an infinate loop and ets all the stack > --------------------------------------------------------------- > > Key: NUTCH-709 > URL: https://issues.apache.org/jira/browse/NUTCH-709 > Project: Nutch > Issue Type: Bug > Components: fetcher > Affects Versions: 1.0.0 > Environment: Hadoop 0.19.0 running nutch trunk > Reporter: Tim Hawkins > Attachments: JSParseFilter.error.patch > > > When crawling pages with seperate fetch and parse, I see processes die > becuase of stack overflow. > Output is generaly. > java.lang.StackOverflowError > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:146) > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:148) > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:148) > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:148) > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:148) > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:148) > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:148) > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:148) > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:148) > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:148) > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:148) > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:148) > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:148) > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:148) > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:148) > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:148) > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:148) > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:148) > at org.apache.nutch.parse.js.JSParseFilter.walk(JSParseFilter.java:148) > Inspection of the code shows that this is a recursive call to walk(.....) -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.