Hi ,

I was getting this error
"
Exception in thread "main" java.lang.NoClassDefFoundError:
org/cyberneko/html/parsers/DOMFragmentParser
 at org.apache.nutch.parse.html.HtmlParser.parseNeko(HtmlParser.java:255)
 at org.apache.nutch.parse.html.HtmlParser.parse(HtmlParser.java:238)
....."

and then I applied this patch .
https://issues.apache.org/jira/browse/NUTCH-1253

And even after the patch I am still getting this error

Exception in thread "main" java.lang.NoClassDefFoundError:
org/cyberneko/html/parsers/DOMFragmentParser
    at org.apache.nutch.parse.html.HtmlParser.parseNeko(HtmlParser.java:257)
    at org.apache.nutch.parse.html.HtmlParser.parse(HtmlParser.java:238)
    at org.apache.nutch.parse.html.HtmlParser.getParse(HtmlParser.java:173)
    at org.apache.nutch.parse.ParseUtil.parse(ParseUtil.java:131)
    at org.apache.nutch.parse.ParserChecker.run(ParserChecker.java:146)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at org.apache.nutch.parse.ParserChecker.main(ParserChecker.java:197)
Caused by: java.lang.ClassNotFoundException:
org.cyberneko.html.parsers.DOMFragmentParser

Any idea how to resolve this issue ?

Thanks,
Tony.

Reply via email to