[ http://issues.apache.org/jira/browse/NUTCH-35?page=comments#action_62762 ]

Doug Cutting commented on NUTCH-35:
-----------------------------------

TestFetcher is still failing for me with this patch:

fetch of http://sourceforge.net/projects/nutch/ failed with: java.lang.NoClassDefFoundError: org/cyberneko/html/parsers/DOMParser
stack
java.lang.NoClassDefFoundError: org/cyberneko/html/parsers/DOMParser
        at org.apache.nutch.parse.html.HtmlParser.getParse(HtmlParser.java:98)
        at org.apache.nutch.fetcher.Fetcher$FetcherThread.handleFetch(Fetcher.java:190)
        at org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:122)

> modify XML parsing code in Nutch to use single API
> --------------------------------------------------
>
>          Key: NUTCH-35
>          URL: http://issues.apache.org/jira/browse/NUTCH-35
>      Project: Nutch
>         Type: Improvement
>  Environment: nutch target platform (UNIX/Linux/Mac OSX/Windows/etc.)
>     Reporter: Chris A. Mattmann
>     Assignee: Stefan Grroschupf
>     Priority: Minor
>  Attachments: xmlApiPatchII.patch, xml_API_patch.txt
>
> Nutch uses more than a single XML reading API to parse XML configuration
> files and perform other such XML operations. The goal of this particular
> reported issue would be to modify the Nutch XML parsing functionality to use
> a single XML reading and writing API. As suggested by Doug, it would be nice
> if the XML reading/writing could be done through the native Java classes
> provided with the JDK. I can get this done by the end of the month if it's
> assigned to me.
> Thanks!
> --Chris
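
For illustration only, here is a minimal sketch of what "XML reading through the native Java classes provided with the JDK" could look like, using the standard JAXP/DOM API (javax.xml.parsers and org.w3c.dom). It is not taken from either attached patch. The class name JdkXmlConfigSketch, the method readProperties, the <nutch-conf> root element, and the property/name/value layout are all assumptions made for the example.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical class: reads name/value pairs with JDK-only XML classes,
// so no third-party parser has to be on the classpath.
public class JdkXmlConfigSketch {

    // Assumed layout: <property><name>..</name><value>..</value></property>
    public static Map<String, String> readProperties(String xml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        DocumentBuilder builder = factory.newDocumentBuilder();
        Document doc = builder.parse(
            new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));

        // LinkedHashMap keeps the properties in document order.
        Map<String, String> props = new LinkedHashMap<>();
        NodeList nodes = doc.getElementsByTagName("property");
        for (int i = 0; i < nodes.getLength(); i++) {
            Element prop = (Element) nodes.item(i);
            String name = childText(prop, "name");
            if (name != null) {
                // A property without a <value> element maps to null.
                props.put(name, childText(prop, "value"));
            }
        }
        return props;
    }

    // Returns the trimmed text of the first matching descendant, or null.
    private static String childText(Element parent, String tag) {
        NodeList list = parent.getElementsByTagName(tag);
        return list.getLength() == 0 ? null : list.item(0).getTextContent().trim();
    }

    public static void main(String[] args) throws Exception {
        // Sample document; the property name and value are made up.
        String xml = "<nutch-conf>"
            + "<property><name>http.agent.name</name><value>NutchCVS</value></property>"
            + "</nutch-conf>";
        System.out.println(readProperties(xml));
    }
}
```

A sketch like this only covers configuration-style XML. HTML parsing in HtmlParser is a separate concern, which is where the org.cyberneko DOMParser class in the stack trace above comes in.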
