[ http://issues.apache.org/jira/browse/NUTCH-35?page=comments#action_62762 ]

Doug Cutting commented on NUTCH-35:
-----------------------------------

TestFetcher is still failing for me with this patch:

fetch of http://sourceforge.net/projects/nutch/ failed with: java.lang.NoClassDefFoundError: org/cyberneko/html/parsers/DOMParser
stack
java.lang.NoClassDefFoundError: org/cyberneko/html/parsers/DOMParser
        at org.apache.nutch.parse.html.HtmlParser.getParse(HtmlParser.java:98)
        at org.apache.nutch.fetcher.Fetcher$FetcherThread.handleFetch(Fetcher.java:190)
        at org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:122)
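
A NoClassDefFoundError like the one above generally means the class was
not on the classpath when the test ran. As a rough way to confirm that
outside the fetcher, a small check along these lines could be run with the
same classpath as the test. The class name ClasspathCheck and the method
isLoadable are hypothetical helpers for illustration, not part of Nutch or
of the patch.

```java
public class ClasspathCheck {

    /** Returns true if the named class can be located by this class's class loader. */
    public static boolean isLoadable(String className) {
        try {
            // initialize=false: only locate and load the class, do not run its static initializers
            Class.forName(className, false, ClasspathCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            // LinkageError covers NoClassDefFoundError raised for missing dependencies
            return false;
        }
    }

    public static void main(String[] args) {
        // Class named in the stack trace (slashes replaced with dots)
        String name = "org.cyberneko.html.parsers.DOMParser";
        System.out.println(name + " loadable: " + isLoadable(name));
    }
}
```

If this reports false under the test classpath, the parser jar is probably
missing from it.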


> modify XML parsing code in Nutch to use single API
> --------------------------------------------------
>
>          Key: NUTCH-35
>          URL: http://issues.apache.org/jira/browse/NUTCH-35
>      Project: Nutch
>         Type: Improvement
>  Environment: nutch target platform (UNIX/Linux/Mac OSX/Windows/etc.)
>     Reporter: Chris A. Mattmann
>     Assignee: Stefan Groschupf
>     Priority: Minor
>  Attachments: xmlApiPatchII.patch, xml_API_patch.txt
>
> Nutch uses more than one XML reading API to parse XML configuration
> files and perform other XML operations. The goal of this issue is to
> modify the Nutch XML parsing functionality to use a single XML reading
> and writing API. As suggested by Doug, it would be nice if the XML
> reading/writing could be done through the native Java classes provided
> with the JDK. I can get this done by the end of the month if it's
> assigned to me.
> Thanks!
> --Chris
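
For reference, reading a configuration file with only the XML classes
bundled in the JDK (JAXP and the W3C DOM) might look roughly like the
sketch below. The element names property, name and value, the sample
document, and the class JdkXmlConfigReader are assumptions made for
illustration. They are not taken from the attached patches.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

public class JdkXmlConfigReader {

    /** Parses name/value pairs from <property> elements using only JDK classes. */
    public static Map<String, String> readProperties(String xml) {
        try {
            DocumentBuilder builder =
                DocumentBuilderFactory.newInstance().newDocumentBuilder();
            Document doc = builder.parse(
                new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));

            // Keep properties in document order
            Map<String, String> props = new LinkedHashMap<>();
            NodeList nodes = doc.getElementsByTagName("property");
            for (int i = 0; i < nodes.getLength(); i++) {
                Element property = (Element) nodes.item(i);
                String name = textOf(property, "name");
                if (name != null) {
                    props.put(name, textOf(property, "value"));
                }
            }
            return props;
        } catch (Exception e) {
            // Wrap parser and I/O failures so callers need no checked exceptions
            throw new IllegalStateException("Could not parse configuration XML", e);
        }
    }

    /** Returns the trimmed text of the first descendant with the given tag, or null. */
    private static String textOf(Element parent, String tag) {
        NodeList children = parent.getElementsByTagName(tag);
        return children.getLength() == 0
            ? null
            : children.item(0).getTextContent().trim();
    }

    public static void main(String[] args) {
        // Hypothetical sample document, assumed layout
        String xml = "<conf><property><name>http.agent.name</name>"
                   + "<value>demo</value></property></conf>";
        System.out.println(readProperties(xml));
    }
}
```

Writing could go through javax.xml.transform in the same spirit, so that
no third-party XML library is needed for configuration handling.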
