[ http://issues.apache.org/jira/browse/NUTCH-35?page=history ]
Stefan Grroschupf updated NUTCH-35:
-----------------------------------
Attachment: xmlApiPatchIII.patch
It's a shame, however I'm sure one day there will be a patch from me that just
need to be assigned - I hope. :-)
The problem was more a bug in the old implementation. The plugin.xml parser was
written in a way that it require a export subnode for any library element. But
this wasn't working correct in the dom4j parser so all library elements was
accepted, the new parser now was checking if the element exists.
I changed the parser to be less restrict.
[junit] Running org.apache.nutch.analysis.TestQueryParser
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0,903 sec
[junit] Running org.apache.nutch.db.TestWebDB
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 34,282 sec
[junit] Running org.apache.nutch.fetcher.TestFetcher
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 18,086 sec
[junit] Running org.apache.nutch.fetcher.TestFetcherOutput
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0,677 sec
[junit] Running org.apache.nutch.io.TestArrayFile
[junit] 050413 224058 parsing
file:/Applications/eclipse/workspace/nutch/conf/nutch-default.xml
[junit] 050413 224058 parsing
file:/Applications/eclipse/workspace/nutch/build/test/classes/nutch-site.xml
[junit] Tests run: 2, Failures: 0, Errors: 0, Time elapsed: 1,621 sec
[junit] Running org.apache.nutch.io.TestMD5Hash
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0,047 sec
[junit] Running org.apache.nutch.io.TestSequenceFile
[junit] 050413 224100 parsing
file:/Applications/eclipse/workspace/nutch/conf/nutch-default.xml
[junit] 050413 224101 parsing
file:/Applications/eclipse/workspace/nutch/build/test/classes/nutch-site.xml
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 3,435 sec
[junit] Running org.apache.nutch.io.TestSetFile
[junit] 050413 224104 parsing
file:/Applications/eclipse/workspace/nutch/conf/nutch-default.xml
[junit] 050413 224105 parsing
file:/Applications/eclipse/workspace/nutch/build/test/classes/nutch-site.xml
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 1,026 sec
[junit] Running org.apache.nutch.io.TestUTF8
[junit] Tests run: 3, Failures: 0, Errors: 0, Time elapsed: 0,138 sec
[junit] Running org.apache.nutch.io.TestVersionedWritable
[junit] Tests run: 3, Failures: 0, Errors: 0, Time elapsed: 0,034 sec
[junit] Running org.apache.nutch.io.TestWritable
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0,017 sec
[junit] Running org.apache.nutch.ipc.TestIPC
[junit] 050413 224107 parsing
file:/Applications/eclipse/workspace/nutch/conf/nutch-default.xml
[junit] 050413 224107 parsing
file:/Applications/eclipse/workspace/nutch/build/test/classes/nutch-site.xml
[junit] Tests run: 2, Failures: 0, Errors: 0, Time elapsed: 8,489 sec
[junit] Running org.apache.nutch.ipc.TestRPC
[junit] 050413 224116 parsing
file:/Applications/eclipse/workspace/nutch/conf/nutch-default.xml
[junit] 050413 224116 parsing
file:/Applications/eclipse/workspace/nutch/build/test/classes/nutch-site.xml
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 10,633 sec
[junit] Running org.apache.nutch.net.TestBasicUrlNormalizer
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0,661 sec
[junit] Running org.apache.nutch.net.TestRegexUrlNormalizer
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0,63 sec
[junit] Running org.apache.nutch.pagedb.TestFetchListEntry
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0,678 sec
[junit] Running org.apache.nutch.pagedb.TestPage
[junit] Tests run: 2, Failures: 0, Errors: 0, Time elapsed: 0,674 sec
[junit] Running org.apache.nutch.parse.TestParseData
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0,659 sec
[junit] Running org.apache.nutch.parse.TestParseText
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0,03 sec
[junit] Running org.apache.nutch.plugin.TestPluginSystem
[junit] Tests run: 6, Failures: 0, Errors: 0, Time elapsed: 0,932 sec
[junit] Running org.apache.nutch.protocol.TestContent
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0,107 sec
[junit] Running org.apache.nutch.searcher.TestHitDetails
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0,081 sec
[junit] Running org.apache.nutch.searcher.TestQuery
[junit] Tests run: 6, Failures: 0, Errors: 0, Time elapsed: 0,894 sec
[junit] Running org.apache.nutch.tools.TestSegmentMergeTool
[junit] Tests run: 3, Failures: 1, Errors: 0, Time elapsed: 29,829 sec
[junit] Test org.apache.nutch.tools.TestSegmentMergeTool FAILED
[junit] Running org.apache.nutch.util.TestFibonacciHeap
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0,246 sec
[junit] Running org.apache.nutch.util.TestGZIPUtils
[junit] Tests run: 4, Failures: 0, Errors: 0, Time elapsed: 0,898 sec
[junit] Running org.apache.nutch.util.TestPrefixStringMatcher
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0,094 sec
[junit] Running org.apache.nutch.util.TestSoftHashMap
[junit] Tests run: 2, Failures: 0, Errors: 0, Time elapsed: 20,283 sec
[junit] Running org.apache.nutch.util.TestStringUtil
[junit] Tests run: 2, Failures: 0, Errors: 0, Time elapsed: 0,009 sec
[junit] Running org.apache.nutch.util.TestSuffixStringMatcher
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0,097 sec
> modify XML parsing code in Nutch to use single API
> --------------------------------------------------
>
> Key: NUTCH-35
> URL: http://issues.apache.org/jira/browse/NUTCH-35
> Project: Nutch
> Type: Improvement
> Environment: nutch target platform (UNIX/Linux/Mac OSX/Windows/etc.)
> Reporter: Chris A. Mattmann
> Assignee: Stefan Grroschupf
> Priority: Minor
> Attachments: xmlApiPatchII.patch, xmlApiPatchIII.patch, xml_API_patch.txt
>
> Nutch uses more than a single XML reading API to parse xml configuration
> files and perform other such XML operations. The goal of this particular
> reported issue would be to modfiy the nutch xml parsing functionality to use
> a single XML reading and writing API. As suggested by Doug, it would be nice
> if the XML reading/writing could be done through the native Java classes
> provided with the JDK. I can get this done by the end of the month if its
> assigned to me.
> Thanks!
> --Chris
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
If you want more information on JIRA, or have a bug to report see:
http://www.atlassian.com/software/jira
-------------------------------------------------------
SF email is sponsored by - The IT Product Guide
Read honest & candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now.
http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers