[
https://issues.apache.org/jira/browse/NUTCH-2059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14625450#comment-14625450
]
Peter Ciuffetti commented on NUTCH-2059:
----------------------------------------
I've downloaded the XSD that defines the XML format expected by xUnit. See:
https://svn.jenkins-ci.org/trunk/hudson/dtkit/dtkit-format/dtkit-junit-model/src/main/resources/com/thalesgroup/dtkit/junit/model/xsd/junit-7.xsd
Ant then I rant all the XML files produced by ant through xmllint and they all
validate.
{code}
nutch pciuffetti$ find . | egrep ".*TEST.*\.xml" | xargs xmllint --noout
--schema junit-7.xsd
./build/creativecommons/test/TEST-org.creativecommons.nutch.TestCCParseFilter.xml
validates
./build/feed/test/TEST-org.apache.nutch.parse.feed.TestFeedParser.xml validates
./build/index-anchor/test/TEST-org.apache.nutch.indexer.anchor.TestAnchorIndexingFilter.xml
validates
./build/index-basic/test/TEST-org.apache.nutch.indexer.basic.TestBasicIndexingFilter.xml
validates
./build/index-more/test/TEST-org.apache.nutch.indexer.more.TestMoreIndexingFilter.xml
validates
./build/index-replace/test/TEST-org.apache.nutch.indexer.replace.TestIndexReplace.xml
validates
./build/index-static/test/TEST-org.apache.nutch.indexer.staticfield.TestStaticFieldIndexerTest.xml
validates
./build/language-identifier/test/TEST-org.apache.nutch.analysis.lang.TestHTMLLanguageParser.xml
validates
./build/lib-http/test/TEST-org.apache.nutch.protocol.http.api.TestRobotRulesParser.xml
validates
./build/parse-html/test/TEST-org.apache.nutch.parse.html.TestDOMContentUtils.xml
validates
./build/parse-html/test/TEST-org.apache.nutch.parse.html.TestHtmlParser.xml
validates
./build/parse-html/test/TEST-org.apache.nutch.parse.html.TestRobotsMetaProcessor.xml
validates
./build/parse-metatags/test/TEST-org.apache.nutch.parse.metatags.TestMetatagParser.xml
validates
./build/parse-swf/test/TEST-org.apache.nutch.parse.swf.TestSWFParser.xml
validates
./build/parse-tika/test/TEST-org.apache.nutch.tika.TestDOMContentUtils.xml
validates
./build/parse-tika/test/TEST-org.apache.nutch.tika.TestFeedParser.xml validates
./build/parse-tika/test/TEST-org.apache.nutch.tika.TestImageMetadata.xml
validates
./build/parse-tika/test/TEST-org.apache.nutch.tika.TestMSWordParser.xml
validates
./build/parse-tika/test/TEST-org.apache.nutch.tika.TestOOParser.xml validates
./build/parse-tika/test/TEST-org.apache.nutch.tika.TestPdfParser.xml validates
./build/parse-tika/test/TEST-org.apache.nutch.tika.TestRobotsMetaProcessor.xml
validates
./build/parse-tika/test/TEST-org.apache.nutch.tika.TestRTFParser.xml validates
./build/parse-zip/test/TEST-org.apache.nutch.parse.zip.TestZipParser.xml
validates
./build/protocol-file/test/TEST-org.apache.nutch.protocol.file.TestProtocolFile.xml
validates
./build/protocol-http/test/TEST-org.apache.nutch.protocol.http.TestProtocolHttp.xml
validates
./build/protocol-httpclient/test/TEST-org.apache.nutch.protocol.httpclient.TestProtocolHttpClient.xml
validates
./build/subcollection/test/TEST-org.apache.nutch.collection.TestSubcollection.xml
validates
./build/test/TEST-org.apache.nutch.crawl.TestAdaptiveFetchSchedule.xml validates
./build/test/TEST-org.apache.nutch.crawl.TestCrawlDbFilter.xml validates
./build/test/TEST-org.apache.nutch.crawl.TestCrawlDbMerger.xml validates
./build/test/TEST-org.apache.nutch.crawl.TestCrawlDbStates.xml validates
./build/test/TEST-org.apache.nutch.crawl.TestGenerator.xml validates
./build/test/TEST-org.apache.nutch.crawl.TestInjector.xml validates
./build/test/TEST-org.apache.nutch.crawl.TestLinkDbMerger.xml validates
./build/test/TEST-org.apache.nutch.crawl.TestSignatureFactory.xml validates
./build/test/TEST-org.apache.nutch.fetcher.TestFetcher.xml validates
./build/test/TEST-org.apache.nutch.indexer.TestIndexingFilters.xml validates
./build/test/TEST-org.apache.nutch.metadata.TestMetadata.xml validates
./build/test/TEST-org.apache.nutch.metadata.TestSpellCheckedMetadata.xml
validates
./build/test/TEST-org.apache.nutch.net.TestURLFilters.xml validates
./build/test/TEST-org.apache.nutch.net.TestURLNormalizers.xml validates
./build/test/TEST-org.apache.nutch.parse.TestOutlinkExtractor.xml validates
./build/test/TEST-org.apache.nutch.parse.TestParseData.xml validates
./build/test/TEST-org.apache.nutch.parse.TestParserFactory.xml validates
./build/test/TEST-org.apache.nutch.parse.TestParseText.xml validates
./build/test/TEST-org.apache.nutch.plugin.TestPluginSystem.xml validates
./build/test/TEST-org.apache.nutch.protocol.TestContent.xml validates
./build/test/TEST-org.apache.nutch.protocol.TestProtocolFactory.xml validates
./build/test/TEST-org.apache.nutch.segment.TestSegmentMerger.xml validates
./build/test/TEST-org.apache.nutch.segment.TestSegmentMergerCrawlDatums.xml
validates
./build/test/TEST-org.apache.nutch.tools.TestCommonCrawlDataDumper.xml validates
./build/test/TEST-org.apache.nutch.util.TestEncodingDetector.xml validates
./build/test/TEST-org.apache.nutch.util.TestGZIPUtils.xml validates
./build/test/TEST-org.apache.nutch.util.TestMimeUtil.xml validates
./build/test/TEST-org.apache.nutch.util.TestNodeWalker.xml validates
./build/test/TEST-org.apache.nutch.util.TestPrefixStringMatcher.xml validates
./build/test/TEST-org.apache.nutch.util.TestStringUtil.xml validates
./build/test/TEST-org.apache.nutch.util.TestSuffixStringMatcher.xml validates
./build/test/TEST-org.apache.nutch.util.TestURLUtil.xml validates
./build/urlfilter-automaton/test/TEST-org.apache.nutch.urlfilter.automaton.TestAutomatonURLFilter.xml
validates
./build/urlfilter-domain/test/TEST-org.apache.nutch.urlfilter.domain.TestDomainURLFilter.xml
validates
./build/urlfilter-domainblacklist/test/TEST-org.apache.nutch.urlfilter.domainblacklist.TestDomainBlacklistURLFilter.xml
validates
./build/urlfilter-prefix/test/TEST-org.apache.nutch.urlfilter.prefix.TestPrefixURLFilter.xml
validates
./build/urlfilter-regex/test/TEST-org.apache.nutch.urlfilter.regex.TestRegexURLFilter.xml
validates
./build/urlfilter-suffix/test/TEST-org.apache.nutch.urlfilter.suffix.TestSuffixURLFilter.xml
validates
./build/urlfilter-validator/test/TEST-org.apache.nutch.urlfilter.validator.TestUrlValidator.xml
validates
./build/urlnormalizer-ajax/test/TEST-org.apache.nutch.net.urlnormalizer.ajax.TestAjaxURLNormalizer.xml
validates
./build/urlnormalizer-basic/test/TEST-org.apache.nutch.net.urlnormalizer.basic.TestBasicURLNormalizer.xml
validates
./build/urlnormalizer-host/test/TEST-org.apache.nutch.net.urlnormalizer.host.TestHostURLNormalizer.xml
validates
./build/urlnormalizer-pass/test/TEST-org.apache.nutch.net.urlnormalizer.pass.TestPassURLNormalizer.xml
validates
./build/urlnormalizer-querystring/test/TEST-org.apache.nutch.net.urlnormalizer.querystring.TestQuerystringURLNormalizer.xml
validates
./build/urlnormalizer-regex/test/TEST-org.apache.nutch.net.urlnormalizer.regex.TestRegexURLNormalizer.xml
validates
./build/urlnormalizer-slash/test/TEST-org.apache.nutch.net.urlnormalizer.slash.TestSlashURLNormalizer.xml
validates
{code}
> protocol-httpclient, protocol-http unit test errors on Jenkins
> --------------------------------------------------------------
>
> Key: NUTCH-2059
> URL: https://issues.apache.org/jira/browse/NUTCH-2059
> Project: Nutch
> Issue Type: Bug
> Components: fetcher
> Reporter: Peter Ciuffetti
> Assignee: Chris A. Mattmann
> Fix For: 1.11
>
>
> This is an occasional error on the build of the Nutch trunk visible in
> Jenkins builds. It happens on either protocol-http or protocol-httpclient,
> which can be running at the same time given the multi-threaded test setup.
> {code}
> [junit] Running org.apache.nutch.protocol.httpclient.TestProtocolHttpClient
> [junit] Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 2.377
> sec
> [junit] Test org.apache.nutch.protocol.http.TestProtocolHttp FAILED
> {code}
> Evidence of failure on Jenkins go back to
> Failed > Console Output #3154 Jun 8, 2015 4:00:00 AM
> https://builds.apache.org/view/All/job/Nutch-trunk/3154/consoleFull
> And are repeated at...
> https://builds.apache.org/view/All/job/Nutch-trunk/3190/console
> https://builds.apache.org/view/All/job/Nutch-trunk/3189/console
> Some possibly related tickets
> NUTCH-1836 Timeouts in protocol-httpclient when crawling same host with >2
> threads
> NUTCH-1086 Rewrite protocol-httpclient
> The unit tests are not failing for me on my sandbox, but there are some
> exceptions being output to the log related to headers being sent on JSP pages
> after the response writer is invoked.
> {code}
> java.lang.IllegalStateException: STREAM
> at org.mortbay.jetty.Response.getWriter(Response.java:616)
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)