Hi all,
I got past the JCE issue, but now some tests are failing with timeouts.
For this test:
[INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
I get hundreds of these warnings:
Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the
XMLReaderUtils.POOL_SIZE
And then:
[ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
318.512 s <<< FAILURE! - in
org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
[ERROR]
org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint
Time elapsed: 308.223 s <<< ERROR!
org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException from
org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
at
org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
SAXParser; This could indicate that a parser has not correctly released its
SAXParser. Please report this to the Tika team: [email protected]
at
org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
Caused by: org.apache.tika.exception.TikaException: Waited more than 5 minutes
for a SAXParser; This could indicate that a parser has not correctly released
its SAXParser. Please report this to the Tika team: [email protected]
at
org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
Similarly, for:
[INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
Many of these:
Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the
XMLReaderUtils.POOL_SIZE
And then similarly:
[ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
309.375 s <<< FAILURE! - in
org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
[ERROR]
org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint
Time elapsed: 307.9 s <<< ERROR!
org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException from
org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
at
org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
SAXParser; This could indicate that a parser has not correctly released its
SAXParser. Please report this to the Tika team: [email protected]
at
org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
Caused by: org.apache.tika.exception.TikaException: Waited more than 5 minutes
for a SAXParser; This could indicate that a parser has not correctly released
its SAXParser. Please report this to the Tika team: [email protected]
at
org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
And now:
[INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
[INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.832 s
- in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
[INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
[WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 0.053
s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
[INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest
Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the
XMLReaderUtils.POOL_SIZE
Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the
XMLReaderUtils.POOL_SIZE
Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the
XMLReaderUtils.POOL_SIZE
… and so on…
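For reference, the failure mode the error message describes (a pooled SAXParser that is borrowed and never returned) can be sketched in isolation. This is only a minimal Java sketch under stated assumptions, not Tika's actual code: PoolLeakSketch, FakeParser, acquire and release are hypothetical names, and the 50 ms timeout stands in for the 5-minute wait in the log above.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.TimeUnit;

public class PoolLeakSketch {
    /** Hypothetical stand-in for a pooled parser; not a Tika class. */
    static final class FakeParser {}

    // Fixed-size pool: every borrowed parser must be handed back.
    private final BlockingQueue<FakeParser> pool;

    PoolLeakSketch(int size) {
        pool = new ArrayBlockingQueue<>(size);
        for (int i = 0; i < size; i++) {
            pool.add(new FakeParser());
        }
    }

    /** Returns a parser, or null if none becomes free within the timeout. */
    FakeParser acquire(long timeoutMillis) throws InterruptedException {
        return pool.poll(timeoutMillis, TimeUnit.MILLISECONDS);
    }

    /** Hands a borrowed parser back to the pool. */
    void release(FakeParser parser) {
        pool.offer(parser);
    }

    public static void main(String[] args) throws InterruptedException {
        PoolLeakSketch sketch = new PoolLeakSketch(2);

        // Well-behaved caller: release in finally, so the pool stays full.
        FakeParser borrowed = sketch.acquire(50);
        try {
            // ... parse something ...
        } finally {
            sketch.release(borrowed);
        }

        // Leaky callers: two parsers are taken and never returned.
        sketch.acquire(50);
        sketch.acquire(50);

        // The next caller waits out the full timeout and gets nothing.
        FakeParser starved = sketch.acquire(50);
        System.out.println("starved caller got: " + starved);
    }
}
```

If something on the code path under test behaves like the leaky callers here, a larger pool would only delay the timeout rather than prevent it.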
Any suggestions?
Thanks!
— Ken
--------------------------
Ken Krugler
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra & Solr