Hi all,

I got past the JCE issue, but now some tests are failing with timeouts.

For this test:

[INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest

I get hundreds of these warnings:

Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE

And then:

[ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed: 318.512 s <<< FAILURE! - in org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
[ERROR] org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint  Time elapsed: 308.223 s  <<< ERROR!
org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
        at org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a SAXParser; This could indicate that a parser has not correctly released its SAXParser. Please report this to the Tika team: dev@tika.apache.org
        at org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
Caused by: org.apache.tika.exception.TikaException: Waited more than 5 minutes for a SAXParser; This could indicate that a parser has not correctly released its SAXParser. Please report this to the Tika team: dev@tika.apache.org
        at org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)

Similarly, for:

[INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest

Many of these:

Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE

And then similarly:

[ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed: 309.375 s <<< FAILURE! - in org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
[ERROR] org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint  Time elapsed: 307.9 s  <<< ERROR!
org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
        at org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a SAXParser; This could indicate that a parser has not correctly released its SAXParser. Please report this to the Tika team: dev@tika.apache.org
        at org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
Caused by: org.apache.tika.exception.TikaException: Waited more than 5 minutes for a SAXParser; This could indicate that a parser has not correctly released its SAXParser. Please report this to the Tika team: dev@tika.apache.org
        at org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)

And now:

[INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
[INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
[INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
[WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
[INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest
Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE
Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE
Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE
… and so on…
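
To frame the question, here is a minimal sketch of the failure mode the exception message seems to describe: a fixed-size pool in which one borrowed SAXParser is never returned, so later callers wait and eventually time out. This is only an illustration under my own assumptions. The class ParserPoolSketch and its acquire/release/available methods are hypothetical names, it uses only the JDK, and it is not Tika's XMLReaderUtils code.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.TimeUnit;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.SAXException;

/** Hypothetical fixed-size parser pool; NOT Tika's XMLReaderUtils implementation. */
class ParserPoolSketch {
    private final BlockingQueue<SAXParser> pool;

    ParserPoolSketch(int size) throws ParserConfigurationException, SAXException {
        pool = new ArrayBlockingQueue<>(size);
        SAXParserFactory factory = SAXParserFactory.newInstance();
        for (int i = 0; i < size; i++) {
            pool.add(factory.newSAXParser());
        }
    }

    /** Waits up to timeoutMillis; returns null if every parser is still checked out. */
    SAXParser acquire(long timeoutMillis) throws InterruptedException {
        return pool.poll(timeoutMillis, TimeUnit.MILLISECONDS);
    }

    /** Returns a borrowed parser to the pool. */
    void release(SAXParser parser) {
        pool.offer(parser);
    }

    /** Number of parsers currently available to borrow. */
    int available() {
        return pool.size();
    }

    public static void main(String[] args) throws Exception {
        ParserPoolSketch p = new ParserPoolSketch(1);

        // Borrow the only parser and "forget" to release it (the suspected bug).
        SAXParser leaked = p.acquire(100);

        // The next caller waits for the timeout and gets nothing.
        SAXParser second = p.acquire(100);
        System.out.println("second acquire after leak: "
                + (second == null ? "timed out" : "ok"));

        // Returning the parser makes the pool usable again.
        p.release(leaked);

        // Releasing in a finally block avoids the leak even if parsing throws.
        SAXParser parser = p.acquire(100);
        try {
            // parser.parse(...) would go here
        } finally {
            p.release(parser);
        }
        System.out.println("available after try/finally: " + p.available());
    }
}
```

If something like this is happening, a larger pool size would only delay the timeout, not remove it, which is why I suspect a missing release on some code path rather than real contention.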

Any suggestions?

Thanks!

— Ken

--------------------------
Ken Krugler
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra & Solr
