Hi Ken, just to double check, did you intend to send this mail to the tika dev list? I actually don't know what to do with your email.
Best, Arvid On Sat, Nov 21, 2020 at 11:43 PM Ken Krugler <[email protected]> wrote: > Hi all, > > I got past the JCE issue, but now some tests are failing with timeouts. > > For this test: > > [INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest > > I get 100s of these warnings: > > Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils > acquireSAXParser > WARNING: Contention waiting for a SAXParser. Consider increasing the > XMLReaderUtils.POOL_SIZE > > And then: > > [ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed: > 318.512 s <<< FAILURE! - in > org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest > [ERROR] > org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint > Time elapsed: 308.223 s <<< ERROR! > org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException > from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60 > at > org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341) > Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a > SAXParser; This could indicate that a parser has not correctly released its > SAXParser. Please report this to the Tika team: [email protected] > at > org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341) > Caused by: org.apache.tika.exception.TikaException: Waited more than 5 > minutes for a SAXParser; This could indicate that a parser has not > correctly released its SAXParser. Please report this to the Tika team: > [email protected] > at > org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341) > > Similarly, for: > > [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest > > Many of these: > > Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils > acquireSAXParser > WARNING: Contention waiting for a SAXParser. Consider increasing the > XMLReaderUtils.POOL_SIZE > > And then similarly: > > [ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed: > 309.375 s <<< FAILURE! - in > org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest > [ERROR] > org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint > Time elapsed: 307.9 s <<< ERROR! > org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException > from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60 > at > org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281) > Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a > SAXParser; This could indicate that a parser has not correctly released its > SAXParser. Please report this to the Tika team: [email protected] > at > org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281) > Caused by: org.apache.tika.exception.TikaException: Waited more than 5 > minutes for a SAXParser; This could indicate that a parser has not > correctly released its SAXParser. Please report this to the Tika team: > [email protected] > at > org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281) > > And now: > > [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest > [INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: > 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest > [INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest > [WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: > 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest > [INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest > Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils > acquireSAXParser > WARNING: Contention waiting for a SAXParser. Consider increasing the > XMLReaderUtils.POOL_SIZE > Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils > acquireSAXParser > WARNING: Contention waiting for a SAXParser. Consider increasing the > XMLReaderUtils.POOL_SIZE > Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils > acquireSAXParser > WARNING: Contention waiting for a SAXParser. Consider increasing the > XMLReaderUtils.POOL_SIZE > … and so on… > > Any suggestions? > > Thanks! > > — Ken > > > -------------------------- > Ken Krugler > http://www.scaleunlimited.com > custom big data solutions & training > Hadoop, Cascading, Cassandra & Solr > > -- Arvid Heise | Senior Java Developer <https://www.ververica.com/> Follow us @VervericaData -- Join Flink Forward <https://flink-forward.org/> - The Apache Flink Conference Stream Processing | Event Driven | Real Time -- Ververica GmbH | Invalidenstrasse 115, 10115 Berlin, Germany -- Ververica GmbH Registered at Amtsgericht Charlottenburg: HRB 158244 B Managing Directors: Timothy Alexander Steinert, Yip Park Tung Jason, Ji (Toni) Cheng
