Hi Ken,

Just to double-check: did you intend to send this email to the Tika dev
list? I'm not sure what to do with it.

Best,

Arvid

On Sat, Nov 21, 2020 at 11:43 PM Ken Krugler <[email protected]>
wrote:

> Hi all,
>
> I got past the JCE issue, but now some tests are failing with timeouts.
>
> For this test:
>
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
>
> I get 100s of these warnings:
>
> Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
>
> And then:
>
> [ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
> 318.512 s <<< FAILURE! - in
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
> [ERROR]
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint
> Time elapsed: 308.223 s  <<< ERROR!
> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>         at
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
> SAXParser; This could indicate that a parser has not correctly released its
> SAXParser. Please report this to the Tika team: [email protected]
>         at
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
> minutes for a SAXParser; This could indicate that a parser has not
> correctly released its SAXParser. Please report this to the Tika team:
> [email protected]
>         at
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>
> Similarly, for:
>
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
>
> Many of these:
>
> Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
>
> And then similarly:
>
> [ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
> 309.375 s <<< FAILURE! - in
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
> [ERROR]
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint
> Time elapsed: 307.9 s  <<< ERROR!
> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>         at
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
> SAXParser; This could indicate that a parser has not correctly released its
> SAXParser. Please report this to the Tika team: [email protected]
>         at
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
> minutes for a SAXParser; This could indicate that a parser has not
> correctly released its SAXParser. Please report this to the Tika team:
> [email protected]
>         at
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>
> And now:
>
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
> [INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
> 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
> [WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed:
> 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest
> Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
> Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
> Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
> … and so on…
>
> Any suggestions?
>
> Thanks!
>
> — Ken
>
>
> --------------------------
> Ken Krugler
> http://www.scaleunlimited.com
> custom big data solutions & training
> Hadoop, Cascading, Cassandra & Solr
>
>

-- 

Arvid Heise | Senior Java Developer

<https://www.ververica.com/>

Follow us @VervericaData

--

Join Flink Forward <https://flink-forward.org/> - The Apache Flink
Conference

Stream Processing | Event Driven | Real Time

--

Ververica GmbH | Invalidenstrasse 115, 10115 Berlin, Germany

--
Ververica GmbH
Registered at Amtsgericht Charlottenburg: HRB 158244 B
Managing Directors: Timothy Alexander Steinert, Yip Park Tung Jason, Ji
(Toni) Cheng
