Hi Tim,

> On Nov 23, 2020, at 10:46 AM, Tim Allison <talli...@apache.org> wrote:
> 
> Ken,
>  Thank you for finding this and sharing it.  I haven't seen this on my Mac
> or Ubuntu... not denying what you are seeing!
> 
>  Are you able to build 1.24.1 with no problem?  

Trying that now…

> I wonder if your system is
> using a different SAXParser which is not handled correctly in
> XMLReaderUtils?  What OS, what version of java?

Mac OS 10.12.6

java version "1.8.0_131"
Java(TM) SE Runtime Environment (build 1.8.0_131-b11)
Java HotSpot(TM) 64-Bit Server VM (build 25.131-b11, mixed mode)
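
For context on the failure quoted below: a long wait that ends in a "waited more than N minutes" error is what a fixed-size pool with a timed acquire produces when some borrower never hands its item back. Here is a minimal sketch of that pattern. It is not Tika's actual XMLReaderUtils code; BoundedPool, acquire, release and PoolDemo are hypothetical names made up for illustration, and the 100 ms timeout stands in for the 5 minutes in the log.

```java
import java.util.Arrays;
import java.util.Collection;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

// Hypothetical fixed-size pool; illustration only, not Tika's implementation.
class BoundedPool<T> {
    private final BlockingQueue<T> idle;

    // Assumes a non-empty collection; ArrayBlockingQueue rejects capacity 0.
    BoundedPool(Collection<T> items) {
        // Capacity equals the number of pooled items, so release() never blocks.
        idle = new ArrayBlockingQueue<>(items.size(), false, items);
    }

    // Wait up to timeoutMillis for a free item, then give up with an exception.
    T acquire(long timeoutMillis) throws InterruptedException, TimeoutException {
        T item = idle.poll(timeoutMillis, TimeUnit.MILLISECONDS);
        if (item == null) {
            throw new TimeoutException(
                "no pooled item became free within " + timeoutMillis + " ms");
        }
        return item;
    }

    // Must run on every code path (normally in a finally block), or the pool drains.
    void release(T item) {
        idle.offer(item);
    }
}

class PoolDemo {
    public static void main(String[] args) throws Exception {
        BoundedPool<String> pool = new BoundedPool<>(Arrays.asList("parser-1"));

        // Correct usage: the item goes back even if the work throws.
        String parser = pool.acquire(100);
        try {
            System.out.println("using " + parser);
        } finally {
            pool.release(parser);
        }

        // Leaky usage: this borrower never calls release().
        pool.acquire(100);

        // The pool is now empty, so the next caller waits and then times out.
        try {
            pool.acquire(100);
        } catch (TimeoutException e) {
            System.out.println("timed out: " + e.getMessage());
        }
    }
}
```

If the real pool behaves like this, a larger pool size would only delay the timeout when items are being leaked, which is why the exception text points at a parser that did not release its SAXParser.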

— Ken

> On Mon, Nov 23, 2020 at 1:40 PM Ken Krugler <kkrugler_li...@transpac.com>
> wrote:
> 
>> Hi all,
>> 
>> I got past the JCE issue, but now some tests are failing with timeouts.
>> 
>> For this test:
>> 
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
>> 
>> I get 100s of these warnings:
>> 
>> Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>> 
>> And then:
>> 
>> [ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
>> 318.512 s <<< FAILURE! - in
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
>> [ERROR]
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint
>> Time elapsed: 308.223 s  <<< ERROR!
>> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
>> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>>        at
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
>> SAXParser; This could indicate that a parser has not correctly released its
>> SAXParser. Please report this to the Tika team: dev@tika.apache.org
>>        at
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
>> minutes for a SAXParser; This could indicate that a parser has not
>> correctly released its SAXParser. Please report this to the Tika team:
>> dev@tika.apache.org
>>        at
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>> 
>> Similarly, for:
>> 
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
>> 
>> Many of these:
>> 
>> Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>> 
>> And then similarly:
>> 
>> [ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
>> 309.375 s <<< FAILURE! - in
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
>> [ERROR]
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint
>> Time elapsed: 307.9 s  <<< ERROR!
>> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
>> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>>        at
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
>> SAXParser; This could indicate that a parser has not correctly released its
>> SAXParser. Please report this to the Tika team: dev@tika.apache.org
>>        at
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
>> minutes for a SAXParser; This could indicate that a parser has not
>> correctly released its SAXParser. Please report this to the Tika team:
>> dev@tika.apache.org
>>        at
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>> 
>> And now:
>> 
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
>> [INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
>> [WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed:
>> 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest
>> Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>> Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>> Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>> … and so on…
>> 
>> Any suggestions?
>> 
>> Thanks!
>> 
>> — Ken
>> 
>> --------------------------
>> Ken Krugler
>> http://www.scaleunlimited.com
>> custom big data solutions & training
>> Hadoop, Cascading, Cassandra & Solr
>> 
>> 

--------------------------
Ken Krugler
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra & Solr
