Thank you, Dominik!  For the regression tests and for fixing #1 already.

As for #2, I wonder if we should look into using the actual Apache xerces for 
the next beta? Not another dependency....

-----Original Message-----
From: Dominik Stadler [mailto:dominik.stad...@gmx.at] 
Sent: Friday, April 01, 2016 4:13 PM
To: POI Developers List <dev@poi.apache.org>
Subject: Re: Beta time yet?

Hi,

the POI specific regression test report is now updated for comparing
3.14-beta2 (nearly 3.14 final) with the current 3.15-beta1 (as of Tuesday) at 
http://people.apache.org/~centic/poi_regression/reports/

Runtime-wise I see that the run did take aprox 25h:15min now compared to 25h 
before, so I would say no actual change in runtime as I was doing other stuff 
on the machine for some time and thus might have affected these timings 
somewhat!

I see two issues:

1.
A XSLF related new error:

java.lang.ArrayStoreException
        at java.util.ArrayList.toArray(ArrayList.java:408)
        at 
org.openxmlformats.schemas.drawingml.x2006.main.impl.CTGeomGuideListImpl.getGdArray(Unknown
Source)
        at 
o.a.p.xslf.usermodel.XSLFSimpleShape.getAdjustValue(XSLFSimpleShape.java:823)

Seems like list members with different types are used here so that putting them 
all into an array fails.
However I could not reproduce this locally, so not sure how it was triggered in 
the regression testing?!?

2.
An XML parsing related one:

Caused by: java.lang.ArrayIndexOutOfBoundsException: 8192
        at 
com.sun.org.apache.xerces.internal.impl.io.UTF8Reader.read(UTF8Reader.java:329)

This seems to be a bug in the JDK itself in relation to surrogate Unicode 
characters, see e.g.
https://bugs.openjdk.java.net/browse/JDK-7156085 and originally
https://issues.apache.org/jira/browse/XERCESJ-1257 for more detailed discussion.

Seems only JDK 9 has a fix for this :(
However it is very rare, only 6 times in 1 mio documents, so I think it 
outweights the gain from using the JDK XML Parser.

Dominik.


On Wed, Mar 30, 2016 at 4:52 PM, Dominik Stadler <dominik.stad...@gmx.at>
wrote:

>
> Ok, I'll try to squeeze in a run of my version of the large regression 
> test-suite with the latest trunk as it may cover a few different areas 
> not used in Tika.
>
> Dominik.
>
> On Wed, Mar 30, 2016 at 1:36 PM, kiwiwings <kiwiwi...@apache.org> wrote:
>
>> Nick Burch-8 wrote
>> > What do people think about doing a beta release in a few weeks time ...?
>>
>> Last time it took ages between the thread start and the actual release.
>> As the general consensus is anyway having short release cycles, I'll 
>> simply start the releasing trip on the next weekend (Friday evening).
>>
>> Maybe I should spent some time to script more of the manual work ... 
>> - as we use Jenkins here at my $dayjob, I'm used to (nearly) 
>> one-click-releasing the artifacts - I guess something like this would 
>> be possible with Ant too ...
>>
>> Andi
>>
>>
>>
>> --
>> View this message in context:
>> http://apache-poi.1045710.n5.nabble.com/Beta-time-yet-tp5722477p57224
>> 81.html Sent from the POI - Dev mailing list archive at Nabble.com.
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org For additional 
>> commands, e-mail: dev-h...@poi.apache.org
>>
>>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org

Reply via email to