Re: POI 5.1.0 RC2?

2021-10-24 Thread Andreas Beeker
Hi PJ, On 24.10.21 16:58, PJ Fanning wrote: I added a change to poi-ooxml-lite gradle file to force the inclusion of the 107 extra XSBs. poi-ooxml-lite 5.0.0 - XSB count=1498 poi-ooxml-lite latest snapshot - XSB count=1220 no 5.0.0 XSBs are missing from snapshot. Is there anything else

Re: POI 5.1.0 RC2?

2021-10-24 Thread PJ Fanning
Hi Alain, We don't seem to have any real world users hitting issues with chart cloning - it's just something that the test harness that tests lots of files does. I don't think there is any urgency on fixing the cloning issue. Regards, PJ On Sunday 24 October 2021, 17:10:14 IST, Alain

Re: POI 5.1.0 RC2?

2021-10-24 Thread Alain FAGOT BÉAREZ
Hi, I will definitely need to reorganise my life to reserve some time for the required XDDF improvements. The code has been developed with only a few examples. This is why it is still marked with @Beta but people seem to start using it for real... Best regards, Alain FAGOT BÉAREZ

Re: POI 5.1.0 RC2?

2021-10-24 Thread PJ Fanning
I added a change to poi-ooxml-lite gradle file to force the inclusion of the 107 extra XSBs. poi-ooxml-lite 5.0.0 - XSB count=1498 poi-ooxml-lite latest snapshot - XSB count=1220 no 5.0.0 XSBs are missing from snapshot. Is there anything else blocking an RC2? If not, I can see about creating
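
A count like the XSB figures quoted above could be reproduced with a small sketch along these lines. This is only an illustration, not the script actually used on the list: the class name `XsbCounter` is hypothetical, and it assumes an "XSB" is simply any jar entry whose name ends in `.xsb`.

```java
import java.io.IOException;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.zip.ZipFile;

// Hypothetical helper: counts the .xsb entries in a jar file.
public class XsbCounter {

    // A jar is a ZIP archive, so java.util.zip.ZipFile can list its entries.
    static long countXsb(Path jar) throws IOException {
        try (ZipFile zip = new ZipFile(jar.toFile())) {
            return zip.stream()
                      .filter(e -> !e.isDirectory())
                      // Assumption: an XSB is any entry named *.xsb
                      .filter(e -> e.getName().endsWith(".xsb"))
                      .count();
        }
    }

    public static void main(String[] args) throws IOException {
        // Each argument is the path to a jar, e.g. a poi-ooxml-lite jar.
        for (String arg : args) {
            System.out.println(arg + " - XSB count=" + countXsb(Paths.get(arg)));
        }
    }
}
```

Running it against both the 5.0.0 jar and a snapshot jar would print one count per file, which can then be compared by hand.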

Re: POI 5.1.0 RC2?

2021-10-21 Thread Dominik Stadler
Hi, The re-run with "full" is now available at http://people.apache.org/~centic/poi_regression/reports/index500RC2to510RC2.html 1089 failed files compared to 5.0.0, mostly XDDF, only a few others. Also 300 files that now fail with some "invalid ZIP", seems Apache Compress is a bit more "picky"

Re: POI 5.1.0 RC2?

2021-10-19 Thread Tim Allison
Apologies for being absent... The xsb issue is why we haven't upgraded to 5.x on Tika yet. I _think_ we'd like to avoid the ooxml-full jar, but if that's the most robust option, we'll have to go with that. I'm also happy to grab new files, or run against our corpus if that'd be of any use. Many

Re: POI 5.1.0 RC2?

2021-10-19 Thread Dominik Stadler
Hi, You are right, we discussed the reasons before, I am sorry. LG... Dominik. On Mon, Oct 18, 2021 at 9:06 PM Andreas Beeker wrote: > Hi Dominik, > > > Would be nice to know what changes made them required now as the > documents themselves did not change. > > I think you know, but to be

Re: POI 5.1.0 RC2?

2021-10-18 Thread PJ Fanning
A large fraction of the errors in http://people.apache.org/~centic/poi_regression/reports/index500RC2to510RC1.html relate to cloning sheets that contain charts - where the XDDF code fails to clone the chart. This is just an observation - I must admit to not knowing much about the XDDF code.

Re: POI 5.1.0 RC2?

2021-10-18 Thread Andreas Beeker
Hi Dominik, Would be nice to know what changes made them required now as the documents themselves did not change. I think you know, but to be safe, I mentioned it again. Originally only the classes were filtered, but I also added the .xsb filtering a while ago, to minimize the lite jar

Re: POI 5.1.0 RC2?

2021-10-18 Thread PJ Fanning
Looks like the poi-integration code was changed a few months ago so that the code that walks the document and loads XmlObjects tries harder to walk the graph. The svn commit 1888985 labelled with 'TIKA-3164 - add more .xsb and classes needed for TIKAs extractor' is the main change. I definitely think

Re: POI 5.1.0 RC2?

2021-10-18 Thread Dominik Stadler
Hi, See the report at http://people.apache.org/~centic/poi_regression/reports/index500RC2to510RC1.html , one of the files is http://people.apache.org/~centic/poi_regression/reports/bib-chernigovka.netdo.ru_download_docs_17459.doc (it's actually a docx, so is processed via XSSFFileHandler).

Re: POI 5.1.0 RC2?

2021-10-18 Thread PJ Fanning
Hi Dominik, Would you be able to provide one of the files that causes the missing xsb issues? I could try debugging to see if I can see why it fails. On Monday 18 October 2021, 12:12:02 IST, PJ Fanning wrote: Maybe POI 5.1.0 poi-ooxml can default to poi-ooxml-full as the dependency

Re: POI 5.1.0 RC2?

2021-10-18 Thread PJ Fanning
Maybe POI 5.1.0 poi-ooxml can default to poi-ooxml-full as the dependency and we could produce the lite jar with a health warning and documentation about how to use that instead if you choose to? After POI 5.1.0, we could see if the community needs the lite jar at all. Some users still try to

Re: POI 5.1.0 RC2?

2021-10-18 Thread Dominik Stadler
Hi, hm, would be fairly tedious to look for up to 107 documents and add all of them to the integration-tests. Would be nice to know what changes made them required now as the documents themselves did not change. For now I will trigger another mass-testing run with "full" sometime this week.

Re: POI 5.1.0 RC2?

2021-10-18 Thread PJ Fanning
Thanks Dominik for running the tests. I'm not sure what the best approach is but adding some of the files that cause the missing xsb issues to the poi-test-data dir might be a good starting point. On Monday 18 October 2021, 08:05:21 IST, Dominik Stadler wrote: FYI, I did re-run the

Re: POI 5.1.0 RC2?

2021-10-14 Thread Andreas Beeker
"Kept you waiting, huh?" (tm) -No problem for me ... XmlBeans RC2 looks good so far, therefore it can be only a matter of days. On 15.10.21 00:06, PJ Fanning wrote: With POI trunk, I've added a use case that relies on XMLBeans 5.0.2 release. Can we wait until XMLBeans 5.0.2 is released?

Re: POI 5.1.0 RC2?

2021-10-14 Thread PJ Fanning
With POI trunk, I've added a use case that relies on XMLBeans 5.0.2 release. Can we wait until XMLBeans 5.0.2 is released? Alternatively, I can remove the new POI code that uses XmlOptions setDisallowDocTypeDeclaration. On Thursday 14 October 2021, 22:52:10 IST, Andreas Beeker wrote:

POI 5.1.0 RC2?

2021-10-14 Thread Andreas Beeker
Hi Devs, there were heaps of changes since RC1 and it looks like PJ is working full-time on POI. When should I provide the RC2? Andi