I just finished the run. This compares 3.15-beta1-rc1 with 3.13-final (what we had in Tika 1.12).
Results are here: http://162.242.228.174/reports/reports_poi_3_15-beta1-rc1.tar.bz2 I've only had a chance to look at the results briefly...it looks like no major issues. We're getting quite a bit more content out of ppt and pptx! Overall, we're extracting ~500k more common English words. There are a few files that will require further investigation on content loss...some of that may be Tika's fault. See contents/content_diffs_with_exceptions.xlsx. I found if I subtracted commonA from commonB and then sorted, that was a good way to find docs that lost content. -----Original Message----- From: kiwiwings [mailto:kiwiwi...@apache.org] Sent: Friday, April 08, 2016 11:41 AM To: dev@poi.apache.org Subject: Re: [VOTE] Apache POI 3.15-beta1 release (RC1) Hi All, Is it ok, to create RC2 from the trunk? If not, I'll apply Tims changes to the 3.15-beta1 tag. Andi -- View this message in context: http://apache-poi.1045710.n5.nabble.com/VOTE-Apache-POI-3-15-beta1-release-RC1-tp5722647p5722677.html Sent from the POI - Dev mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org For additional commands, e-mail: dev-h...@poi.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org For additional commands, e-mail: dev-h...@poi.apache.org