I just finished the run.  This compares 3.15-beta1-rc1 with 3.13-final (what we 
had in Tika 1.12).

Results are here:
http://162.242.228.174/reports/reports_poi_3_15-beta1-rc1.tar.bz2

I've only had a chance to look at the results briefly, but I don't see any 
major issues.  We're getting quite a bit more content out of ppt and pptx!

Overall, we're extracting ~500k more common English words.  A few files will 
need further investigation for content loss; some of that may be Tika's 
fault.  See contents/content_diffs_with_exceptions.xlsx.  I found that 
subtracting commonA from commonB and then sorting on the difference was a 
good way to find docs that lost content.
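
For anyone who wants to script the subtract-and-sort step rather than do it 
in the spreadsheet, here is a minimal sketch.  It assumes the sheet has been 
exported to rows keyed by column name, that commonA holds the count for the 
older run (3.13-final) and commonB the count for the newer run 
(3.15-beta1-rc1), and that there is a column identifying the file.  The 
"file" key, the function name, and the sample rows are all made up for 
illustration; they are not taken from the report.

```python
def rank_by_content_change(rows, col_a="commonA", col_b="commonB"):
    """Return rows sorted by (col_b - col_a), most negative first.

    Assumption: col_a is the older run and col_b the newer one, so a
    negative delta means the newer run extracted fewer common words.
    """
    ranked = []
    for row in rows:
        delta = int(row[col_b]) - int(row[col_a])
        # Copy the row so the caller's data is left untouched.
        ranked.append({**row, "delta": delta})
    ranked.sort(key=lambda r: r["delta"])
    return ranked


# Hypothetical sample data, not real results from the run.
rows = [
    {"file": "a.ppt", "commonA": 120, "commonB": 450},
    {"file": "b.doc", "commonA": 300, "commonB": 20},
    {"file": "c.xlsx", "commonA": 75, "commonB": 75},
]

ranked = rank_by_content_change(rows)
for r in ranked:
    print(r["file"], r["delta"])
```

With this ordering the docs that lost the most content come first, and the 
docs that gained the most end up at the bottom of the list.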



-----Original Message-----
From: kiwiwings [mailto:kiwiwi...@apache.org] 
Sent: Friday, April 08, 2016 11:41 AM
To: dev@poi.apache.org
Subject: Re: [VOTE] Apache POI 3.15-beta1 release (RC1)

Hi All,

Is it OK to create RC2 from trunk?
If not, I'll apply Tim's changes to the 3.15-beta1 tag.

Andi



--
View this message in context: 
http://apache-poi.1045710.n5.nabble.com/VOTE-Apache-POI-3-15-beta1-release-RC1-tp5722647p5722677.html
Sent from the POI - Dev mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org


