Regression results are here. I haven't had a chance to look. This compares Tika's trunk with poi 3.15-rc1 (? I think?) against 3.15-beta1 in Tika 1.13. Some differences might be changes at the Tika level.
I ran this against the full corpus so there are file formats we don't care about. https://github.com/tballison/share/blob/master/tika_comparisons/reports_tika_20160904_dev.zip