On Thu, 8 Mar 2012, Harry Simons wrote:
I tried the BFF Validator, and it is indeed failing!

If you're able to share the error log, that could be helpful

However, the file got created by MS Word only, and I doubt if it's 'corrupt'... since both MS Word and LibreOffice can load it fine without any errors or even warnings of any kind -- everything seems to be normal with these apps. I can even use LibreOffice 3.5 to convert it to pdf or to a .zip of xml's.

If you load it in word, and do a save-as, does the new .doc file show the same problem?

Do you/others still feel it could be addressed by a POI upgrade?

You could try with the Tika 1.1 release candidate, that has the latest POI release in it. You could also try dropping in a recent POI nightly build to see if that helps - Tika will upgrade shortly to POI 3.8 beta 6 once that's out


Also, I thought Tika uses POI and would be using POI as a .jar. But looking in Tika sources, I could find only *POI*.java files but no *POI*.jar or *poi*.jar file(s).

Depends how you use Tika. The Tika-App inlines all the dependencies, the Tika OSGi Bundle has them individually as jars in the bundle, or Maven will download them for you

Nick

Reply via email to