On Fri, 30 Jul 2010, Bracken, Patrick wrote:
Hi, I am attempting to use Tika to extract content from .doc files for search indexing purposes. I have run into some exceptions thrown when looking at documents mad in an old version of word. Is there any plan to add support for this or a way to get around it?
See TIKA-408 <https://issues.apache.org/jira/browse/TIKA-408> for details. Support is now in POI, and Tika will handle Word 7 and Word 95 documents once the next POI beta release is out. That'll hopefully be within the next fortnight
Nick
