[ https://issues.apache.org/jira/browse/TIKA-408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12856804#action_12856804 ]
Jukka Zitting commented on TIKA-408: ------------------------------------ This is a nice improvement, thanks! Would it be possible to use the org.textmining:tm-extractors:0.4 dependency instead of having our own copies of the textmining.org classes? Note that the 1.0 version available from http://code.google.com/p/text-mining/ is under LGPL, so we can't use it directly. Can you also attach the testWORD6.doc test document you're using? > Word 6.0/7.0 documents support in office parser > ----------------------------------------------- > > Key: TIKA-408 > URL: https://issues.apache.org/jira/browse/TIKA-408 > Project: Tika > Issue Type: Improvement > Components: parser > Affects Versions: 0.7 > Reporter: Dmitry Kuzmenko > Priority: Minor > Attachments: word6.patch.gz > > > Current office parser doesn't support old Word 6.0/7.0 documents. -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: https://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira