[ https://issues.apache.org/jira/browse/TIKA-408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12856819#action_12856819 ]
Dmitry Kuzmenko commented on TIKA-408:
--------------------------------------

Our code was taken and expanded from the Apache Nutch project, http://lucene.apache.org/nutch/. We've checked, and the code seems very similar to the text-mining library :)
Using the text-mining library is a good idea. We will look into this in the future, but currently we have no resources to assign to this work.

> Word 6.0/7.0 documents support in office parser
> -----------------------------------------------
>
>                 Key: TIKA-408
>                 URL: https://issues.apache.org/jira/browse/TIKA-408
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.7
>            Reporter: Dmitry Kuzmenko
>            Priority: Minor
>         Attachments: testWORD6.doc, word6.patch.gz
>
>
> The current office parser doesn't support old Word 6.0/7.0 documents.