[ https://issues.apache.org/jira/browse/TIKA-410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ali Oral updated TIKA-410: -------------------------- Attachment: test.doc tika_test.txt Attaching a test file and extracted text file of it. > textbox content extaction for word documents > -------------------------------------------- > > Key: TIKA-410 > URL: https://issues.apache.org/jira/browse/TIKA-410 > Project: Tika > Issue Type: Improvement > Components: parser > Environment: Windows Xp > Java 1.6.0_18 > Reporter: Ali Oral > Priority: Minor > Attachments: test.doc, tika_test.txt > > > It looks like Tika does not extract text from textbox compenent of word files. -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: https://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira