[
https://issues.apache.org/jira/browse/TIKA-2107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15542361#comment-15542361
]
Tim Allison commented on TIKA-2107:
-----------------------------------
Are you getting an exception or just no text extracted?
I'm getting:
{noformat}
[
{
"Content-Type": "application/msword2",
"X-Parsed-By": "org.apache.tika.parser.EmptyParser",
"X-TIKA:digest:MD5": "48fb40b1203a999e4691af4e26c368de",
"X-TIKA:digest:SHA256":
"dd7487f6df798f22bbfc1e49cca82d0c0bcdde0eb9a843cae998800cb6f4770e",
"X-TIKA:parse_time_millis": "6"
}
]
{noformat}
> Old MS Word files give error while indexing
> -------------------------------------------
>
> Key: TIKA-2107
> URL: https://issues.apache.org/jira/browse/TIKA-2107
> Project: Tika
> Issue Type: Bug
> Components: tika-batch
> Affects Versions: 2.0
> Environment: ubuntu
> Reporter: Gaurav
> Labels: patch
> Attachments: plen281.doc
>
>
> error while indexing old MS word files
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)