[
https://issues.apache.org/jira/browse/TIKA-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15767939#comment-15767939
]
Pascal Essiembre commented on TIKA-1946:
----------------------------------------
So what would be the percentage that are parsed properly? Am I reading the
results right that the majority failed due to EOF exceptions? Is there a way
to confirm these are indeed all truncated files and whether at least some
content was extracted for them? Given how many EOF exceptions there are, I
wonder if in some cases it may be the error reported when encountering an
unsupported file versions?
> Add mime detection and parser for WordPerfect
> ---------------------------------------------
>
> Key: TIKA-1946
> URL: https://issues.apache.org/jira/browse/TIKA-1946
> Project: Tika
> Issue Type: Improvement
> Components: mime, parser
> Reporter: Nick C
> Fix For: 2.0, 1.15
>
>
> I noticed some code on github for parsing WordPerfect files
> (https://github.com/Norconex/importer) Also looks like the author
> [~pascal.essiembre] has contributed to Tika before
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)