[
https://issues.apache.org/jira/browse/TIKA-2187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15710370#comment-15710370
]
Luis Filipe Nassif commented on TIKA-2187:
------------------------------------------
Thank you [[email protected]] for making it configurable!!!
> Align default behavior of experimental docx parser with that of doc parser in
> handling delText
> ----------------------------------------------------------------------------------------------
>
> Key: TIKA-2187
> URL: https://issues.apache.org/jira/browse/TIKA-2187
> Project: Tika
> Issue Type: Improvement
> Reporter: Tim Allison
> Priority: Minor
> Fix For: 2.0, 1.15
>
>
> Now that we can ignore delText via the experimental alternate SAXParser for
> .docx files, let's make that the default behavior to align with the expected
> behavior for our .doc parser (ignore deleted text).
> Let's also add the ability to include deleted text from .doc files.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)