[
https://issues.apache.org/jira/browse/TIKA-4361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17921819#comment-17921819
]
Hudson commented on TIKA-4361:
------------------------------
SUCCESS: Integrated in Jenkins build Tika ยป tika-main-jdk17 #620 (See
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk17/620/])
TIKA-4361 -- follow on fix (#2108) (github:
[https://github.com/apache/tika/commit/05db89db3d24c13ee8794999c18af3a0b9a094e5])
* (edit)
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/rtf/TextExtractor.java
> Rare RTF bug handling styles within an href in a malformed file
> ---------------------------------------------------------------
>
> Key: TIKA-4361
> URL: https://issues.apache.org/jira/browse/TIKA-4361
> Project: Tika
> Issue Type: Improvement
> Reporter: Tim Allison
> Assignee: Tim Allison
> Priority: Trivial
> Fix For: 4.0.0, 3.1.0
>
>
> If there's a <b> within a hyperlink, and there's an extra closing }, then the
> RTF parser closes the <b> without opening it, leading to a SAXException.
> In other parts of the code, we check to make sure that we're not in a
> hyperlink before we modify the styles. we should do that in this piece of
> code as well.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)