[
https://issues.apache.org/jira/browse/TIKA-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18083612#comment-18083612
]
ASF GitHub Bot commented on TIKA-4673:
--------------------------------------
tballison closed pull request #2645: TIKA-4673
> Add a parser that's a hook for Jina Reader in 4.x
> -------------------------------------------------
>
> Key: TIKA-4673
> URL: https://issues.apache.org/jira/browse/TIKA-4673
> Project: Tika
> Issue Type: New Feature
> Reporter: Tim Allison
> Priority: Minor
>
> After adding the modern embedding and ocr options, we may want to add a
> parser that hooks Jina Reader for html and PDF cleaning in 4.x
--
This message was sent by Atlassian Jira
(v8.20.10#820010)