Tim Allison created TIKA-4673:
---------------------------------
Summary: Add a parser that's a hook for Jina Reader in 4.x
Key: TIKA-4673
URL: https://issues.apache.org/jira/browse/TIKA-4673
Project: Tika
Issue Type: New Feature
Reporter: Tim Allison

After adding the modern embedding and OCR options, we may want to add a parser that hooks into Jina Reader for HTML and PDF cleaning in 4.x.

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
