[
https://issues.apache.org/jira/browse/TIKA-4133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17765752#comment-17765752
]
ASF GitHub Bot commented on TIKA-4133:
--------------------------------------
tballison merged PR #1346:
URL: https://github.com/apache/tika/pull/1346
> Add capture group metadataFilter
> --------------------------------
>
> Key: TIKA-4133
> URL: https://issues.apache.org/jira/browse/TIKA-4133
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Trivial
>
> There are some cases where it would be useful to run a regex to capture
> specific values in a metadata object.
> For example, some users might not want the mime attributes (e.g. charset) as
> in "text/html; charset=UTF-8".
> Let's start with a simple regex capture group filter. If we need to capture
> multiple matches etc, we can add that on a later ticket.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)