Tim Allison created TIKA-4133:
---------------------------------
Summary: Add capture group metadataFilter
Key: TIKA-4133
URL: https://issues.apache.org/jira/browse/TIKA-4133
Project: Tika
Issue Type: Task
Reporter: Tim Allison
There are some cases where it would be useful to run a regex to capture
specific values in a metadata object.
For example, some users might not want the MIME type parameters (e.g. charset),
as in "text/html; charset=UTF-8", and would rather keep only "text/html".
Let's start with a simple regex capture group filter. If we need to capture
multiple matches, etc., we can add that in a later ticket.
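
To make the idea concrete, here is a rough sketch of what a single-capture-group
filter might do. This is not Tika's MetadataFilter API: the class name
CaptureGroupSketch, the filter method, and the use of a plain Map in place of a
Tika Metadata object are all assumptions for illustration. It only uses
java.util.regex from the JDK.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only; a plain Map stands in for a metadata object.
class CaptureGroupSketch {

    // If `pattern` is found in the value stored under `field`, replace that
    // value with the first capture group. Otherwise leave the value untouched.
    static void filter(Map<String, String> metadata, String field, Pattern pattern) {
        String value = metadata.get(field);
        if (value == null) {
            return; // nothing stored under this field
        }
        Matcher m = pattern.matcher(value);
        if (m.find() && m.groupCount() >= 1 && m.group(1) != null) {
            metadata.put(field, m.group(1));
        }
    }

    public static void main(String[] args) {
        Map<String, String> metadata = new LinkedHashMap<>();
        metadata.put("Content-Type", "text/html; charset=UTF-8");

        // Capture everything before the first ';' (the bare MIME type).
        Pattern mimeOnly = Pattern.compile("^([^;]+)");
        filter(metadata, "Content-Type", mimeOnly);

        System.out.println(metadata.get("Content-Type")); // prints "text/html"
    }
}
```

Leaving the value unchanged when the regex does not match is one possible
choice; a real filter could just as reasonably drop the field instead.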