[ 
https://issues.apache.org/jira/browse/TIKA-4133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17765752#comment-17765752
 ] 

ASF GitHub Bot commented on TIKA-4133:
--------------------------------------

tballison merged PR #1346:
URL: https://github.com/apache/tika/pull/1346




> Add capture group metadataFilter
> --------------------------------
>
>                 Key: TIKA-4133
>                 URL: https://issues.apache.org/jira/browse/TIKA-4133
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Trivial
>
> There are some cases where it would be useful to run a regex to capture 
> specific values in a metadata object.
> For example, some users might not want the mime attributes (e.g. charset) as 
> in "text/html; charset=UTF-8".
> Let's start with a simple regex capture group filter.  If we need to capture 
> multiple matches etc, we can add that on a later ticket.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to