[ https://issues.apache.org/jira/browse/TIKA-3986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17700349#comment-17700349 ]
Tim Allison commented on TIKA-3986:
-----------------------------------
What...?
> JDBCEmitter should strip \u0000 for postgres varchar/strings
> ------------------------------------------------------------
>
> Key: TIKA-3986
> URL: https://issues.apache.org/jira/browse/TIKA-3986
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Minor
> Fix For: 2.7.1
>
> Attachments: Screenshot from 2023-03-14 15-32-34.png
>
>
> Postgres does not allow \u0000 in varchar fields. We should automatically
> replace this character if the jdbc string starts with "jdbc:postgresql". An
> alternate approach would be to add a metadata filter that runs a regex
> replace operation on metadata values.
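
A minimal sketch of the first approach described in the issue, offered for illustration only and not taken from the actual JDBCEmitter code. The class name PostgresNullCharStripper and the method clean are hypothetical, and removing the character outright (rather than substituting a placeholder) is an assumption based on the issue title.

```java
/**
 * Hypothetical helper, not part of Tika: removes \u0000 from string values
 * when the JDBC connection string targets PostgreSQL.
 */
public final class PostgresNullCharStripper {

    // Prefix named in the issue description.
    private static final String POSTGRES_PREFIX = "jdbc:postgresql";

    private PostgresNullCharStripper() {
    }

    /**
     * Returns the value with every \u0000 removed if the connection string
     * starts with "jdbc:postgresql"; otherwise returns the value unchanged.
     */
    public static String clean(String connectionString, String value) {
        if (connectionString == null || value == null) {
            return value;
        }
        if (!connectionString.startsWith(POSTGRES_PREFIX)) {
            return value;
        }
        // Assumption: strip the character entirely instead of replacing it.
        return value.replace("\u0000", "");
    }
}
```

Under these assumptions, the helper would be called on each string value just before it is bound to a PreparedStatement parameter, so values sent to other databases are left untouched.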