[
https://issues.apache.org/jira/browse/TIKA-3776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17542684#comment-17542684
]
Tim Allison commented on TIKA-3776:
-----------------------------------
I think we should overwrite the file length if we're measuring it ourselves.
If the incoming metadata object has a file length, and we run Files.size() and
come up with a different length, we should overwrite that. But, for file name,
y, don't overwrite if there's a name already there.
[~nick], what do you think?
> HttpFetcher overwrites filename passed in
> -----------------------------------------
>
> Key: TIKA-3776
> URL: https://issues.apache.org/jira/browse/TIKA-3776
> Project: Tika
> Issue Type: Bug
> Reporter: Tom Brisland
> Assignee: Tim Allison
> Priority: Major
>
> The HttpFetcher spools file content to a temporary file with
> `TikaInputStream.get()` and passes through the existing metadata.
> TikaInputStream overwrites the filename with that of the temporary file.
> This means that passing the filename to the detect endpoint as a header is
> ignored and results from detection are incorrect.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)