[
https://issues.apache.org/jira/browse/TIKA-3624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carina Antunes updated TIKA-3624:
---------------------------------
Description:
In the unpack/all API, metadata name keys are missing from __METADATA__.
Expected output (e.g. in 1.27):
{code:java}
$ more __METADATA__{code}
{code:java}
"X-Parsed-By","org.apache.tika.parser.DefaultParser","org.apache.tika.parser.csv.TextAndCSVParser"
"Content-Encoding","ISO-8859-1"
"Content-Type","text/plain; charset=ISO-8859-1"
{code}
Output in 2.0.0:
{code:java}
$ more __METADATA__{code}
{code:java}
org.apache.tika.parser.DefaultParser,org.apache.tika.parser.csv.TextAndCSVParser
ISO-8859-1
text/plain; charset=ISO-8859-1
{code}
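To make the expected layout concrete, here is a minimal plain-Java sketch of the 1.27-style row format shown above: the metadata name comes first, followed by its values, with every field quoted. The class {{MetadataCsvSketch}} and its methods are hypothetical names used for illustration only. This is not Tika code and does not show how tika-server actually writes __METADATA__.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration class; not part of Apache Tika.
public class MetadataCsvSketch {

    // Quote one CSV field, doubling any embedded double quotes.
    public static String quote(String field) {
        return "\"" + field.replace("\"", "\"\"") + "\"";
    }

    // Build one row: the metadata name first, then each of its values.
    public static String toCsvRow(String name, List<String> values) {
        StringBuilder row = new StringBuilder(quote(name));
        for (String value : values) {
            row.append(',').append(quote(value));
        }
        return row.toString();
    }

    public static void main(String[] args) {
        // Sample data copied from the expected output in this issue.
        Map<String, List<String>> metadata = new LinkedHashMap<>();
        metadata.put("X-Parsed-By", Arrays.asList(
                "org.apache.tika.parser.DefaultParser",
                "org.apache.tika.parser.csv.TextAndCSVParser"));
        metadata.put("Content-Encoding", Arrays.asList("ISO-8859-1"));
        metadata.put("Content-Type", Arrays.asList("text/plain; charset=ISO-8859-1"));

        for (Map.Entry<String, List<String>> entry : metadata.entrySet()) {
            System.out.println(toCsvRow(entry.getKey(), entry.getValue()));
        }
    }
}
```

Dropping the leading {{quote(name)}} from each row would give rows that contain only the values, which is the shape of the 2.0.0 output reported here (that output also lacks the quoting).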
was:
In API unpack/all, metadata name keys are missing from __METADATA__.
Expected output (eg in 1.27)
{code:java}
$ more __METADATA__{code}
{code:java}
"X-Parsed-By","org.apache.tika.parser.DefaultParser","org.apache.tika.parser.csv.TextAndCSVParser"
"Content-Encoding","ISO-8859-1"
"Content-Type","text/plain; charset=ISO-8859-1"
{code}
Output in 2.0.0
{code:java}
$ more __METADATA__{code}
{code:java}
org.apache.tika.parser.DefaultParser,org.apache.tika.parser.csv.TextAndCSVParser
ISO-8859-1
text/plain; charset=ISO-8859-1
{code}
> Version 2.0.0 forward breaks metadata in unpack/all (From 1.27)
> ---------------------------------------------------------------
>
> Key: TIKA-3624
> URL: https://issues.apache.org/jira/browse/TIKA-3624
> Project: Tika
> Issue Type: Bug
> Affects Versions: 2.0.0, 2.1.0, 2.2.0
> Reporter: Carina Antunes
> Priority: Major
>
> In the unpack/all API, metadata name keys are missing from __METADATA__.
> Expected output (e.g. in 1.27):
> {code:java}
> $ more __METADATA__{code}
> {code:java}
> "X-Parsed-By","org.apache.tika.parser.DefaultParser","org.apache.tika.parser.csv.TextAndCSVParser"
> "Content-Encoding","ISO-8859-1"
> "Content-Type","text/plain; charset=ISO-8859-1"
> {code}
>
> Output in 2.0.0:
> {code:java}
> $ more __METADATA__{code}
> {code:java}
> org.apache.tika.parser.DefaultParser,org.apache.tika.parser.csv.TextAndCSVParser
> ISO-8859-1
> text/plain; charset=ISO-8859-1
> {code}
>
>