[ 
https://issues.apache.org/jira/browse/TIKA-3624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Carina Antunes updated TIKA-3624:
---------------------------------
    Description: 
In API unpack/all, metadata name keys are missing from _{_}METADATA{_}_.

Expected output (eg in 1.27):
{code:java}
$ more __METADATA__{code}
{code:java}
"X-Parsed-By","org.apache.tika.parser.DefaultParser","org.apache.tika.parser.csv.TextAndCSVParser"
"Content-Encoding","ISO-8859-1"
"Content-Type","text/plain; charset=ISO-8859-1"
{code}
 

Output in 2.0.0:
{code:java}
$ more __METADATA__{code}
{code:java}
org.apache.tika.parser.DefaultParser,org.apache.tika.parser.csv.TextAndCSVParser
ISO-8859-1
text/plain; charset=ISO-8859-1
{code}
 

 

  was:
In API unpack/all, metadata name keys are missing from __METADATA__.

Expected output (eg in 1.27)

 
{code:java}
$ more __METADATA__{code}
 
{code:java}
"X-Parsed-By","org.apache.tika.parser.DefaultParser","org.apache.tika.parser.csv.TextAndCSVParser"
"Content-Encoding","ISO-8859-1"
"Content-Type","text/plain; charset=ISO-8859-1"
{code}
 

Output in 2.0.0

 
{code:java}
$ more __METADATA__{code}
 

 
{code:java}
org.apache.tika.parser.DefaultParser,org.apache.tika.parser.csv.TextAndCSVParser
ISO-8859-1
text/plain; charset=ISO-8859-1
{code}
 

 


> Version 2.0.0 forward breaks metadata in unpack/all (From 1.27)
> ---------------------------------------------------------------
>
>                 Key: TIKA-3624
>                 URL: https://issues.apache.org/jira/browse/TIKA-3624
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 2.0.0, 2.1.0, 2.2.0
>            Reporter: Carina Antunes
>            Priority: Major
>
> In API unpack/all, metadata name keys are missing from _{_}METADATA{_}_.
> Expected output (eg in 1.27):
> {code:java}
> $ more __METADATA__{code}
> {code:java}
> "X-Parsed-By","org.apache.tika.parser.DefaultParser","org.apache.tika.parser.csv.TextAndCSVParser"
> "Content-Encoding","ISO-8859-1"
> "Content-Type","text/plain; charset=ISO-8859-1"
> {code}
>  
> Output in 2.0.0:
> {code:java}
> $ more __METADATA__{code}
> {code:java}
> org.apache.tika.parser.DefaultParser,org.apache.tika.parser.csv.TextAndCSVParser
> ISO-8859-1
> text/plain; charset=ISO-8859-1
> {code}
>  
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to