[ 
https://issues.apache.org/jira/browse/TIKA-1773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14960599#comment-14960599
 ] 

Nick Burch commented on TIKA-1773:
----------------------------------

Ah, I think I've found the issue. Based on 
https://en.wikipedia.org/wiki/JPEG_2000#Metadata it looks like JP2 doesn't use 
regular EXIF to store the metadata. It looks like they invented a new XML based 
way to store it, neither EXIF nor XMP, which is "helpful" of them...

We'll need someone to work out what library can read the metadata (maybe 
[~rgauss] will have an idea?), then write a parser that calls out to that + 
applies the mapping onto Tika's standard metadata for it

> No XML Metadata output for JP2 files
> ------------------------------------
>
>                 Key: TIKA-1773
>                 URL: https://issues.apache.org/jira/browse/TIKA-1773
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.8, 1.9, 1.10
>            Reporter: Andreas Hirtzel
>         Attachments: testJPEG.jp2
>
>
> Hi,
> Tika doesn't return output for JPEG2000 (.jp2) files in xhtml format. We're 
> using tika libraries in our application and get only empty html output for 
> this file type. If you open a jp2 file with the gui and switch to structured 
> text view, you don't get any results. There is no exception.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to