[
https://issues.apache.org/jira/browse/TIKA-1773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14960599#comment-14960599
]
Nick Burch commented on TIKA-1773:
----------------------------------
Ah, I think I've found the issue. Based on
https://en.wikipedia.org/wiki/JPEG_2000#Metadata it looks like JP2 doesn't use
regular EXIF to store the metadata. It looks like they invented a new XML based
way to store it, neither EXIF nor XMP, which is "helpful" of them...
We'll need someone to work out what library can read the metadata (maybe
[~rgauss] will have an idea?), then write a parser that calls out to that +
applies the mapping onto Tika's standard metadata for it
> No XML Metadata output for JP2 files
> ------------------------------------
>
> Key: TIKA-1773
> URL: https://issues.apache.org/jira/browse/TIKA-1773
> Project: Tika
> Issue Type: Bug
> Affects Versions: 1.8, 1.9, 1.10
> Reporter: Andreas Hirtzel
> Attachments: testJPEG.jp2
>
>
> Hi,
> Tika doesn't return output for JPEG2000 (.jp2) files in xhtml format. We're
> using tika libraries in our application and get only empty html output for
> this file type. If you open a jp2 file with the gui and switch to structured
> text view, you don't get any results. There is no exception.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)