Arjohn Kampman created IMAGING-174:
--------------------------------------
Summary: Support non-8BIM signatures in Photoshop segments
Key: IMAGING-174
URL: https://issues.apache.org/jira/browse/IMAGING-174
Project: Commons Imaging
Issue Type: Bug
Components: Format: JPEG
Reporter: Arjohn Kampman
The code in IptcParser.parseAllBlocks(...) requires that all blocks have an
"8BIM" signature. However, we're frequently finding jpeg files that have "PHUT"
signatures mixed in. Some sites also report "AgHg" and "DCSR" signatures, for
example: http://dev.exiv2.org/issues/800. Although the signature is not what
the code expects, the block's data layout is still the same as for 8BIM. Please
consider either parsing such blocks, or at least skip them with a warning.
Currently, the code throws an exception which prevents us from extracting any
of the other metadata. I'm attaching a sample image from the Enron Corpus that
has two of these PHUT resource blocks.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)