[ 
https://issues.apache.org/jira/browse/IMAGING-112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13800967#comment-13800967
 ] 

Damjan Jovanovic commented on IMAGING-112:
------------------------------------------

Hmm, that is the documented behaviour in the Javadoc:
     Reads a Jpeg image, removes all IPTC data from the App13 segment but
     leaves the other data in that segment (if present) unchanged and writes
     the result to a stream.

However it is inconsistent with what ExifRewriter does:
     Reads a Jpeg image, removes all EXIF metadata (by removing the APP1
     segment), and writes the result to a stream.

I guess the problem is that the App13 segment can contain other data in 
addition to IPTC so that's why it isn't removed completely like EXIF's App1 
segment? Maybe we should have an additional method to remove the entire segment?


> JpegIptcRewriter.removeIPTC() does not remove all metadata
> ----------------------------------------------------------
>
>                 Key: IMAGING-112
>                 URL: https://issues.apache.org/jira/browse/IMAGING-112
>             Project: Commons Imaging
>          Issue Type: Bug
>    Affects Versions: 1.0
>            Reporter: Xavier Dury
>         Attachments: ImageTest.java
>
>
> I'm trying to compute hashes from my images to detect duplicates.
> To do so, I strip every metadata (like keywords I may have added) then 
> compute a CRC32.
> My problem is that JpegIptcRewriter.removeIPTC() does not remove all metadata 
> as there is always a difference of 18 bytes (containing "Photoshop 3.0").
> So could it be possible to remove those 18 bytes as well in removeIPTC() so 
> that I can compute hashes?
> Thanks,
> Xavier



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to