[
https://issues.apache.org/jira/browse/TIKA-534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Burch resolved TIKA-534.
-----------------------------
Resolution: Fixed
Fix Version/s: 1.0
Fixed in r1082973 - we now skip over the tags we can't process
> MetadataException: Unsupported component id error parsing jpg
> -------------------------------------------------------------
>
> Key: TIKA-534
> URL: https://issues.apache.org/jira/browse/TIKA-534
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 0.8
> Environment: Tika revision 1023712
> Reporter: Geoff Jarrad
> Fix For: 1.0
>
> Attachments: Billy Boys II.JPG, dickson1.jpg, seibicke.jpg
>
>
> I seem to be getting repeated errors parsing JPEG images that load perfectly
> well in Firefox.
> The main problem seems to be an unsupported component ID, though the actual
> ID number varies
> from jpg to jpg. Example image to follow.
> Exception in thread "main" org.apache.tika.exception.TikaException: Can't
> read TIFF/JPEG metadata
> at
> org.apache.tika.parser.image.ImageMetadataExtractor.parse(ImageMetada
> taExtractor.java:91)
> at
> org.apache.tika.parser.image.ImageMetadataExtractor.parseJpeg(ImageMetadataExtractor.java:69)
> at org.apache.tika.parser.jpeg.JpegParser.parse(JpegParser.java:56)
> at
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
> at
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
> at
> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:137)
> at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:213)
> at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:73)
> Caused by: com.drew.metadata.MetadataException: Unsupported component id: 152
> at com.drew.metadata.jpeg.JpegComponent.getComponentName(Unknown
> Source)
> at
> com.drew.metadata.jpeg.JpegDescriptor.getComponentDataDescription(Unknown
> Source)
> at com.drew.metadata.jpeg.JpegDescriptor.getDescription(Unknown
> Source)
> at com.drew.metadata.Directory.getDescription(Unknown Source)
> at com.drew.metadata.Tag.getDescription(Unknown Source)
> at
> org.apache.tika.parser.image.ImageMetadataExtractor.parse(ImageMetadataExtractor.java:85)
> ... 7 more
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira