[
https://issues.apache.org/jira/browse/TIKA-774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann updated TIKA-774:
-----------------------------------
Fix Version/s: (was: 1.15)
1.16
> ExifTool Parser
> ---------------
>
> Key: TIKA-774
> URL: https://issues.apache.org/jira/browse/TIKA-774
> Project: Tika
> Issue Type: New Feature
> Components: parser
> Affects Versions: 1.0
> Environment: Requires be installed
> (http://www.sno.phy.queensu.ca/~phil/exiftool/)
> Reporter: Ray Gauss II
> Assignee: Chris A. Mattmann
> Labels: features, new-parser, newbie, patch
> Fix For: 1.16
>
> Attachments: testJPEG_IPTC_EXT.jpg,
> tika-core-exiftool-parser-patch.txt, tika-parsers-exiftool-parser-patch.txt
>
>
> Adds an external parser that calls ExifTool to extract extended metadata
> fields from images and other content types.
> In the core project:
> An ExifTool interface is added which contains Property objects that define
> the metadata fields available.
> An additional Property constructor for internalTextBag type.
> In the parsers project:
> An ExiftoolMetadataExtractor is added which does the work of calling ExifTool
> on the command line and mapping the response to tika metadata fields. This
> extractor could be called instead of or in addition to the existing
> ImageMetadataExtractor and JempboxExtractor under TiffParser and/or
> JpegParser but those have not been changed at this time.
> An ExiftoolParser is added which calls only the ExiftoolMetadataExtractor.
> An ExiftoolTikaMapper is added which is responsible for mapping the ExifTool
> metadata fields to existing tika and Drew Noakes metadata fields if enabled.
> An ElementRdfBagMetadataHandler is added for extracting multi-valued RDF Bag
> implementations in XML files.
> An ExifToolParserTest is added which tests several expected XMP and IPTC
> metadata values in testJPEG_IPTC_EXT.jpg.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)