[
https://issues.apache.org/jira/browse/TIKA-4314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17884593#comment-17884593
]
Leszek Sliwko edited comment on TIKA-4314 at 9/25/24 12:36 PM:
---------------------------------------------------------------
I have another example with the attached files - both contain geolocation data
that is correctly detected by ExifTool.
However, because parsing only occurs in
{{{}org.apache.tika.parser.gdal.GDALParser{}}}, the metadata is lost (the GDAL
parser does not extract geolocation). I also recall this working fine in 2019
(probably because the GDAL parser was not implemented at that time).
was (Author: JIRAUSER282927):
I have another example with the attached files - both contain geolocation data
that is correctly detected by ExifTool.
However, because parsing only occurs in
{{{}org.apache.tika.parser.gdal.GDALParser{}}}, the metadata is lost (the GDAL
parser does not extract geolocation). I also recall this working fine in 2019.
> CompositeParser returns only one parser per content type
> --------------------------------------------------------
>
> Key: TIKA-4314
> URL: https://issues.apache.org/jira/browse/TIKA-4314
> Project: Tika
> Issue Type: Bug
> Components: core
> Affects Versions: 2.9.2
> Reporter: Leszek Sliwko
> Priority: Major
> Attachments: duration-test-2.avi, geolocation-test-1.jpg,
> geolocation-test-2.jpg
>
>
> External parsers can have many supported content types, but information is
> lost in CompositeParser:
>
> public Map<MediaType, Parser> getParsers(ParseContext context) {
> Map<MediaType, Parser> map = new HashMap<>();
> for (Parser parser : parsers) {
> for (MediaType type : parser.getSupportedTypes(context))
> { map.put(registry.normalize(type), parser); }
> }
> return map;
> }
>
> To recreate - parse any avi file (content type: video/x-msvideo), Only the
> exiftool will by picked up and the ffmpeg parser won't be executed.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)