[
https://issues.apache.org/jira/browse/TIKA-3738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17528312#comment-17528312
]
Tim Allison commented on TIKA-3738:
-----------------------------------
Another option would be to use the newer PipesParser, which is based on the
ForkParser. I can document how to use that instead of the ForkParser if you
have any interest.
> ForkParser missing metadata for some document formats
> -----------------------------------------------------
>
> Key: TIKA-3738
> URL: https://issues.apache.org/jira/browse/TIKA-3738
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 2.3.0
> Environment: Java 11.0.14.
> Reporter: Stephen H
> Priority: Major
> Attachments: ForkParserIntegrationTest.java.diff,
> testVideoMetadataMp4.mp4
>
>
> When using ForkParser, metadata from some parsers is not being returned in
> the Metadata object or in the head of the returned XML. These include
> OpenDocument Presentation (ODP), OpenDocument Spreadsheet (ODS), Microsoft
> Word 2006 XML, MP4 Audio (M4A) and MP4 Video (MP4).
> Patch for ForkParserIntegrationTest showing the issue for these file types is
> attached, along with an MP4 video file containing metadata as there doesn't
> appear to be one currently in the test set.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)