[
https://issues.apache.org/jira/browse/TIKA-3738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17528355#comment-17528355
]
Stephen H commented on TIKA-3738:
---------------------------------
Many thanks Tim. That's now got all our product tests passing.
If PipesParser is a better alternative and more likely to be the focus of
future development then I'd be interested in going that route instead. I
couldn't see any documentation or examples for it.
> ForkParser missing metadata for some document formats
> -----------------------------------------------------
>
> Key: TIKA-3738
> URL: https://issues.apache.org/jira/browse/TIKA-3738
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 2.3.0
> Environment: Java 11.0.14.
> Reporter: Stephen H
> Priority: Major
> Attachments: ForkParserIntegrationTest.java.diff,
> testVideoMetadataMp4.mp4
>
>
> When using ForkParser, metadata from some parsers is not being returned in
> the Metadata object or in the head of the returned XML. These include
> OpenDocument Presentation (ODP), OpenDocument Spreadsheet (ODS), Microsoft
> Word 2006 XML, MP4 Audio (M4A) and MP4 Video (MP4).
> Patch for ForkParserIntegrationTest showing the issue for these file types is
> attached, along with an MP4 video file containing metadata as there doesn't
> appear to be one currently in the test set.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)