[ https://issues.apache.org/jira/browse/TIKA-3738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17528355#comment-17528355 ]

Stephen H commented on TIKA-3738:
---------------------------------

Many thanks, Tim. With that change, all our product tests now pass.

If PipesParser is a better alternative and more likely to be the focus of 
future development, then I'd be interested in going that route instead. 
However, I couldn't find any documentation or examples for it.

> ForkParser missing metadata for some document formats
> -----------------------------------------------------
>
>                 Key: TIKA-3738
>                 URL: https://issues.apache.org/jira/browse/TIKA-3738
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 2.3.0
>         Environment: Java 11.0.14.
>            Reporter: Stephen H
>            Priority: Major
>         Attachments: ForkParserIntegrationTest.java.diff, 
> testVideoMetadataMp4.mp4
>
>
> When using ForkParser, metadata from some parsers is not being returned in 
> the Metadata object or in the head of the returned XML. These include 
> OpenDocument Presentation (ODP), OpenDocument Spreadsheet (ODS), Microsoft 
> Word 2006 XML, MP4 Audio (M4A) and MP4 Video (MP4).
> Patch for ForkParserIntegrationTest showing the issue for these file types is 
> attached, along with an MP4 video file containing metadata as there doesn't 
> appear to be one currently in the test set.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)
