[ 
https://issues.apache.org/jira/browse/TIKA-3738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17528312#comment-17528312
 ] 

Tim Allison commented on TIKA-3738:
-----------------------------------

Another option would be to use the newer PipesParser, which is based on the 
ForkParser.  I can document how to use that instead of the ForkParser if you 
have any interest.


> ForkParser missing metadata for some document formats
> -----------------------------------------------------
>
>                 Key: TIKA-3738
>                 URL: https://issues.apache.org/jira/browse/TIKA-3738
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 2.3.0
>         Environment: Java 11.0.14.
>            Reporter: Stephen H
>            Priority: Major
>         Attachments: ForkParserIntegrationTest.java.diff, 
> testVideoMetadataMp4.mp4
>
>
> When using ForkParser, metadata from some parsers is not being returned in 
> the Metadata object or in the head of the returned XML. These include 
> OpenDocument Presentation (ODP), OpenDocument Spreadsheet (ODS), Microsoft 
> Word 2006 XML, MP4 Audio (M4A) and MP4 Video (MP4).
> Patch for ForkParserIntegrationTest showing the issue for these file types is 
> attached, along with an MP4 video file containing metadata as there doesn't 
> appear to be one currently in the test set.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

Reply via email to