I'm pretty sure we don't handle these at the moment. If I'm reading our tika-mimetypes.xml correctly, our mime detection for these files is based only on the file extension.
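To make the distinction concrete, here is a rough sketch in plain Java (not Tika code) of extension-only detection next to a content check. Everything in it is an assumption for illustration: the class and method names are made up, the one-entry extension table stands in for whatever tika-mimetypes.xml actually contains, and the content check assumes an AFP (MO:DCA) stream starts with a structured field whose first byte is 0x5A, followed by a two-byte length and an identifier beginning with 0xD3.

```java
import java.util.Locale;
import java.util.Map;

class AfpDetectSketch {
    // Hypothetical stand-in for the glob entries in tika-mimetypes.xml.
    static final Map<String, String> BY_EXTENSION =
            Map.of("afp", "application/vnd.ibm.modcap");

    // Extension-only detection: the file content is never inspected.
    static String detectByExtension(String fileName) {
        int dot = fileName.lastIndexOf('.');
        if (dot < 0 || dot == fileName.length() - 1) {
            return "application/octet-stream";
        }
        String ext = fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
        return BY_EXTENSION.getOrDefault(ext, "application/octet-stream");
    }

    // Content check (assumed layout): 0x5A, two length bytes, then an
    // identifier whose first byte is 0xD3.
    static boolean looksLikeModca(byte[] head) {
        return head.length >= 4
                && (head[0] & 0xFF) == 0x5A
                && (head[3] & 0xFF) == 0xD3;
    }

    public static void main(String[] args) {
        byte[] head = {0x5A, 0x00, 0x10, (byte) 0xD3, (byte) 0xA8, (byte) 0xA8};
        System.out.println(detectByExtension("statement.afp"));
        System.out.println(looksLikeModca(head));
    }
}
```

With extension-only detection, an AFP file that has been renamed or has no extension falls through to the generic type, which is why a content-based check would be a useful companion to any parser we integrate.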
If you can find an Apache-license-friendly parser that is fairly well supported, I think we'd be willing to consider integrating it. Please open a ticket on our JIRA to track this.

Possible links:
https://wiki.alfresco.com/wiki/AFP_Conversion

Maybe (shudder) copy and paste from org.apache.fop.afp.parser.MODCAParser?

Are you able to share test files with us?

-----Original Message-----
From: Amjad Hossain, Mohammad [mailto:[email protected]]
Sent: Wednesday, April 20, 2016 1:38 AM
To: [email protected] <[email protected]>
Subject: Extract afp file using apache tika

Hi All,

We have lots of AFP-formatted files that we need to search with Elasticsearch, which uses Apache Tika in its mapper attachment.

You can read more about AFP files at the following links:
https://en.wikipedia.org/wiki/Advanced_Function_Presentation
http://www.reviversoft.com/file-extensions/afp

Now *my question*: Is Apache Tika able to extract AFP file content? Or do you have any plan to support AFP files in the near future?

Thanks,
amjad
