[
https://issues.apache.org/jira/browse/TIKA-3049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17041309#comment-17041309
]
Tim Allison commented on TIKA-3049:
-----------------------------------
* A bunch of .patch files are identified as mbox
* pcl, pxl, jbig2, prn, -- need to add mime types...may be others
> Improve file detection...varia
> ------------------------------
>
> Key: TIKA-3049
> URL: https://issues.apache.org/jira/browse/TIKA-3049
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Major
> Attachments: ghostscript_attachment_details_and_tika.txt.gz
>
>
> I recently crawled a few bugzilla issue trackers to add files to our
> regression corpus. I noticed that bugzilla is able to identify the mime
> types of a few file types that we're not, and that there are some areas for
> improvements in mime types that we should be able to identify.
> I'm attaching a file from ghostscript's issue tracker:
> https://bugs.ghostscript.com/
--
This message was sent by Atlassian Jira
(v8.3.4#803005)