[ 
https://issues.apache.org/jira/browse/TIKA-3049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17041309#comment-17041309
 ] 

Tim Allison commented on TIKA-3049:
-----------------------------------

* A bunch of .patch files are identified as mbox
* pcl, pxl, jbig2, prn, -- need to add mime types...may be others

> Improve file detection...varia
> ------------------------------
>
>                 Key: TIKA-3049
>                 URL: https://issues.apache.org/jira/browse/TIKA-3049
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major
>         Attachments: ghostscript_attachment_details_and_tika.txt.gz
>
>
> I recently crawled a few bugzilla issue trackers to add files to our 
> regression corpus.  I noticed that bugzilla is able to identify the mime 
> types of a few file types that we're not, and that there are some areas for 
> improvements in mime types that we should be able to identify.
> I'm attaching a file from ghostscript's issue tracker: 
> https://bugs.ghostscript.com/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to