[ 
https://issues.apache.org/jira/browse/TIKA-2689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17354400#comment-17354400
 ] 

launchpad commented on TIKA-2689:
---------------------------------

I would be in favour of an abstract approach, wherein as a consumer of the 
library I could still use the same api tika.detect(InputStream). I am assuming 
if its a part of the PDFParser I would need to apply the PDFParser explicity. 
Whereas if its a detector it can just fall in the chain of detectors that I 
could configure to run through.

> *.ai type (Adobe illustrator ) files are not detected correctly.
> ----------------------------------------------------------------
>
>                 Key: TIKA-2689
>                 URL: https://issues.apache.org/jira/browse/TIKA-2689
>             Project: Tika
>          Issue Type: Bug
>          Components: core
>    Affects Versions: 1.16, 1.17, 1.18
>            Reporter: Amit Pandey
>            Priority: Major
>         Attachments: example.ai, screenshot-1.png
>
>
> There is in-consistency in detecting **ai* types files when using different 
> overloaded detect method. When I am using _detect(String filename)_, it gives 
> correct file type - "*application/illustrator*". If I use _detect(InputStream 
> is, String filename)_ or _detect(File fileObj)_ -  it gives file type 
> "*application/pdf*".
> Here is sample code I used.
>   
> [https://stackoverflow.com/questions/51359351/tika-detect-method-not-giving-same-exact-file-type|http://example.com/]
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to