[ 
https://issues.apache.org/jira/browse/TIKA-2689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17351930#comment-17351930
 ] 

Tim Allison commented on TIKA-2689:
-----------------------------------

I haven't looked at this at all in forever...

We now have some examples in our bugtracker corpus (prepend 
https://corpora.tika.apache.org/base/docs/bug_trackers/)

./TIKA/TIKA-2689-0.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-689926-0.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-691957-0.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-686747-1.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-692503-0.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-692784-0.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-686747-2.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-691059-0.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-689028-1.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-691347-0.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-689874-0.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-694368-1.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-690772-0.ai
./GHOSTSCRIPT/694748-703060/GHOSTSCRIPT-694776-0.ai
./GHOSTSCRIPT/694748-703060/GHOSTSCRIPT-695012-0.ai
./GHOSTSCRIPT/694748-703060/GHOSTSCRIPT-698337-0.ai
./GHOSTSCRIPT/694748-703060/GHOSTSCRIPT-695845-0.ai
./PDFBOX/PDFBOX-1094-6.ai
./PDFBOX/PDFBOX-3385-0.ai
./OOO/68096-78664/OOO-72332-1.ai
./OOO/68096-78664/OOO-72332-7.ai
./OOO/68096-78664/OOO-72332-6.ai
./OOO/68096-78664/OOO-72332-0.ai
./MOZILLA/1062693-1110888/MOZILLA-1079240-0.ai
./MOZILLA/1062693-1110888/MOZILLA-1079240-1.ai
./MOZILLA/654106-716683/MOZILLA-675331-4.ai
./MOZILLA/654106-716683/MOZILLA-696716-0.ai
./MOZILLA/654106-716683/MOZILLA-685046-1.ai
./MOZILLA/654106-716683/MOZILLA-675331-5.ai
./MOZILLA/716687-822422/MOZILLA-759497-1.ai
./MOZILLA/1187239-1240516/MOZILLA-1237271-9.ai
./MOZILLA/916798-983978/MOZILLA-965980-10.ai
./MOZILLA/916798-983978/MOZILLA-951391-1.ai
./MOZILLA/916798-983978/MOZILLA-963734-2.ai
./MOZILLA/916798-983978/MOZILLA-970934-1.ai
./MOZILLA/822481-916766/MOZILLA-898659-8.ai
./MOZILLA/822481-916766/MOZILLA-847609-1.ai
./MOZILLA/822481-916766/MOZILLA-898659-10.ai
./MOZILLA/822481-916766/MOZILLA-898659-4.ai
./MOZILLA/822481-916766/MOZILLA-909959-12.ai
./MOZILLA/822481-916766/MOZILLA-909464-5.ai
./MOZILLA/822481-916766/MOZILLA-904141-9.ai
./MOZILLA/822481-916766/MOZILLA-898659-5.ai
./MOZILLA/822481-916766/MOZILLA-898659-3.ai
./MOZILLA/822481-916766/MOZILLA-898659-9.ai
./MOZILLA/822481-916766/MOZILLA-909959-13.ai
./MOZILLA/1035345-1062595/MOZILLA-1050393-3.ai
./MOZILLA/1035345-1062595/MOZILLA-1050393-1.ai
./MOZILLA/1240554-1312466/MOZILLA-1293671-1.ai
./MOZILLA/984033-1035142/MOZILLA-1011536-4.ai
./MOZILLA/984033-1035142/MOZILLA-1032897-2.ai
./MOZILLA/590175-653777/MOZILLA-627452-0.ai
./MOZILLA/590175-653777/MOZILLA-603607-15.ai
./MOZILLA/590175-653777/MOZILLA-603607-16.ai
./MOZILLA/590175-653777/MOZILLA-603607-14.ai
./MOZILLA/590175-653777/MOZILLA-603607-5.ai
./MOZILLA/590175-653777/MOZILLA-603607-33.ai
./MOZILLA/590175-653777/MOZILLA-603607-21.ai
./MOZILLA/590175-653777/MOZILLA-603607-11.ai
./MOZILLA/1327864-1406613/MOZILLA-1404648-2.ai
./MOZILLA/1111148-1187180/MOZILLA-1113760-0.ai
./MOZILLA/1111148-1187180/MOZILLA-1171457-5.ai
./MOZILLA/1111148-1187180/MOZILLA-1171457-6.ai
./MOZILLA/1111148-1187180/MOZILLA-1113760-4.ai
./MOZILLA/1111148-1187180/MOZILLA-1113760-7.ai
./MOZILLA/1111148-1187180/MOZILLA-1171457-4.ai

> *.ai type (Adobe illustrator ) files are not detected correctly.
> ----------------------------------------------------------------
>
>                 Key: TIKA-2689
>                 URL: https://issues.apache.org/jira/browse/TIKA-2689
>             Project: Tika
>          Issue Type: Bug
>          Components: core
>    Affects Versions: 1.16, 1.17, 1.18
>            Reporter: Amit Pandey
>            Priority: Major
>         Attachments: example.ai
>
>
> There is in-consistency in detecting **ai* types files when using different 
> overloaded detect method. When I am using _detect(String filename)_, it gives 
> correct file type - "*application/illustrator*". If I use _detect(InputStream 
> is, String filename)_ or _detect(File fileObj)_ -  it gives file type 
> "*application/pdf*".
> Here is sample code I used.
>   
> [https://stackoverflow.com/questions/51359351/tika-detect-method-not-giving-same-exact-file-type|http://example.com/]
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to