[
https://issues.apache.org/jira/browse/TIKA-2689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17351930#comment-17351930
]
Tim Allison commented on TIKA-2689:
-----------------------------------
I haven't looked at this at all in forever...
We now have some examples in our bugtracker corpus (prepend
https://corpora.tika.apache.org/base/docs/bug_trackers/)
./TIKA/TIKA-2689-0.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-689926-0.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-691957-0.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-686747-1.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-692503-0.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-692784-0.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-686747-2.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-691059-0.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-689028-1.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-691347-0.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-689874-0.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-694368-1.ai
./GHOSTSCRIPT/226943-694743/GHOSTSCRIPT-690772-0.ai
./GHOSTSCRIPT/694748-703060/GHOSTSCRIPT-694776-0.ai
./GHOSTSCRIPT/694748-703060/GHOSTSCRIPT-695012-0.ai
./GHOSTSCRIPT/694748-703060/GHOSTSCRIPT-698337-0.ai
./GHOSTSCRIPT/694748-703060/GHOSTSCRIPT-695845-0.ai
./PDFBOX/PDFBOX-1094-6.ai
./PDFBOX/PDFBOX-3385-0.ai
./OOO/68096-78664/OOO-72332-1.ai
./OOO/68096-78664/OOO-72332-7.ai
./OOO/68096-78664/OOO-72332-6.ai
./OOO/68096-78664/OOO-72332-0.ai
./MOZILLA/1062693-1110888/MOZILLA-1079240-0.ai
./MOZILLA/1062693-1110888/MOZILLA-1079240-1.ai
./MOZILLA/654106-716683/MOZILLA-675331-4.ai
./MOZILLA/654106-716683/MOZILLA-696716-0.ai
./MOZILLA/654106-716683/MOZILLA-685046-1.ai
./MOZILLA/654106-716683/MOZILLA-675331-5.ai
./MOZILLA/716687-822422/MOZILLA-759497-1.ai
./MOZILLA/1187239-1240516/MOZILLA-1237271-9.ai
./MOZILLA/916798-983978/MOZILLA-965980-10.ai
./MOZILLA/916798-983978/MOZILLA-951391-1.ai
./MOZILLA/916798-983978/MOZILLA-963734-2.ai
./MOZILLA/916798-983978/MOZILLA-970934-1.ai
./MOZILLA/822481-916766/MOZILLA-898659-8.ai
./MOZILLA/822481-916766/MOZILLA-847609-1.ai
./MOZILLA/822481-916766/MOZILLA-898659-10.ai
./MOZILLA/822481-916766/MOZILLA-898659-4.ai
./MOZILLA/822481-916766/MOZILLA-909959-12.ai
./MOZILLA/822481-916766/MOZILLA-909464-5.ai
./MOZILLA/822481-916766/MOZILLA-904141-9.ai
./MOZILLA/822481-916766/MOZILLA-898659-5.ai
./MOZILLA/822481-916766/MOZILLA-898659-3.ai
./MOZILLA/822481-916766/MOZILLA-898659-9.ai
./MOZILLA/822481-916766/MOZILLA-909959-13.ai
./MOZILLA/1035345-1062595/MOZILLA-1050393-3.ai
./MOZILLA/1035345-1062595/MOZILLA-1050393-1.ai
./MOZILLA/1240554-1312466/MOZILLA-1293671-1.ai
./MOZILLA/984033-1035142/MOZILLA-1011536-4.ai
./MOZILLA/984033-1035142/MOZILLA-1032897-2.ai
./MOZILLA/590175-653777/MOZILLA-627452-0.ai
./MOZILLA/590175-653777/MOZILLA-603607-15.ai
./MOZILLA/590175-653777/MOZILLA-603607-16.ai
./MOZILLA/590175-653777/MOZILLA-603607-14.ai
./MOZILLA/590175-653777/MOZILLA-603607-5.ai
./MOZILLA/590175-653777/MOZILLA-603607-33.ai
./MOZILLA/590175-653777/MOZILLA-603607-21.ai
./MOZILLA/590175-653777/MOZILLA-603607-11.ai
./MOZILLA/1327864-1406613/MOZILLA-1404648-2.ai
./MOZILLA/1111148-1187180/MOZILLA-1113760-0.ai
./MOZILLA/1111148-1187180/MOZILLA-1171457-5.ai
./MOZILLA/1111148-1187180/MOZILLA-1171457-6.ai
./MOZILLA/1111148-1187180/MOZILLA-1113760-4.ai
./MOZILLA/1111148-1187180/MOZILLA-1113760-7.ai
./MOZILLA/1111148-1187180/MOZILLA-1171457-4.ai
> *.ai type (Adobe illustrator ) files are not detected correctly.
> ----------------------------------------------------------------
>
> Key: TIKA-2689
> URL: https://issues.apache.org/jira/browse/TIKA-2689
> Project: Tika
> Issue Type: Bug
> Components: core
> Affects Versions: 1.16, 1.17, 1.18
> Reporter: Amit Pandey
> Priority: Major
> Attachments: example.ai
>
>
> There is in-consistency in detecting **ai* types files when using different
> overloaded detect method. When I am using _detect(String filename)_, it gives
> correct file type - "*application/illustrator*". If I use _detect(InputStream
> is, String filename)_ or _detect(File fileObj)_ - it gives file type
> "*application/pdf*".
> Here is sample code I used.
>
> [https://stackoverflow.com/questions/51359351/tika-detect-method-not-giving-same-exact-file-type|http://example.com/]
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)