[
https://issues.apache.org/jira/browse/TIKA-1538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14303162#comment-14303162
]
Miguel commented on TIKA-1538:
------------------------------
I was working on the junit test, but Konstantin's comment could be definitive
on the issue.
I use a different way to get my byte[] from the file, so maybe the problem is
there:
InputStream is = new FileInputStream( new File( "E:\Product345037-000.jpg" ) );
byte[] image = org.apache.commons.io.IOUtils.toByteArray( is );
I guess i should close the issue (i have no experience filing bug issues).
Thanks again.
> Wrong mimetype detection
> ------------------------
>
> Key: TIKA-1538
> URL: https://issues.apache.org/jira/browse/TIKA-1538
> Project: Tika
> Issue Type: Bug
> Affects Versions: 1.7
> Reporter: Miguel
> Attachments: Product345037-000.jpg
>
>
> [SCENARIO]
> - Working on a "supposed to be a valid JPEG file" (the file is attached to
> this issue report), which is correctly detected and treated by a browser,
> etc. (Detection works well for almost all other checked images).
> - Using tika-app-1.7.jar
> - Java code snippet:
> Tika tikaObject = new Tika();
> ...
> // image is a byte[] containing the JPEG file
> String contentTypeTika = tikaObject.detect( image );
> [RESULT]
> detected mimetype is "application/gzip" ("application/x-gzip" if using
> tika-app-1.4.jar or tika-app-1.5.jar)
> [EXPECTED]
> "image/jpeg"
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)