[ 
https://issues.apache.org/jira/browse/TIKA-1538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14303125#comment-14303125
 ] 

Miguel edited comment on TIKA-1538 at 2/3/15 11:44 AM:
-------------------------------------------------------

I have tried it and the result is as you describe, Nick (i copy and paste the 
result below). Then i guess the problem is in the way i try to detect it (java 
code). Thank you very much for your answer, and sorry to have filed an issue 
which was not a bug.

java -jar tika-app-1.7.jar Product345037-000.jpg

<?xml version="1.0" encoding="UTF-8"?><html 
xmlns="http://www.w3.org/1999/xhtml";>
<head>
<meta name="Number of Components" content="3"/>
<meta name="Resolution Units" content="none"/>
<meta name="Image Height" content="851 pixels"/>
<meta name="Data Precision" content="8 bits"/>
<meta name="Content-Length" content="164404"/>
<meta name="tiff:BitsPerSample" content="8"/>
<meta name="Compression Type" content="Baseline"/>
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser"/>
<meta name="X-Parsed-By" content="org.apache.tika.parser.jpeg.JpegParser"/>
<meta name="Component 1" content="Y component: Quantization table 0, Sampling fa
ctors 2 horiz/2 vert"/>
<meta name="tiff:ImageLength" content="851"/>
<meta name="Component 2" content="Cb component: Quantization table 1, Sampling f
actors 1 horiz/1 vert"/>
<meta name="Component 3" content="Cr component: Quantization table 1, Sampling f
actors 1 horiz/1 vert"/>
<meta name="X Resolution" content="1 dot"/>
<meta name="tiff:ImageWidth" content="1280"/>
<meta name="Image Width" content="1280 pixels"/>
<meta name="Content-Type" content="image/jpeg"/>
<meta name="Y Resolution" content="1 dot"/>
<meta name="resourceName" content="Product345037-000.jpg"/>
<title/>
</head>
<body/></html>


was (Author: miguel_mitula):
I have tried it and the result is as you describe, Nick (i copy and paste the 
result below). Then i guess the problem is in the way i try to detect it (java 
code). Thank you very much for your answer, and sorry to have filed an issue 
which was not a bug.

java -jar tika-app-1.7.jar E:\bitacora\hadoop\imagenes\Product345037-000.jpg

<?xml version="1.0" encoding="UTF-8"?><html 
xmlns="http://www.w3.org/1999/xhtml";>
<head>
<meta name="Number of Components" content="3"/>
<meta name="Resolution Units" content="none"/>
<meta name="Image Height" content="851 pixels"/>
<meta name="Data Precision" content="8 bits"/>
<meta name="Content-Length" content="164404"/>
<meta name="tiff:BitsPerSample" content="8"/>
<meta name="Compression Type" content="Baseline"/>
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser"/>
<meta name="X-Parsed-By" content="org.apache.tika.parser.jpeg.JpegParser"/>
<meta name="Component 1" content="Y component: Quantization table 0, Sampling fa
ctors 2 horiz/2 vert"/>
<meta name="tiff:ImageLength" content="851"/>
<meta name="Component 2" content="Cb component: Quantization table 1, Sampling f
actors 1 horiz/1 vert"/>
<meta name="Component 3" content="Cr component: Quantization table 1, Sampling f
actors 1 horiz/1 vert"/>
<meta name="X Resolution" content="1 dot"/>
<meta name="tiff:ImageWidth" content="1280"/>
<meta name="Image Width" content="1280 pixels"/>
<meta name="Content-Type" content="image/jpeg"/>
<meta name="Y Resolution" content="1 dot"/>
<meta name="resourceName" content="Product345037-000.jpg"/>
<title/>
</head>
<body/></html>

> Wrong mimetype detection
> ------------------------
>
>                 Key: TIKA-1538
>                 URL: https://issues.apache.org/jira/browse/TIKA-1538
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.7
>            Reporter: Miguel
>         Attachments: Product345037-000.jpg
>
>
> [SCENARIO]
> - Working on a "supposed to be a valid JPEG file" (the file is attached to 
> this issue report), which is correctly detected and treated by a browser, 
> etc. (Detection works well for almost all other checked images).
> - Using tika-app-1.7.jar
> - Java code snippet:
> Tika tikaObject = new Tika();
> ...
> // image is a byte[] containing the JPEG file
> String contentTypeTika = tikaObject.detect( image );
> [RESULT]
> detected mimetype is "application/gzip" ("application/x-gzip" if using 
> tika-app-1.4.jar or tika-app-1.5.jar)
> [EXPECTED]
> "image/jpeg"



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to