[ 
https://issues.apache.org/jira/browse/TIKA-1122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13663092#comment-13663092
 ] 

Nick Burch commented on TIKA-1122:
----------------------------------

The mimetype detected for the file doesn't seem to match the one listed by the 
parser. Do you know which one is correct? And could you try changing the parser 
to claim to support the one detection gives, to see if that solves it?
                
> Tika fails to parse chm files
> -----------------------------
>
>                 Key: TIKA-1122
>                 URL: https://issues.apache.org/jira/browse/TIKA-1122
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.3
>            Reporter: Tejas Patil
>            Priority: Minor
>             Fix For: 1.4
>
>
> (reported by Jan Riewe over nutch user group, see 
> http://lucene.472066.n3.nabble.com/CHM-Files-and-Tika-td3999735.html)
> Nutch fails to parse chm files with
> ERROR tika.TikaParser - Can't retrieve Tika parser for mime-type 
> application/vnd.ms-htmlhelp
> Even after running tika-app in standalone manner (ie. not via nutch), I could 
> see not even a single chm file being parsed (I tried with 10-15 different chm 
> files of variable sizes).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to