[ https://issues.apache.org/jira/browse/TIKA-1842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jean Coudon updated TIKA-1842:
------------------------------
    Component/s:     (was: mime)
                 detector

> XML file detected as HTML
> -------------------------
>
>                 Key: TIKA-1842
>                 URL: https://issues.apache.org/jira/browse/TIKA-1842
>             Project: Tika
>          Issue Type: Bug
>          Components: detector
>    Affects Versions: 1.11
>            Reporter: Jean Coudon
>            Priority: Minor
>
> A file containing this XML:
> {code:xml}
> <note>
>     <to>Tove</to>
>     <from>Jani</from>
>     <heading>Reminder</heading>
>     <body>Don't forget me this weekend!</body>
> </note>
> {code}
> is detected as text/html instead of application/xml (or at worst text/plain) 
> when using this code sample:
> {code:java}
> Tika tika = new Tika();
> System.out.println(tika.detect(new File("src/test/resources/file.xml")));
> {code}
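
For reference, a minimal sketch (not part of the original report) that checks the sample above is well-formed XML whose root element is `note`, which is why application/xml is the expected result. It uses only the JDK's `javax.xml.parsers` API, not Tika, so it says nothing about how Tika's detector reaches its answer. The class name `NoteXmlCheck` and the method `rootElementName` are hypothetical names chosen for illustration.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;

public class NoteXmlCheck {
    // The sample document from the report: no XML declaration, no DOCTYPE.
    static final String SAMPLE =
        "<note>\n"
        + "    <to>Tove</to>\n"
        + "    <from>Jani</from>\n"
        + "    <heading>Reminder</heading>\n"
        + "    <body>Don't forget me this weekend!</body>\n"
        + "</note>\n";

    // Parses the text with the JDK XML parser and returns the root element name.
    // The parser throws an exception if the text is not well-formed XML.
    static String rootElementName(String xml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        Document doc = factory.newDocumentBuilder()
            .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
        return doc.getDocumentElement().getNodeName();
    }

    public static void main(String[] args) throws Exception {
        // Prints the root element name of the sample document.
        System.out.println(rootElementName(SAMPLE));
    }
}
```

The sample contains `<heading>` and `<body>` elements but no `<html>` root, so a strict XML parse accepts it as an ordinary XML document.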



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
