[ 
https://issues.apache.org/jira/browse/TIKA-2836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16786555#comment-16786555
 ] 

Arnaud Dagnelies commented on TIKA-2836:
----------------------------------------

Looking at the file content, it does a bit look like a CSV file... Just with a 
txt extension.

 

That said, IMHO, I think too it would definitely make sense to give priority to 
the file extension rather than content type detection. 

> Tika core API 
> --------------
>
>                 Key: TIKA-2836
>                 URL: https://issues.apache.org/jira/browse/TIKA-2836
>             Project: Tika
>          Issue Type: Bug
>          Components: core
>    Affects Versions: 1.20
>         Environment: Linux
>            Reporter: chandra
>            Priority: Major
>         Attachments: csvtxt.txt
>
>
> Tika Core API identifying a txt file as text.csv, instead of text/plain when 
> we call tika.detect on the file



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to