[ 
https://issues.apache.org/jira/browse/TIKA-3445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17363054#comment-17363054
 ] 

Nick Burch commented on TIKA-3445:
----------------------------------

I think that's an email file, Tika thinks that's an email file, seems to be 
working as expected from my perspective!

Why do you think a file containing lots of emails shouldn't be counted as an 
email?

> Extension reading it as eml instead of txt when headers are not present
> -----------------------------------------------------------------------
>
>                 Key: TIKA-3445
>                 URL: https://issues.apache.org/jira/browse/TIKA-3445
>             Project: Tika
>          Issue Type: Bug
>          Components: core, detector, metadata, mime, parser
>    Affects Versions: 1.25, 1.26
>            Reporter: Vamsi Molli
>            Priority: Blocker
>             Fix For: 1.24.1
>
>         Attachments: test_sample_message (1).txt
>
>
> The attached txt file doesn't have starting headers it is treating as .eml 
> file but it should be .txt.
> stream = TikaInputStream.get(fis = new FileInputStream(paths));stream = 
> TikaInputStream.get(fis = new FileInputStream(paths)); 
> metadata.add(Metadata.RESOURCE_NAME_KEY, paths); MediaType mediaType = 
> detector.detect(stream, metadata);
> MediaType detect(InputStream input, Metadata metadata) throws IOException;



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to