[ 
https://issues.apache.org/jira/browse/TIKA-2047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15511046#comment-15511046
 ] 

Hudson commented on TIKA-2047:
------------------------------

FAILURE: Integrated in Jenkins build tika-2.x-windows #49 (See 
[https://builds.apache.org/job/tika-2.x-windows/49/])
 * Maintain passed-in mime in TXTParser (TIKA-2047). (tallison: rev 
32d9ece8d84986de240087a580e094de3f879f3c)
* (edit) CHANGES.txt
* (edit) 
tika-parser-modules/tika-parser-text-module/src/test/java/org/apache/tika/parser/txt/TXTParserTest.java
* (edit) 
tika-parser-modules/tika-parser-text-module/src/main/java/org/apache/tika/parser/txt/TXTParser.java


> TXTParser overwrites mime type/masks types that are subtype of text
> -------------------------------------------------------------------
>
>                 Key: TIKA-2047
>                 URL: https://issues.apache.org/jira/browse/TIKA-2047
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.13
>            Reporter: Tim Allison
>            Assignee: Tim Allison
>            Priority: Minor
>
> For vcal and other mime types that are subclasses of {{text/plain}}, the 
> TXTParser overwrites their mime type as "text/plain".  We should check to see 
> what mime has been sent in via the Metadata and add the charset to that, e.g. 
> "text/calendar; charset=ISO-8859-1"...right?
> {noformat}
>             Charset charset = reader.getCharset();
>             MediaType type = new MediaType(MediaType.TEXT_PLAIN, charset);
>             metadata.set(Metadata.CONTENT_TYPE, type.toString());
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to