[ https://issues.apache.org/jira/browse/TIKA-2047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15511203#comment-15511203 ]
Hudson commented on TIKA-2047: ------------------------------ SUCCESS: Integrated in Jenkins build Tika-trunk #1103 (See [https://builds.apache.org/job/Tika-trunk/1103/]) TIKA-2047 -- maintain mime info for mimes that are subtype of text/plain (tallison: rev 415381212291e843e9091f43f6db8c432eb02aa9) * (edit) tika-parsers/src/main/java/org/apache/tika/parser/txt/TXTParser.java * (edit) CHANGES.txt * (edit) tika-parsers/src/test/java/org/apache/tika/parser/txt/TXTParserTest.java > TXTParser overwrites mime type/masks types that are subtype of text > ------------------------------------------------------------------- > > Key: TIKA-2047 > URL: https://issues.apache.org/jira/browse/TIKA-2047 > Project: Tika > Issue Type: Bug > Components: parser > Affects Versions: 1.13 > Reporter: Tim Allison > Assignee: Tim Allison > Priority: Minor > > For vcal and other mime types that are subclasses of {{text/plain}}, the > TXTParser overwrites their mime type as "text/plain". We should check to see > what mime has been sent in via the Metadata and add the charset to that, e.g. > "text/calendar; charset=ISO-8859-1"...right? > {noformat} > Charset charset = reader.getCharset(); > MediaType type = new MediaType(MediaType.TEXT_PLAIN, charset); > metadata.set(Metadata.CONTENT_TYPE, type.toString()); > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)