[ 
https://issues.apache.org/jira/browse/TIKA-4244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kartik Jain updated TIKA-4244:
------------------------------
    Description: 
When tika-core detect(InputStream input, Metadata metadata) API is used to 
determimne the MIME type of an ics file, it returns media type `text/html`, 
rather it should've `text/calendar`.

For .ics files that have HTML content in them (additional attribute 
X-ALT-DESC;FMTTYPE=text/html). *tika-core* is returning the MIME type of such 
files as text/html, ideally, it should come up as text/calendar, but according 
to tika core text/html is not in the base types of text/calendar so it doesn't 
consider the text/calendar type, however for all ics files MIME type should be 
text/calendar

  was:
When tika-core detect(InputStream input, Metadata metadata) API is used to 
determimne the MIME type of an ics file, it returns media type `text/html`, 
rather it should've `text/calendar`.

For .ics files that have HTML content in them (additional attribute 
X-ALT-DESC;FMTTYPE=text/html). *{*}tika-core{*}* is returning the MIME type of 
such files as text/html, ideally, it should come up as text/calendar, but 
according to tika core text/html is not in the base types of text/calendar so 
it doesn't consider the text/calendar type, however for all ics files MIME type 
should be text/calendar


> Tika idenifies MIME type of ics files with html content as text/html
> --------------------------------------------------------------------
>
>                 Key: TIKA-4244
>                 URL: https://issues.apache.org/jira/browse/TIKA-4244
>             Project: Tika
>          Issue Type: Bug
>            Reporter: Kartik Jain
>            Priority: Major
>         Attachments: Sample.ics
>
>
> When tika-core detect(InputStream input, Metadata metadata) API is used to 
> determimne the MIME type of an ics file, it returns media type `text/html`, 
> rather it should've `text/calendar`.
> For .ics files that have HTML content in them (additional attribute 
> X-ALT-DESC;FMTTYPE=text/html). *tika-core* is returning the MIME type of such 
> files as text/html, ideally, it should come up as text/calendar, but 
> according to tika core text/html is not in the base types of text/calendar so 
> it doesn't consider the text/calendar type, however for all ics files MIME 
> type should be text/calendar



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to