[ https://issues.apache.org/jira/browse/TIKA-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15937437#comment-15937437 ]
Nick Burch commented on TIKA-1772:
----------------------------------
Thanks for the test file! I've committed it, along with a similar version, and
a modified version of your unit test. Following some additional magic entries
inspired by reading the specs (thanks again for that link!), your file can now
be correctly detected even without the filename.
If you find any more VTT files we can't detect properly, please raise a new bug
/ re-open this one, and upload the problematic file so we can look further!
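
For readers wondering what content-based ("magic") detection of a VTT file can look like, here is a minimal sketch. It is not the magic entries that were committed to Tika and it does not use Tika's API. It only assumes the file signature described in the WebVTT spec: an optional UTF-8 BOM, the string "WEBVTT", then a space, tab, line terminator, or end of file. The class name VttSniffer, the method sniff, and the text/plain fallback are hypothetical names and choices made for illustration.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of Tika: sniffs the WebVTT signature from raw bytes.
class VttSniffer {
    private static final byte[] SIGNATURE = "WEBVTT".getBytes(StandardCharsets.US_ASCII);

    /** Returns "text/vtt" if the leading bytes look like a WebVTT file, else "text/plain". */
    static String sniff(byte[] data) {
        int offset = 0;
        // Skip an optional UTF-8 byte order mark (EF BB BF).
        if (data.length >= 3
                && (data[0] & 0xFF) == 0xEF
                && (data[1] & 0xFF) == 0xBB
                && (data[2] & 0xFF) == 0xBF) {
            offset = 3;
        }
        // The signature must be present in full.
        if (data.length - offset < SIGNATURE.length) {
            return "text/plain";
        }
        for (int i = 0; i < SIGNATURE.length; i++) {
            if (data[offset + i] != SIGNATURE[i]) {
                return "text/plain";
            }
        }
        // "WEBVTT" must be followed by end of file, a space, a tab, or a line terminator,
        // so that text such as "WEBVTTX" is not mistaken for a WebVTT header.
        int next = offset + SIGNATURE.length;
        if (next == data.length) {
            return "text/vtt";
        }
        byte b = data[next];
        boolean separator = b == ' ' || b == '\t' || b == '\n' || b == '\r';
        return separator ? "text/vtt" : "text/plain";
    }

    public static void main(String[] args) {
        byte[] sample = "WEBVTT\n\n00:00.000 --> 00:02.000\nHello\n"
                .getBytes(StandardCharsets.UTF_8);
        System.out.println(sniff(sample)); // prints text/vtt
    }
}
```

Because the check looks only at the leading bytes, it works without a filename, which is the case the reporter's file exercised.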
> Mimetype of VTT files
> ---------------------
>
> Key: TIKA-1772
> URL: https://issues.apache.org/jira/browse/TIKA-1772
> Project: Tika
> Issue Type: Improvement
> Reporter: Alexander Widera
> Priority: Minor
> Fix For: 1.11
>
> Attachments: TikaVtt.java, upc-video-subtitles-en.vtt
>
>
> Files with extension "vtt" are "WebVTT: The Web Video Text Tracks Format"
> files.
> The mimetype resolved by Tika is currently text/plain.
> The correct mimetype should be text/vtt.
> see: https://w3c.github.io/webvtt/