Manali Shah created TIKA-1887:
---------------------------------
Summary: Add new mimetype for file extensions .po
Key: TIKA-1887
URL: https://issues.apache.org/jira/browse/TIKA-1887
Project: Tika
Issue Type: Improvement
Components: core, mime
Reporter: Manali Shah
Hi,
While analyzing the TREC DD Polar data, we came across files that were
classified as application/octet-stream.
Using content-based algorithms such as BFA (byte frequency analysis), BFCC
(byte frequency cross-correlation) and FHT (file header/trailer analysis), we
were able to determine more magic bytes for some of these files.
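
The exact procedure used is not spelled out here. As a rough illustration
only, assuming BFA means a per-file histogram of byte values normalized by the
most frequent byte, a fingerprint could be computed as sketched below. The
class and method names are hypothetical and are not part of Tika.

```java
public class ByteFrequencySketch {

    /**
     * Hypothetical helper: returns a 256-entry fingerprint in which each
     * entry is the count of that byte value divided by the count of the
     * most frequent byte value, so the largest entry is 1.0.
     */
    public static double[] fingerprint(byte[] data) {
        long[] counts = new long[256];
        for (byte b : data) {
            counts[b & 0xFF]++; // mask to treat the byte as unsigned
        }

        long max = 0;
        for (long c : counts) {
            max = Math.max(max, c);
        }

        double[] result = new double[256];
        if (max == 0) {
            return result; // empty input: all-zero fingerprint
        }
        for (int i = 0; i < 256; i++) {
            result[i] = (double) counts[i] / max;
        }
        return result;
    }
}
```

Fingerprints of many files of the same type could then be averaged and
compared against an unknown file; that comparison step is left out of this
sketch.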
The GNU gettext toolset is used by programmers and translators to produce,
update and use translation files, mainly PO files, which are textual,
editable files.
We suggest adding a new mimetype, text/po, to Tika's existing mime
repository.
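
The magic bytes we found are not listed in this message. As an illustration
only, assuming a PO file begins with optional blank and "#" comment lines
followed by a msgid entry (the layout gettext uses for the header entry), a
content check could be sketched as below. The class and method names are
hypothetical, and this is not how Tika itself matches magic.

```java
import java.nio.charset.StandardCharsets;

public class PoSniffSketch {

    /**
     * Hypothetical heuristic: returns true if the first line that is
     * neither blank nor a '#' comment starts with "msgid ".
     * 'head' is expected to hold the first few hundred bytes of the file.
     */
    public static boolean looksLikePo(byte[] head) {
        // ISO-8859-1 maps every byte to a char, so decoding never fails.
        String text = new String(head, StandardCharsets.ISO_8859_1);
        for (String line : text.split("\r?\n")) {
            String trimmed = line.trim();
            if (trimmed.isEmpty() || trimmed.startsWith("#")) {
                continue; // skip blank lines and translator comments
            }
            return trimmed.startsWith("msgid ");
        }
        return false; // only comments or blanks seen, or empty input
    }
}
```

A check like this is deliberately loose: it does not handle a leading
byte-order mark or files that open with a msgctxt line, so it would need
tightening before being used as a real detection rule.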