Manali Shah created TIKA-1887:
---------------------------------

             Summary: Add new mimetype for file extensions .po 
                 Key: TIKA-1887
                 URL: https://issues.apache.org/jira/browse/TIKA-1887
             Project: Tika
          Issue Type: Improvement
          Components: core, mime
            Reporter: Manali Shah


Hi, 

While analyzing the Trec DD polar data, we came across files that were 
classified as octet-stream. 
On using content based algorithms such as BFA, BFCC  and FHT we were able to 
determine more magic bytes for certain files.

The GNU gettext toolset is used by programmers and translators at producing, 
updating and using translation files, mainly those PO files which are textual, 
editable files.
We suggest a new mimetype as text/po to be added to the existing mime 
repository of Tika.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to