Fabrizio Fortino created OAK-9456:
-------------------------------------

             Summary: Exclude all possible PSD mime types in default tika config
                 Key: OAK-9456
                 URL: https://issues.apache.org/jira/browse/OAK-9456
             Project: Jackrabbit Oak
          Issue Type: Bug
          Components: indexing
    Affects Versions: 1.40.0
            Reporter: Fabrizio Fortino
            Assignee: Fabrizio Fortino


The default tika config [0] excludes some mime types that are known to be 
resource-intensive. One of these mime types is *image/x-psd* which should 
exclude Adobe PSD file formats. As explained in this forum post [1], this file 
format can have several mime-type values. We need to include all the variations 
to avoid indexing slow-downs in these cases.

 

[0] 
[https://github.com/apache/jackrabbit-oak/blob/trunk/oak-lucene/src/main/resources/org/apache/jackrabbit/oak/plugins/index/lucene/tika-config.xml]

[1] 
https://www.forumming.com/question/18296/what-is-the-correct-mime-type-for-psd-photoshop



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to