Fabrizio Fortino created OAK-9456:
-------------------------------------
Summary: Exclude all possible PSD mime types in default tika config
Key: OAK-9456
URL: https://issues.apache.org/jira/browse/OAK-9456
Project: Jackrabbit Oak
Issue Type: Bug
Components: indexing
Affects Versions: 1.40.0
Reporter: Fabrizio Fortino
Assignee: Fabrizio Fortino
The default tika config [0] excludes some mime types that are known to be
resource-intensive. One of these mime types is *image/x-psd* which should
exclude Adobe PSD file formats. As explained in this forum post [1], this file
format can have several mime-type values. We need to include all the variations
to avoid indexing slow-downs in these cases.
[0]
[https://github.com/apache/jackrabbit-oak/blob/trunk/oak-lucene/src/main/resources/org/apache/jackrabbit/oak/plugins/index/lucene/tika-config.xml]
[1]
https://www.forumming.com/question/18296/what-is-the-correct-mime-type-for-psd-photoshop
--
This message was sent by Atlassian Jira
(v8.3.4#803005)