Tayseer Sabha created TIKA-3960:
-----------------------------------

             Summary: PGP encrypted files get detected as 
application/octet-stream
                 Key: TIKA-3960
                 URL: https://issues.apache.org/jira/browse/TIKA-3960
             Project: Tika
          Issue Type: Bug
          Components: detector
    Affects Versions: 2.6.0
            Reporter: Tayseer Sabha


We use Tika for detecting and validating uploaded files using their 
content/magic bytes and not only their names/extension.

Passing a PGP/GPG encrypted file to Tika.detect(InputStream stream) will always 
return application/octet-stream instead of application/pgp-encrypted defined in 
tika-mimetypes.xml

The issue occurs because the application/pgp-encrypted mime-type defined in 
tika-mimetypes.xml is lacking a magic match and only has <glob pattern="*.pgp"/>

I managed to fix the issue for us temporarily by adding 
application/pgp-encrypted including a magic match in our custom-mimetypes.xml 
file. I will create a Pull Request on Github with the fix to resolve the issue.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to