Andreas Meier created TIKA-2576:

             Summary: Add application/zstd detection and parser
                 Key: TIKA-2576
             Project: Tika
          Issue Type: Improvement
          Components: detector, parser
            Reporter: Andreas Meier
         Attachments: huffman-compressed-larger, 

The IETF is currently checking the specification of Zstandard compression and 
the application/zstd Media Type: 

As soon as the MediaType application/zstd is set as standard the Media Type 
shall be implemented.

Possible mime-detection for tika-mimetypes.xml (second comment has to be 
changed when the standard is final):

  <mime-type type="application/zstd">
    <magic priority="50">
      <match value="0xFD2FB528" type="little32" offset="0"/>
    <glob pattern="*.zstd"/>

commons-compress version 1.16 and later provide a compressor and decompressor 
for the algorithm, based on com.github.luben zstd-jni 

Attached sampe zstd file (huffman-compressed-larger) and the result after 
decompressing it.

Decompression was done with commons-compress 1.16.1 and zstd-jni 1.3.3-3





This message was sent by Atlassian JIRA

Reply via email to