Gregory Lepore created TIKA-4058:
------------------------------------
Summary: Add file extension .rmd160 to tika-mimetypes.xml
Key: TIKA-4058
URL: https://issues.apache.org/jira/browse/TIKA-4058
Project: Tika
Issue Type: Sub-task
Reporter: Gregory Lepore
Attachments: WW-2.2_0.darwin_19.noarch.tbz2.rmd160,
WW-2.2_0.darwin_20.noarch.tbz2.rmd160, WW-2.2_0.darwin_21.noarch.tbz2.rmd160,
xpr-1.0.5_0.darwin_17.x86_64.tbz2.rmd160
The Common Crawl dataset contains thousands of hash files generated per the
Ripemd 160 hashing algorithm
([http://justsolve.archiveteam.org/wiki/RIPEMD-160).] These files are 512 bytes
long, have no magic, but end with a .rmd160 extension.
That extension is sufficiently unique to serve as a magic number (where it's
used) for these files.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)