[
https://issues.apache.org/jira/browse/TIKA-4084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Gregory Lepore updated TIKA-4084:
---------------------------------
Description:
The SquashFS format appears 1,025 times in the latest Common Crawl dataset. No
known mime type. The magic is 68737173 at offset 0 (ASCII hsqs).
File extension is .squashfs.
https://dr-emann.github.io/squashfs/squashfs.html
was:
The SquashFS format appears 1,025 times in the latest Common Crawl dataset. No
known mime type. The magic is 68737173 at offset 0 (ASCII hsqs).
> Add magic for SquashFS Format
> -----------------------------
>
> Key: TIKA-4084
> URL: https://issues.apache.org/jira/browse/TIKA-4084
> Project: Tika
> Issue Type: Sub-task
> Reporter: Gregory Lepore
> Priority: Minor
> Attachments:
> 1b20149ffd8d622112534cf0afa8b93e34eb34bdad5936452242da68a13313fa,
> 1d83edf1bd80d55b6265845254c2f11ef9aa4eca36514e53db8ef1c4d6e8a94e,
> 3fa5204cbe997675475dad8b9e289140c97523412052be5a67932605fd04bbbc
>
>
> The SquashFS format appears 1,025 times in the latest Common Crawl dataset.
> No known mime type. The magic is 68737173 at offset 0 (ASCII hsqs).
>
> File extension is .squashfs.
>
> https://dr-emann.github.io/squashfs/squashfs.html
>
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)