[ 
https://issues.apache.org/jira/browse/TIKA-4084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gregory Lepore updated TIKA-4084:
---------------------------------
    Description: 
The SquashFS format appears 1,025 times in the latest Common Crawl dataset. No 
known mime type. The magic is 68737173 at offset 0 (ASCII hsqs). 

 

File extension is .squashfs.

 

https://dr-emann.github.io/squashfs/squashfs.html

 

 

  was:
The SquashFS format appears 1,025 times in the latest Common Crawl dataset. No 
known mime type. The magic is 68737173 at offset 0 (ASCII hsqs). 

 

 


> Add magic for SquashFS Format
> -----------------------------
>
>                 Key: TIKA-4084
>                 URL: https://issues.apache.org/jira/browse/TIKA-4084
>             Project: Tika
>          Issue Type: Sub-task
>            Reporter: Gregory Lepore
>            Priority: Minor
>         Attachments: 
> 1b20149ffd8d622112534cf0afa8b93e34eb34bdad5936452242da68a13313fa, 
> 1d83edf1bd80d55b6265845254c2f11ef9aa4eca36514e53db8ef1c4d6e8a94e, 
> 3fa5204cbe997675475dad8b9e289140c97523412052be5a67932605fd04bbbc
>
>
> The SquashFS format appears 1,025 times in the latest Common Crawl dataset. 
> No known mime type. The magic is 68737173 at offset 0 (ASCII hsqs). 
>  
> File extension is .squashfs.
>  
> https://dr-emann.github.io/squashfs/squashfs.html
>  
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to