[ 
https://issues.apache.org/jira/browse/TIKA-4004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17725978#comment-17725978
 ] 

Gregory Lepore commented on TIKA-4004:
--------------------------------------

I downloaded some of the original files referenced in your CSV file (so I 
didn't have to mess with extracting from the WARC) by going to the "live" URL. 
The only difference from my above magic is the files have 01 00 02 00 at offset 
8. Sample attached.

> font/otf application/vnd.ms-opentype
> ------------------------------------
>
>                 Key: TIKA-4004
>                 URL: https://issues.apache.org/jira/browse/TIKA-4004
>             Project: Tika
>          Issue Type: Sub-task
>            Reporter: Tim Allison
>            Priority: Major
>         Attachments: 000000.warc, aller-bold.eot, aller-light.eot, 
> fleurons.eot, index.html_id=45_and_type=eot, index.html_id=67_and_type=eot, 
> index.html_id=75_and_type=eot, index.html_id=77_and_type=eot, 
> index.html_id=80_and_type=eot, index.html_id=83_and_type=eot, 
> index.html_id=84_and_type=eot
>
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to