[ 
https://issues.apache.org/jira/browse/TIKA-4187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17813965#comment-17813965
 ] 

Gregory Lepore commented on TIKA-4187:
--------------------------------------

Tim - Thanks for adding this! Apologies for a possibly dumb question, I've 
recompiled the latest version of Tika from Git, and while I'm getting the 
appropriate mime-type being identified for this (and for the ARC format), I'm 
not getting any Parsers analyzing the files. Am I jumping the gun or missing 
something?

 

Content-Length: 120649774
Content-Type: application/x-internet-archive
X-TIKA:Parsed-By: org.apache.tika.parser.EmptyParser
X-TIKA:Parsed-By-Full-Set: org.apache.tika.parser.EmptyParser
X-TIKA:digest:MD5: 09a05f59c2210238c1c5ec983882f90a
X-TIKA:digest:SHA256: 
b9f1675bfec074e4646a384bf00b69657b1425e3ec7a1db2a5ad751bb57e8775
resourceName: NARA-PEOT-2004-20041015201504-00391-crawling007.archive.org.arc

> Add detection for geopackage
> ----------------------------
>
>                 Key: TIKA-4187
>                 URL: https://issues.apache.org/jira/browse/TIKA-4187
>             Project: Tika
>          Issue Type: New Feature
>            Reporter: Tim Allison
>            Priority: Minor
>             Fix For: 3.0.0
>
>
> GeoPackage is an open format that uses sqlite as a container file.
> Documentation is here: https://www.geopackage.org/
> As with other container formats (ole2, zip), we'd have to open the file with 
> sqlite and then run detection based on table names ... maybe? We need more 
> investigation on detection.
> As with illustrator and the PDFParser, we could run the detection in the 
> sqlite parser to get started.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to