[ 
https://issues.apache.org/jira/browse/TIKA-1513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15300553#comment-15300553
 ] 

Nick C commented on TIKA-1513:
------------------------------

I wasn't able to find a way to detect the dbt files. I did find some 
example/test dbf/dbt files here: 
http://www.clicketyclick.dk/databases/xbase/index.shtml.en 
There were also some non-0x03 files in common crawl (000075371.dbf, 
000543045.dbf, 000606319.dbf, 001674260.dbf, 002135562.dbf) and in those 
example files.

I'm not sure of the best way to handle the variants. Maybe have the DBFParser 
record the variant in the metadata (something like Application name?).
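
To make the metadata idea concrete, here is a rough sketch of mapping the 
first header byte of a dbf file to a variant label and recording it as 
metadata. Everything named here is an assumption for illustration: the class 
DbfVersionSketch, the metadata keys "dbf:version-byte" and "dbf:variant", and 
the plain Map standing in for Tika's Metadata object are all hypothetical, and 
the byte-to-name table is a partial list taken from commonly published xBase 
format notes, not from the DBFParser. It does not detect standalone dbt files.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper for illustration only; not part of Tika or any dbf library.
class DbfVersionSketch {

    // Maps the first header byte to a human-readable variant name.
    // The table is illustrative and deliberately incomplete.
    static String describe(int versionByte) {
        int b = versionByte & 0xFF;
        switch (b) {
            case 0x02: return "FoxBASE";
            case 0x03: return "dBASE III (no memo)";
            case 0x30: return "Visual FoxPro";
            case 0x83: return "dBASE III (with .dbt memo)";
            case 0x8B: return "dBASE IV (with memo)";
            case 0xF5: return "FoxPro (with memo)";
            default:   return String.format("unknown (0x%02X)", b);
        }
    }

    // Reads only the first byte of the stream and returns it as metadata entries.
    // The key names are assumptions, not existing Tika properties.
    static Map<String, String> readVariant(InputStream in) throws IOException {
        int b = in.read();
        if (b < 0) {
            throw new IOException("empty stream, no dbf header byte");
        }
        Map<String, String> metadata = new LinkedHashMap<>();
        metadata.put("dbf:version-byte", String.format("0x%02X", b));
        metadata.put("dbf:variant", describe(b));
        return metadata;
    }

    public static void main(String[] args) throws IOException {
        // Fake header start: only the first byte matters for this sketch.
        byte[] header = {(byte) 0x83, 0x10, 0x05, 0x18};
        System.out.println(readVariant(new ByteArrayInputStream(header)));
    }
}
```

Keeping the raw byte alongside the label means files with a version byte 
missing from the table still carry something useful in their metadata.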

> Add mime detection and parsing for dbf files
> --------------------------------------------
>
>                 Key: TIKA-1513
>                 URL: https://issues.apache.org/jira/browse/TIKA-1513
>             Project: Tika
>          Issue Type: Improvement
>            Reporter: Tim Allison
>            Assignee: Tim Allison
>            Priority: Minor
>             Fix For: 2.0, 1.14
>
>
> I just came across an Apache licensed dbf parser that is available on 
> [maven|https://repo1.maven.org/maven2/org/jamel/dbf/dbf-reader/0.1.0/dbf-reader-0.1.0.pom].
> Let's add dbf parsing to Tika.
> Any other recommendations for alternate parsers?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
