[ https://issues.apache.org/jira/browse/TIKA-1513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15300553#comment-15300553 ]
Nick C commented on TIKA-1513: ------------------------------ I wasn't able to find a way to detect the dbt files. I did find some example/test dbf/dbt files on http://www.clicketyclick.dk/databases/xbase/index.shtml.en Also there were some non 0x03 files in common crawl (000075371.dbf, 000543045.dbf, 000606319.dbf, 001674260.dbf, 002135562.dbf) and in those example files. I'm not sure the best way to handle the variants. Maybe have the DBFParser stick it in the metadata (Something like Application name?) > Add mime detection and parsing for dbf files > -------------------------------------------- > > Key: TIKA-1513 > URL: https://issues.apache.org/jira/browse/TIKA-1513 > Project: Tika > Issue Type: Improvement > Reporter: Tim Allison > Assignee: Tim Allison > Priority: Minor > Fix For: 2.0, 1.14 > > > I just came across an Apache licensed dbf parser that is available on > [maven|https://repo1.maven.org/maven2/org/jamel/dbf/dbf-reader/0.1.0/dbf-reader-0.1.0.pom]. > Let's add dbf parsing to Tika. > Any other recommendations for alternate parsers? -- This message was sent by Atlassian JIRA (v6.3.4#6332)