[ 
https://issues.apache.org/jira/browse/TIKA-586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983205#action_12983205
 ] 

Nick Burch commented on TIKA-586:
---------------------------------

Thanks for this. I made a slight tweak to the patch to put the sample files 
under tika-parser, then committed the fix and test in r1060393.

> Parsing a ms access file (*.mdb) throws an error
> ------------------------------------------------
>
>                 Key: TIKA-586
>                 URL: https://issues.apache.org/jira/browse/TIKA-586
>             Project: Tika
>          Issue Type: Bug
>          Components: mime
>    Affects Versions: 0.8
>            Reporter: Martijn van Groningen
>            Priority: Minor
>             Fix For: 0.9
>
>         Attachments: accessdb.mdb, TIKA-586.patch, true-font.ttf
>
>
> I know that parsing a ms access file (*.mdb) is not supported since there is 
> not parser for it, but I think it should not throw an exception. 
> Currently when parsing a mdb file it is being recognized as a true font file. 
> The TrueTypeParser throws an parser specific error when encountering a mdb 
> file.
> Stacktrace:
> Exception in thread "main" org.apache.tika.exception.TikaException: TIKA-198: 
> Illegal IOException from org.apache.tika.parser.font.TrueTypeParser@6906daba
>       at 
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:203)
>       at 
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
>       at 
> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
>       at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:94)
>       at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:273)
>       at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:80)
> Caused by: java.io.IOException: Unexpected end of TTF stream reached
>       at org.apache.fontbox.ttf.TTFDataStream.read(TTFDataStream.java:217)
>       at 
> org.apache.fontbox.ttf.TTFDataStream.readString(TTFDataStream.java:69)
>       at 
> org.apache.fontbox.ttf.TTFDataStream.readString(TTFDataStream.java:57)
>       at 
> org.apache.fontbox.ttf.AbstractTTFParser.readTableDirectory(AbstractTTFParser.java:214)
>       at 
> org.apache.fontbox.ttf.AbstractTTFParser.parseTTF(AbstractTTFParser.java:85)
>       at org.apache.fontbox.ttf.TTFParser.parseTTF(TTFParser.java:26)
>       at 
> org.apache.fontbox.ttf.AbstractTTFParser.parseTTF(AbstractTTFParser.java:66)
>       at org.apache.fontbox.ttf.TTFParser.parseTTF(TTFParser.java:26)
>       at 
> org.apache.tika.parser.font.TrueTypeParser.parse(TrueTypeParser.java:63)
>       at 
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
>       ... 5 more

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to