[ 
https://issues.apache.org/jira/browse/TIKA-3849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17607432#comment-17607432
 ] 

Hudson commented on TIKA-3849:
------------------------------

SUCCESS: Integrated in Jenkins build Tika ยป tika-main-jdk8 #808 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/808/])
TIKA-3849 -- fix file extension for zstd (tallison: 
[https://github.com/apache/tika/commit/9f019adaa233a4044e9852046d0f7b82a0d72da7])
* (delete) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-pkg-module/src/test/resources/test-documents/testZSTD.zstd
* (edit) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-package/src/test/java/org/apache/tika/parser/pkg/CompressorParserTest.java
* (edit) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-package/src/test/java/org/apache/tika/mime/TestMimeTypes.java
* (edit) tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml
* (add) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-pkg-module/src/test/resources/test-documents/testZSTD.zst


> Throw UnsupportedFormatException or similar for really old mdb files
> --------------------------------------------------------------------
>
>                 Key: TIKA-3849
>                 URL: https://issues.apache.org/jira/browse/TIKA-3849
>             Project: Tika
>          Issue Type: Wish
>            Reporter: Tim Allison
>            Priority: Minor
>             Fix For: 2.5.0
>
>
> In processing some digipres2022 bake-off files, I noticed that a number of 
> mdb files triggered the same exception: 
> {noformat}
> java.io.IOException: Unrecognized map type: 75
>     at 
> com.healthmarketscience.jackcess.impl.UsageMap.initHandler(UsageMap.java:150)
>     at com.healthmarketscience.jackcess.impl.UsageMap.read(UsageMap.java:136)
>     at 
> com.healthmarketscience.jackcess.impl.PageChannel.initialize(PageChannel.java:118)
>     at 
> com.healthmarketscience.jackcess.impl.DatabaseImpl.<init>(DatabaseImpl.java:579)
>     at 
> com.healthmarketscience.jackcess.impl.DatabaseImpl.open(DatabaseImpl.java:440)
>     at 
> com.healthmarketscience.jackcess.DatabaseBuilder.open(DatabaseBuilder.java:267)
>     at 
> org.apache.tika.parser.microsoft.JackcessParser.parse(JackcessParser.java:94)
> {noformat}
> Googling this confirmed suspicions that these are pre-97 versions of access 
> databases.  We should improve our exception type/labeling for this type of 
> exception.
> ref: https://sourceforge.net/p/jackcess/bugs/101/



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to