[
https://issues.apache.org/jira/browse/TIKA-3849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17603244#comment-17603244
]
Hudson commented on TIKA-3849:
------------------------------
SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #786 (See
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/786/])
TIKA-3849 -- throw better msg/exception for old, unsupported mdb versions
(tallison:
[https://github.com/apache/tika/commit/699b0546eaf31af81a7ef8cbb7e9752c4320f349])
* (edit)
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/JackcessParser.java
> Throw UnsupportedFormatException or similar for really old mdb files
> -------------------------------------------------------------------
>
> Key: TIKA-3849
> URL: https://issues.apache.org/jira/browse/TIKA-3849
> Project: Tika
> Issue Type: Wish
> Reporter: Tim Allison
> Priority: Minor
> Fix For: 2.4.2
>
>
> In processing some digipres2022 bake-off files, I noticed that a number of
> mdb files triggered the same exception:
> {noformat}
> java.io.IOException: Unrecognized map type: 75
> at com.healthmarketscience.jackcess.impl.UsageMap.initHandler(UsageMap.java:150)
> at com.healthmarketscience.jackcess.impl.UsageMap.read(UsageMap.java:136)
> at com.healthmarketscience.jackcess.impl.PageChannel.initialize(PageChannel.java:118)
> at com.healthmarketscience.jackcess.impl.DatabaseImpl.<init>(DatabaseImpl.java:579)
> at com.healthmarketscience.jackcess.impl.DatabaseImpl.open(DatabaseImpl.java:440)
> at com.healthmarketscience.jackcess.DatabaseBuilder.open(DatabaseBuilder.java:267)
> at org.apache.tika.parser.microsoft.JackcessParser.parse(JackcessParser.java:94)
> {noformat}
> Googling this confirmed suspicions that these are pre-97 versions of Access
> databases. We should throw a more specific exception type, with a clearer
> message, when we hit this failure.
> ref: https://sourceforge.net/p/jackcess/bugs/101/
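
For illustration only, the kind of change the description asks for might look
roughly like the sketch below. It is not the code from the commit above. The
class OldMdbCheck, the exception UnsupportedOldMdbException and the Opener
interface are hypothetical stand-ins so the example compiles without Tika or
Jackcess on the classpath; the only detail taken from the report is the
"Unrecognized map type" message prefix in the stack trace.

```java
import java.io.IOException;

public class OldMdbCheck {

    /** Hypothetical stand-in for an "unsupported format" exception type. */
    public static class UnsupportedOldMdbException extends IOException {
        public UnsupportedOldMdbException(String msg, Throwable cause) {
            super(msg, cause);
        }
    }

    /** Hypothetical stand-in for whatever call opens the database. */
    @FunctionalInterface
    public interface Opener<T> {
        T open() throws IOException;
    }

    // Message prefix seen in the reported stack trace.
    static final String OLD_VERSION_MARKER = "Unrecognized map type";

    /**
     * Runs the opener. An IOException whose message starts with the marker
     * is rethrown as a more descriptive exception, keeping the original as
     * the cause; any other IOException is rethrown unchanged.
     */
    public static <T> T openOrExplain(Opener<T> opener) throws IOException {
        try {
            return opener.open();
        } catch (IOException e) {
            String msg = e.getMessage();
            if (msg != null && msg.startsWith(OLD_VERSION_MARKER)) {
                throw new UnsupportedOldMdbException(
                        "Likely a pre-97 Access database, which is not supported", e);
            }
            throw e;
        }
    }
}
```

Matching on the message text is brittle if the library ever rewords it, so a
real fix would want a test file that pins the behavior.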
--
This message was sent by Atlassian Jira
(v8.20.10#820010)