[
https://issues.apache.org/jira/browse/PDFBOX-2610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14284144#comment-14284144
]
Tilman Hausherr commented on PDFBOX-2610:
-----------------------------------------
I found this 2012 paper by [~johanvanderknijff] here:
http://openpreservation.org/system/files/pdfProfilingJvdK19122012.pdf
{quote}
In addition to this I also ran Apache Preflight on the 'Bavaria test suite'
(...)
Preflight raised an exception for 5 out of 85 files in the dataset (6%), which
indicates that at this stage it may simply not be sufficiently stable or mature
for operational use.
{quote}
The good news is that there were no exceptions when I started. The false
positives / false negatives are now down to 7 (from 16). The bad news is that
the remaining problems are more difficult, and that I haven't yet investigated
why preflight sometimes reports more PDF/A errors than the Bavaria XML file lists.
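
For illustration only, the false positive / false negative count above could be
tallied roughly as in the sketch below. This is not the actual test code. It
assumes that a "positive" means preflight reported an error, that the expected
outcome per file is already known (e.g. taken from the Bavaria XML file), and
that the function name, file names and outcomes are all hypothetical.

```python
from collections import Counter


def tally(expected_valid, reported_valid):
    """Compare expected and reported conformance per file.

    Both arguments map a file name to True (conforming) or False
    (non-conforming). Assumption: a "positive" is a reported error, so a
    false positive is a conforming file that was flagged as invalid.
    """
    counts = Counter()
    for name, expected in expected_valid.items():
        reported = reported_valid[name]
        if reported == expected:
            counts["correct"] += 1
        elif expected and not reported:
            counts["false_positive"] += 1  # conforming file was flagged
        else:
            counts["false_negative"] += 1  # non-conforming file passed
    return counts


# Hypothetical file names and outcomes, not real Bavaria suite data.
expected = {"a.pdf": True, "b.pdf": False, "c.pdf": True, "d.pdf": False}
reported = {"a.pdf": True, "b.pdf": True, "c.pdf": False, "d.pdf": False}

result = tally(expected, reported)
print(dict(result))
```

A tally like this only compares the overall valid / invalid verdict. It does
not cover the case where more errors are reported for a file than expected.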
> Expand Isartor test for Bavaria test suite and other tests
> ----------------------------------------------------------
>
> Key: PDFBOX-2610
> URL: https://issues.apache.org/jira/browse/PDFBOX-2610
> Project: PDFBox
> Issue Type: Task
> Components: Preflight
> Affects Versions: 2.0.0
> Reporter: Tilman Hausherr
> Assignee: Tilman Hausherr
>
> 1) Expand the Isartor test code so that it can also check conforming
> documents, i.e. documents that should not produce any errors. Support JBIG2.
> 2) Test the files from the Bavaria suite with preflight. I'll create
> sub-issues on that one. I counted 16 where something doesn't work as intended.
> 3) Include the Bavaria tests in the build. Only if we agree on this one. If
> not, I'll just keep it for myself as an additional regression test.