Thanks Tillman, exactly the info I needed.

On Mon, Jun 26, 2023 at 10:21 PM Tilman Hausherr <thaush...@t-online.de>
wrote:

> Hi,
> PDFBox preflight only checks for PDF/A-1b, not for any accessibility
> topics. Maybe your PDF isn't meant to be accessible to prevent scraping.
> Try https://verapdf.org/
> Tilman
>
> On 26.06.2023 19:36, Susan Borda wrote:
> > Hi All-
> > I'd like to check PDFs that have character encoding issues, does
> Preflight
> > do that? I checked the accessibility of a pdf file in Adobe Pro and it
> gave
> > me a "Character encoding -Failed" message. When I checked this same file
> in
> > Preflight I got this:
> >
> > Jun 26, 2023 1:24:41 PM
> org.apache.pdfbox.pdmodel.graphics.color.PDICCBased
> > ensureDisplayProfile
> > WARNING: ICC profile is Perceptual, ignoring, treating as Display class
> > Jun 26, 2023 1:24:41 PM
> org.apache.pdfbox.pdmodel.graphics.color.PDICCBased
> > ensureDisplayProfile
> > WARNING: ICC profile is Perceptual, ignoring, treating as Display class
> > Jun 26, 2023 1:24:41 PM
> org.apache.pdfbox.pdmodel.graphics.color.PDICCBased
> > ensureDisplayProfile
> > WARNING: ICC profile is Perceptual, ignoring, treating as Display class
> > The file BritishLibrary-PDF_Assessment_v1.3.pdf is a valid PDF/A-1b file
> >
> > When I try to copy/paste the text from this PDF it's all garbage and the
> > CMap is missing.
> >
> > Any advice would be greatly appreciated.
> > Thanks,
> > susan
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org
> For additional commands, e-mail: users-h...@pdfbox.apache.org
>
>

-- 
Susan Borda
Digital Preservation Projects Manager
Digital Preservation Unit
University of Michigan Libraries
Buhr Building
sbo...@umich.edu
*My office phone number is temporarily disconnected while I work remotely
due to COVID-19. Please contact me via email.*

Reply via email to