AW: Text extraction fails because permissions are not recognized

Brangs, Erik Mon, 08 Dec 2025 10:18:47 -0800

Hi,

Thank you. Then I'll leave my code as it is.
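
For anyone who finds this thread later: the workaround Tilman describes below (overwriting the references to the encryption dictionary with blanks) could be sketched roughly as follows. This is only an illustration, not something from PDFBox itself. The class and method names are made up, and it assumes the references appear as plain "/Encrypt <obj> <gen> R" entries in the file. Each match is overwritten with the same number of spaces so that byte offsets in the cross-reference data stay valid. The encrypted embedded files themselves remain unreadable.

```java
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.Arrays;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of PDFBox.
class BlankEncryptRefs {
    // Assumption: references look like "/Encrypt 12 0 R".
    private static final Pattern ENCRYPT_REF =
            Pattern.compile("/Encrypt\\s+\\d+\\s+\\d+\\s+R");

    static byte[] blankEncryptRefs(byte[] pdf) {
        // ISO-8859-1 maps each byte to exactly one char,
        // so regex match offsets are also byte offsets.
        String text = new String(pdf, StandardCharsets.ISO_8859_1);
        byte[] out = pdf.clone();
        Matcher m = ENCRYPT_REF.matcher(text);
        while (m.find()) {
            // Overwrite in place; the file length does not change.
            Arrays.fill(out, m.start(), m.end(), (byte) ' ');
        }
        return out;
    }

    public static void main(String[] args) throws Exception {
        // Usage: java BlankEncryptRefs in.pdf out.pdf
        byte[] in = Files.readAllBytes(Paths.get(args[0]));
        Files.write(Paths.get(args[1]), blankEncryptRefs(in));
    }
}
```

The input file is never modified; the result goes to a second path, which can then be opened with PDFBox as usual.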


> -----Original Message-----
> From: Tilman Hausherr [mailto:[email protected]]
> Sent: Monday, 8 December 2025 15:05
> To: [email protected]
> Subject: Re: Text extraction fails because permissions are not recognized
> 
> Hi,
> 
> PDFBox fails with an error message related to a missing certificate,
> Firefox and Chrome are unable to display it, Adobe displays it but
> mentions "Zertifikatssicherheit" (certificate security), and PDF-XChange
> mentions something about a digital ID.
> From what I see, it is a special case that we don't support:
> DefEmbeddedFile, which means that only the embedded files are encrypted.
> Overwriting the references to the encryption dictionary with blanks
> (there are two) makes it possible to display the file with PDFBox. So your
> document is legit but uses an exotic feature.
> 
> Tilman
> 
> On 08.12.2025 at 10:18, Brangs, Erik wrote:
> > Hi,
> >
> > I'm trying to extract text from the PDF at https://d-nb.info/1015222862/34
> > using PDFBox 3.0.6 with bouncycastle. My code uses canExtractContent() from
> > AccessPermission to check if text extraction should be attempted. When
> > attempting to extract text, PDFBox fails with "IOException:
> > java.io.IOException: Provided decryption material is not compatible with
> > the document - did you pass a null keyStore?". The problem also occurs with
> > PDFBox 4.0.0-SNAPSHOT.
> >
> > Adobe Acrobat can open the PDF. According to its permission display,
> > assembly and extraction of pages is not permitted. I also tried opening the
> > document with PDF-XChange Viewer. PDF-XChange Viewer cannot open the
> > document and displays an error message that says "Error [PDF Structure 40]:
> > Unknown/unsupported security handler".
> >
> > Is my check for text extraction wrong or incomplete?
> >
> >
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]
