Hi,

I'm trying to extract text from the PDF at https://d-nb.info/1015222862/34 
using PDFBox 3.0.6 with bouncycastle. My code uses canExtractContent() from 
AccessPermission to check if text extraction should be attempted. When 
attempting to extract text, PDFBox fails with "IOException: 
java.io.IOException: Provided decryption material is not compatible with the 
document - did you pass a null keyStore?". The problem also occurs with PDFBox 
4.0.0-SNAPSHOT.

Adobe Acrobat can open the PDF. According to its permission display, assembly 
and extraction of pages is not permitted. I also tried opening the document 
with PDF-XChange Viewer. PDF-XChange Viewer cannot open the document and 
displays an error message that says "Error [PDF Structure 40]: 
Unknown/unsupported security handler".

Is my check for text extraction wrong or incomplete?


-- 
Erik Brangs
Deutsche Nationalbibliothek
Fachbereich Metadaten | IT
Adickesallee 1
60322 Frankfurt am Main
Telefon: +49 69 1525-1850
Telefax: +49 69 1525-1799
mailto:[email protected]
https://www.dnb.de

Reply via email to