[ https://issues.apache.org/jira/browse/PDFBOX-5901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17898478#comment-17898478 ]
Tilman Hausherr commented on PDFBOX-5901:
-----------------------------------------

The font of that PDF is broken for the reason explained in the messages. Are you able to extract text with Adobe Reader? We can't analyze the PDF without having it.

> there is an issue with font mapping or rendering
> ------------------------------------------------
>
>                 Key: PDFBOX-5901
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5901
>             Project: PDFBox
>          Issue Type: Bug
>          Components: FontBox
>    Affects Versions: 2.0.31
>            Reporter: ltzzZ
>            Priority: Major
>         Attachments: image-2024-11-15-12-38-12-100.png,
>                      image-2024-11-15-12-38-36-179.png,
>                      image-2024-11-15-12-39-22-585.png
>
>
> When I try to extract the text content of a pdf file, I keep looping through
> the warning log of font rendering or mapping, I can't get the content of the
> file, how can I fix this problem.
>
> My code:
> !image-2024-11-15-12-38-36-179.png!
> problem:
> !image-2024-11-15-12-39-22-585.png!
> and sometimes the CPU usage is abnormal
> !image-2024-11-15-12-38-12-100.png!