[jira] [Commented] (PDFBOX-5415) Infinite loop in ExtractText in 2.x branch on a specific pdf
[ https://issues.apache.org/jira/browse/PDFBOX-5415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522531#comment-17522531 ]

Tim Allison commented on PDFBOX-5415:
-------------------------------------

An answer on the Tika side. Yes, parsing is dangerous and you’ll need to isolate at the process level; thread level isolation is not enough.

See what we offer in Tika for robustness: https://cwiki.apache.org/confluence/plugins/servlet/mobile?contentId=148647830#content/view/148647830

> Infinite loop in ExtractText in 2.x branch on a specific pdf
> ------------------------------------------------------------
>
>                 Key: PDFBOX-5415
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5415
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.26
>            Reporter: Tim Allison
>            Priority: Major
>         Attachments: PDFBOX-5415-TIKA-3718-p10.pdf
>
>
> [~DavidAvant] reported an infinite loop in Tika and provided an example file.
> I can reproduce this with the latest PDFBox app 2.0.26-SNAPSHOT's ExtractText.
> File: https://issues.apache.org/jira/secure/attachment/13042292/map.pdf
> Adobe and a slightly out of date pdftotext also have problems with this file.

--
This message was sent by Atlassian Jira
(v8.20.1#820001)

To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org
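One way to picture the process-level isolation recommended in this comment is a parent JVM that launches the parser as a child process and kills it once a deadline passes. The sketch below uses only the standard `ProcessBuilder` and `Process` APIs. The class name `IsolatedParse`, the method `runWithTimeout`, and the idea of wrapping an ExtractText command line this way are assumptions made for illustration; this is not how Tika or PDFBox implement their own safeguards.

```java
import java.io.IOException;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.TimeUnit;

// Hypothetical helper: runs an arbitrary command in a child process with a deadline.
final class IsolatedParse {

    /**
     * Starts the command as a separate OS process and waits up to timeoutSeconds.
     * Returns true if the child exited on its own, false if it had to be killed.
     */
    static boolean runWithTimeout(List<String> command, long timeoutSeconds)
            throws IOException, InterruptedException {
        Process child = new ProcessBuilder(command)
                .redirectErrorStream(true)                        // merge stderr into stdout
                .redirectOutput(ProcessBuilder.Redirect.DISCARD)  // avoid blocking on a full pipe
                .start();
        if (child.waitFor(timeoutSeconds, TimeUnit.SECONDS)) {
            return true;
        }
        // Deadline exceeded: terminate the child and wait until it is really gone.
        child.destroyForcibly();
        child.waitFor();
        return false;
    }

    public static void main(String[] args) throws Exception {
        if (args.length < 2) {
            System.err.println("usage: IsolatedParse <timeoutSeconds> <command> [args...]");
            return;
        }
        long timeout = Long.parseLong(args[0]);
        List<String> command = Arrays.asList(args).subList(1, args.length);
        boolean finished = runWithTimeout(command, timeout);
        System.out.println(finished ? "child finished" : "child killed after timeout");
    }
}
```

A caller might pass something like `java -jar <pdfbox-app jar> ExtractText <input.pdf>` as the command, where the jar path is a placeholder. The design point is that a runaway child process can be terminated by the operating system, whereas a thread that never checks for interruption cannot be stopped safely from Java code.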
[jira] [Commented] (PDFBOX-5415) Infinite loop in ExtractText in 2.x branch on a specific pdf
[ https://issues.apache.org/jira/browse/PDFBOX-5415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522515#comment-17522515 ]

David Avant commented on PDFBOX-5415:
-------------------------------------

To summarize the verdict from Michael Demey: This PDF is wack. :)

The looping logic is not infinite, but the exponential nature of the loop is so bad that our star may go supernova before Tika is finished parsing it.

As ridiculous as this PDF might be, I suspect we need to defend against it. Otherwise this becomes a potential Denial of Service attack.

Given Michael's description of the nature of the issue, does it seem plausible that this can be fixed within the parser itself? Or do we need some external means of defense, like insulating the rest of the application by running Tika within its own thread?

> Infinite loop in ExtractText in 2.x branch on a specific pdf
> ------------------------------------------------------------
>
>                 Key: PDFBOX-5415
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5415
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.26
>            Reporter: Tim Allison
>            Priority: Major
>         Attachments: PDFBOX-5415-TIKA-3718-p10.pdf
>
>
> [~DavidAvant] reported an infinite loop in Tika and provided an example file.
> I can reproduce this with the latest PDFBox app 2.0.26-SNAPSHOT's ExtractText.
> File: https://issues.apache.org/jira/secure/attachment/13042292/map.pdf
> Adobe and a slightly out of date pdftotext also have problems with this file.
[jira] [Commented] (PDFBOX-5416) PDType1Font Can't read the embedded Type1 font ArialMT
[ https://issues.apache.org/jira/browse/PDFBOX-5416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522369#comment-17522369 ]

Tilman Hausherr commented on PDFBOX-5416:
-----------------------------------------

That's because we catch it. And then we choose a font that comes close to the broken one.

> PDType1Font Can't read the embedded Type1 font ArialMT
> ------------------------------------------------------
>
>                 Key: PDFBOX-5416
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5416
>             Project: PDFBox
>          Issue Type: Bug
>          Components: FontBox
>    Affects Versions: 2.0.25
>         Environment: Windows 10, JDK15, Gradle 6.7, Spring Boot 2.3.4
>            Reporter: Artur
>            Priority: Blocker
>              Labels: pdtype1font, type1, type1font
>         Attachments: label.pdf
>
>
> Hello,
> I have this error when i send a file to print:
> PDType1Font Can't read the embedded Type1 font ArialMT
> java.io.EOFException: attempted to read past EOF
>     at org.apache.fontbox.pfb.PfbParser.parsePfb(PfbParser.java:163)
>     at org.apache.fontbox.pfb.PfbParser.<init>(PfbParser.java:125)
>     at org.apache.fontbox.type1.Type1Font.createWithPFB(Type1Font.java:69)
>     at org.apache.pdfbox.pdmodel.font.PDType1Font.<init>(PDType1Font.java:247)
>     at org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont(PDFontFactory.java:76)
>     at org.apache.pdfbox.pdmodel.PDResources.getFont(PDResources.java:146)
>     at org.apache.pdfbox.contentstream.operator.text.SetFontAndSize.process(SetFontAndSize.java:66)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:939)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:514)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:492)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:155)
>     at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:282)
>     at org.apache.pdfbox.rendering.PDFRenderer.renderPageToGraphics(PDFRenderer.java:466)
>     at org.apache.pdfbox.printing.PDFPrintable.print(PDFPrintable.java:264)
>     at java.desktop/sun.print.RasterPrinterJob.printPage(RasterPrinterJob.java:2275)
>     at java.desktop/sun.print.RasterPrinterJob.print(RasterPrinterJob.java:1654)
>
> Thread PDType1Font Can't read the embedded Type1 font Arial-BoldMT
> java.io.EOFException: attempted to read past EOF
>     at org.apache.fontbox.pfb.PfbParser.parsePfb(PfbParser.java:163)
>     at org.apache.fontbox.pfb.PfbParser.<init>(PfbParser.java:125)
>     at org.apache.fontbox.type1.Type1Font.createWithPFB(Type1Font.java:69)
>     at org.apache.pdfbox.pdmodel.font.PDType1Font.<init>(PDType1Font.java:247)
>     at org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont(PDFontFactory.java:76)
>     at org.apache.pdfbox.pdmodel.PDResources.getFont(PDResources.java:146)
>     at org.apache.pdfbox.contentstream.operator.text.SetFontAndSize.process(SetFontAndSize.java:66)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:939)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:514)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:492)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:155)
>     at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:282)
>     at org.apache.pdfbox.rendering.PDFRenderer.renderPageToGraphics(PDFRenderer.java:466)
>     at org.apache.pdfbox.printing.PDFPrintable.print(PDFPrintable.java:264)
>     at java.desktop/sun.print.RasterPrinterJob.printPage(RasterPrinterJob.java:2275)
>     at java.desktop/sun.print.RasterPrinterJob.print(RasterPrinterJob.java:1654)
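The reply in this message (the library catches the exception itself and then picks a similar font) is the reason a caller's own `catch (EOFException e)` block never runs: the exception is logged and handled before it can reach calling code. A minimal sketch of that pattern follows. `FontFallbackDemo`, `parseEmbeddedFont`, and `loadFont` are hypothetical names chosen for illustration, and the 6-byte length check is an arbitrary stand-in for a truncated font program; this is not PDFBox's actual code.

```java
import java.io.EOFException;
import java.io.IOException;

// Hypothetical illustration of "log, then fall back" inside a library.
final class FontFallbackDemo {

    // Stand-in for a parser that fails on truncated font data.
    static String parseEmbeddedFont(byte[] data) throws IOException {
        if (data.length < 6) {
            throw new EOFException("attempted to read past EOF");
        }
        return "embedded";
    }

    // Stand-in for the library entry point: the exception is caught here,
    // reported, and replaced by a substitute, so it never reaches the caller.
    static String loadFont(byte[] data) {
        try {
            return parseEmbeddedFont(data);
        } catch (IOException e) {
            System.err.println("Can't read the embedded font: " + e);
            return "fallback";
        }
    }

    public static void main(String[] args) {
        try {
            String font = loadFont(new byte[0]); // broken data, but nothing is thrown
            System.out.println("using font: " + font);
        } catch (RuntimeException e) {
            System.out.println("never reached for the EOF case");
        }
    }
}
```

Under this pattern the only visible trace of the failure is the logged message and stack trace, which matches what the reporter sees when printing.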
[jira] [Updated] (PDFBOX-5418) NPE during page render
[ https://issues.apache.org/jira/browse/PDFBOX-5418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tilman Hausherr updated PDFBOX-5418:
------------------------------------
    Summary: NPE during page render  (was: NPE dunring page render)

> NPE during page render
> ----------------------
>
>                 Key: PDFBOX-5418
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5418
>             Project: PDFBox
>          Issue Type: Bug
>          Components: FontBox, Rendering
>    Affects Versions: 2.0.25
>         Environment: openjdk 11.0.14 2022-01-18 LTS
>            Reporter: Maksym
>            Assignee: Tilman Hausherr
>            Priority: Major
>             Fix For: 2.0.26, 3.0.0 PDFBox
>
>         Attachments: font_issue.pdf
>
>
> Rendering page causes NPE, opening in debugger gives this stacktrace:
> {code:java}
> java.lang.RuntimeException: java.util.concurrent.ExecutionException: java.lang.NullPointerException
>     org.apache.pdfbox.debugger.pagepane.PagePane$RenderWorker.done(PagePane.java:481)
>     java.desktop/sun.swing.AccumulativeRunnable.run(AccumulativeRunnable.java:112)
>     java.base/java.security.AccessController.doPrivileged(Native Method)
>     java.base/java.security.ProtectionDomain$JavaSecurityAccessImpl.doIntersectionPrivilege(ProtectionDomain.java:85)
> Caused by: java.util.concurrent.ExecutionException: java.lang.NullPointerException
>     org.apache.pdfbox.debugger.pagepane.PagePane$RenderWorker.done(PagePane.java:461)
>     java.desktop/sun.swing.AccumulativeRunnable.run(AccumulativeRunnable.java:112)
>     java.base/java.security.AccessController.doPrivileged(Native Method)
>     java.base/java.security.ProtectionDomain$JavaSecurityAccessImpl.doIntersectionPrivilege(ProtectionDomain.java:85)
> Caused by: java.lang.NullPointerException: null
>     org.apache.fontbox.ttf.GlyfCompositeDescript.getContourCount(GlyfCompositeDescript.java:239)
>     org.apache.fontbox.ttf.GlyfCompositeDescript.resolve(GlyfCompositeDescript.java:113)
>     org.apache.fontbox.ttf.GlyphTable.getGlyphData(GlyphTable.java:216)
>     org.apache.fontbox.ttf.GlyphTable.getGlyph(GlyphTable.java:191)
>     org.apache.pdfbox.pdmodel.font.PDCIDFontType2.getPath(PDCIDFontType2.java:418)
>     org.apache.pdfbox.pdmodel.font.PDType0Font.getPath(PDType0Font.java:620)
>     org.apache.pdfbox.rendering.TTFGlyph2D.getPathForGID(TTFGlyph2D.java:144)
>     org.apache.pdfbox.rendering.TTFGlyph2D.getPathForCharacterCode(TTFGlyph2D.java:93)
>     org.apache.pdfbox.rendering.PageDrawer.drawGlyph2D(PageDrawer.java:514)
>     org.apache.pdfbox.rendering.PageDrawer.showFontGlyph(PageDrawer.java:495)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.showGlyph(PDFStreamEngine.java:793)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.showGlyph(PDFStreamEngine.java:811)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.showText(PDFStreamEngine.java:749)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.showTextStrings(PDFStreamEngine.java:654)
>     org.apache.pdfbox.contentstream.operator.text.ShowTextAdjusted.process(ShowTextAdjusted.java:53)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:939)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:514)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:492)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.showForm(PDFStreamEngine.java:186)
>     org.apache.pdfbox.rendering.PageDrawer.showForm(PageDrawer.java:1558)
>     org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:85)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:939)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:514)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:492)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.showForm(PDFStreamEngine.java:186)
>     org.apache.pdfbox.rendering.PageDrawer.showForm(PageDrawer.java:1558)
>     org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:85)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:939)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:514)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:492)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.showForm(PDFStreamEngine.java:186)
>     org.apache.pdfbox.rendering.PageDrawer.showForm(PageDrawer.java:1558)
>     org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:85)
>
[jira] [Updated] (PDFBOX-5418) NPE dunring page render
[ https://issues.apache.org/jira/browse/PDFBOX-5418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tilman Hausherr updated PDFBOX-5418:
------------------------------------
    Component/s: FontBox

> NPE dunring page render
> -----------------------
>
>                 Key: PDFBOX-5418
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5418
>             Project: PDFBox
>          Issue Type: Bug
>          Components: FontBox, Rendering
>    Affects Versions: 2.0.25
>         Environment: openjdk 11.0.14 2022-01-18 LTS
>            Reporter: Maksym
>            Assignee: Tilman Hausherr
>            Priority: Major
>             Fix For: 2.0.26, 3.0.0 PDFBox
>
>         Attachments: font_issue.pdf
[jira] [Resolved] (PDFBOX-5418) NPE dunring page render
[ https://issues.apache.org/jira/browse/PDFBOX-5418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tilman Hausherr resolved PDFBOX-5418.
-------------------------------------
    Fix Version/s: 2.0.26
                   3.0.0 PDFBox
         Assignee: Tilman Hausherr
       Resolution: Fixed

Thanks; a snapshot build will be available at https://repository.apache.org/content/groups/snapshots/org/apache/pdfbox/pdfbox-app/2.0.26-SNAPSHOT/ within a few hours.

> NPE dunring page render
> -----------------------
>
>                 Key: PDFBOX-5418
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5418
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Rendering
>    Affects Versions: 2.0.25
>         Environment: openjdk 11.0.14 2022-01-18 LTS
>            Reporter: Maksym
>            Assignee: Tilman Hausherr
>            Priority: Major
>             Fix For: 2.0.26, 3.0.0 PDFBox
>
>         Attachments: font_issue.pdf
[jira] [Commented] (PDFBOX-5418) NPE dunring page render
[ https://issues.apache.org/jira/browse/PDFBOX-5418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522361#comment-17522361 ]

ASF subversion and git services commented on PDFBOX-5418:
---------------------------------------------------------

Commit 1899860 from Tilman Hausherr in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1899860 ]

PDFBOX-5418: avoid NPE

> NPE dunring page render
> -----------------------
>
>                 Key: PDFBOX-5418
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5418
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Rendering
>    Affects Versions: 2.0.25
>         Environment: openjdk 11.0.14 2022-01-18 LTS
>            Reporter: Maksym
>            Priority: Major
>         Attachments: font_issue.pdf
[jira] [Commented] (PDFBOX-5418) NPE dunring page render
[ https://issues.apache.org/jira/browse/PDFBOX-5418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522360#comment-17522360 ]

ASF subversion and git services commented on PDFBOX-5418:
---------------------------------------------------------

Commit 1899859 from Tilman Hausherr in branch 'pdfbox/branches/2.0'
[ https://svn.apache.org/r1899859 ]

PDFBOX-5418: avoid NPE

> NPE dunring page render
> -----------------------
>
>                 Key: PDFBOX-5418
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5418
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Rendering
>    Affects Versions: 2.0.25
>         Environment: openjdk 11.0.14 2022-01-18 LTS
>            Reporter: Maksym
>            Priority: Major
>         Attachments: font_issue.pdf
[jira] [Commented] (PDFBOX-5416) PDType1Font Can't read the embedded Type1 font ArialMT
[ https://issues.apache.org/jira/browse/PDFBOX-5416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522278#comment-17522278 ]

Artur commented on PDFBOX-5416:
-------------------------------

Thank you for your answer. I have one more question. Maybe I'm missing something, or I'm not educated enough :) but why can't I catch the EOFException in this code?

    try {
        File file = new File("D:\\label.pdf");
        PDDocument pddDocument = PDDocument.load(file);
        for (int i = 0; i < pddDocument.getNumberOfPages(); i++) {
            PDPage page = pddDocument.getPage(i);
            PDResources res = page.getResources();
            for (COSName fontName : res.getFontNames()) {
                PDFont font = res.getFont(fontName);
            }
        }
    } catch (EOFException e) {
        System.out.println("Error");
    } catch (IOException e) {
        System.out.println("Second error");
    } catch (Exception e) {
        System.out.println("Third error");
    }

> PDType1Font Can't read the embedded Type1 font ArialMT
> ------------------------------------------------------
>
> Key: PDFBOX-5416
> URL: https://issues.apache.org/jira/browse/PDFBOX-5416
> Project: PDFBox
> Issue Type: Bug
> Components: FontBox
> Affects Versions: 2.0.25
> Environment: Windows 10, JDK15, Gradle 6.7, Spring Boot 2.3.4
> Reporter: Artur
> Priority: Blocker
> Labels: pdtype1font, type1, type1font
> Attachments: label.pdf
>
> Hello,
> I have this error when I send a file to print:
>
> PDType1Font Can't read the embedded Type1 font ArialMT
> java.io.EOFException: attempted to read past EOF
>     at org.apache.fontbox.pfb.PfbParser.parsePfb(PfbParser.java:163)
>     at org.apache.fontbox.pfb.PfbParser.<init>(PfbParser.java:125)
>     at org.apache.fontbox.type1.Type1Font.createWithPFB(Type1Font.java:69)
>     at org.apache.pdfbox.pdmodel.font.PDType1Font.<init>(PDType1Font.java:247)
>     at org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont(PDFontFactory.java:76)
>     at org.apache.pdfbox.pdmodel.PDResources.getFont(PDResources.java:146)
>     at org.apache.pdfbox.contentstream.operator.text.SetFontAndSize.process(SetFontAndSize.java:66)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:939)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:514)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:492)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:155)
>     at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:282)
>     at org.apache.pdfbox.rendering.PDFRenderer.renderPageToGraphics(PDFRenderer.java:466)
>     at org.apache.pdfbox.printing.PDFPrintable.print(PDFPrintable.java:264)
>     at java.desktop/sun.print.RasterPrinterJob.printPage(RasterPrinterJob.java:2275)
>     at java.desktop/sun.print.RasterPrinterJob.print(RasterPrinterJob.java:1654)
>
> Thread PDType1Font Can't read the embedded Type1 font Arial-BoldMT
> java.io.EOFException: attempted to read past EOF
>     at org.apache.fontbox.pfb.PfbParser.parsePfb(PfbParser.java:163)
>     at org.apache.fontbox.pfb.PfbParser.<init>(PfbParser.java:125)
>     at org.apache.fontbox.type1.Type1Font.createWithPFB(Type1Font.java:69)
>     at org.apache.pdfbox.pdmodel.font.PDType1Font.<init>(PDType1Font.java:247)
>     at org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont(PDFontFactory.java:76)
>     at org.apache.pdfbox.pdmodel.PDResources.getFont(PDResources.java:146)
>     at org.apache.pdfbox.contentstream.operator.text.SetFontAndSize.process(SetFontAndSize.java:66)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:939)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:514)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:492)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:155)
>     at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:282)
>     at org.apache.pdfbox.rendering.PDFRenderer.renderPageToGraphics(PDFRenderer.java:466)
>     at org.apache.pdfbox.printing.PDFPrintable.print(PDFPrintable.java:264)
>     at java.desktop/sun.print.RasterPrinterJob.printPage(RasterPrinterJob.java:2275)
>     at java.desktop/sun.print.RasterPrinterJob.print(RasterPrinterJob.java:1654)
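A possible explanation for the question above, offered as an assumption rather than a statement about the PDFBox source: the trace in the issue is preceded by a log message ("Can't read the embedded Type1 font ..."), and the remark elsewhere in this issue that the only effect is cluttered logfiles points the same way, namely that the EOFException is caught and logged inside the library and never propagates to the caller. The sketch below shows that general Java behaviour in isolation. All names in it (SwallowedExceptionDemo, parseEmbeddedFont, loadFont, callerOutcome) are hypothetical stand-ins, not PDFBox API.

```java
import java.io.EOFException;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for library behaviour; this is NOT PDFBox code.
class SwallowedExceptionDemo {

    // Collects "log" messages so the behaviour can be inspected.
    static final List<String> LOG = new ArrayList<>();

    // Simulates parsing a truncated embedded font program.
    static String parseEmbeddedFont(String name) throws IOException {
        throw new EOFException("attempted to read past EOF");
    }

    // Simulates a loader that catches the parse failure itself,
    // logs it, and returns a fallback instead of rethrowing.
    static String loadFont(String name) throws IOException {
        try {
            return parseEmbeddedFont(name);
        } catch (IOException e) {
            LOG.add("Can't read the embedded Type1 font " + name + ": " + e);
            return "fallback-for-" + name;
        }
    }

    // Mirrors the shape of the caller's try/catch in the question.
    static String callerOutcome(String name) {
        try {
            String font = loadFont(name);
            return "no exception, got " + font;
        } catch (EOFException e) {
            return "Error";
        } catch (IOException e) {
            return "Second error";
        } catch (Exception e) {
            return "Third error";
        }
    }

    public static void main(String[] args) {
        System.out.println(callerOutcome("ArialMT"));
        System.out.println(LOG);
    }
}
```

None of the three catch blocks runs here, because the exception has already been handled one level down; the only visible trace is the log entry. If that is what happens with label.pdf, the caller can detect the problem only through the log output, not through a catch block.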
Re: 2.0.26 release
On 14.04.2022 at 08:13, Andreas Lehmkuehler wrote:
> Cool, thanks for the feedback. I've set the ticket to resolved. Do we need to re-run the tests?
>
> BTW, what about PDFBOX-5394? Is there anything left to do? Do we have to wait for the feedback of the user?

I've set that one to resolved. I will rerun the tests myself on the long weekend, if I have the time.

Tilman
[jira] [Resolved] (PDFBOX-5394) Render symbol for file attachment annotations
[ https://issues.apache.org/jira/browse/PDFBOX-5394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tilman Hausherr resolved PDFBOX-5394.
------------------------------------
Resolution: Fixed

Set to resolved despite lack of user feedback. I'm confident that this works.

> Render symbol for file attachment annotations
> ---------------------------------------------
>
> Key: PDFBOX-5394
> URL: https://issues.apache.org/jira/browse/PDFBOX-5394
> Project: PDFBox
> Issue Type: Improvement
> Components: Rendering
> Affects Versions: 2.0.25
> Reporter: John Smith
> Assignee: Tilman Hausherr
> Priority: Minor
> Labels: Annotations, annotation
> Fix For: 2.0.26, 3.0.0 PDFBox
> Attachments: paperclip.pdf
>
> Hi, I need PDFBox to render something like this paper clip symbol for
> attachments. It is shown when you open a PDF with an attachment in Adobe
> PDF Reader, but PDFBox does not render any icon for it.
> I suggest using e.g. this MIT licensed icon:
> [Paper Clip Icon|https://raw.githubusercontent.com/Iconscout/unicons/master/svg/line/paperclip.svg]
[jira] [Commented] (PDFBOX-5416) PDType1Font Can't read the embedded Type1 font ArialMT
[ https://issues.apache.org/jira/browse/PDFBOX-5416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522098#comment-17522098 ]

Tilman Hausherr commented on PDFBOX-5416:
-----------------------------------------

Then you need to reproduce this freeze effect. The problem you mention is in your file, and the only effect it has is that your logfiles get cluttered.

I don't understand why they used embedded fonts at all; this is just a label, not a long-term PDF. They could use the standard 14 fonts.
[jira] [Commented] (PDFBOX-5416) PDType1Font Can't read the embedded Type1 font ArialMT
[ https://issues.apache.org/jira/browse/PDFBOX-5416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522093#comment-17522093 ]

Artur commented on PDFBOX-5416:
-------------------------------

Yes. Most of the time I get a decent, correct print, but once in a couple of times the program freezes.
Re: 2.0.26 release
Cool, thanks for the feedback. I've set the ticket to resolved. Do we need to re-run the tests?

BTW, what about PDFBOX-5394? Is there anything left to do? Do we have to wait for the feedback of the user?

Andreas

On 13.04.22 at 08:29, Tilman Hausherr wrote:
Yeah, PDFBOX-5413 fixes that one as well.

Tilman

On 12.04.2022 at 19:26, Tilman Hausherr wrote:
Only one left: 7LRS5U6CAFMN2P6JPTZVNBUW6XOFYH4M.pdf. There is some sort of problem with an incremental save; a part of the multi-content stream is missing / has a new object number. Let's wait and see whether it is related to PDFBOX-5413.

(The other one, HOAZTST4E26NPA7HL72WCIVMNRQ3E4M5.pdf, is an improvement; I'll add it to my own tests.)

Tilman

On 12.04.2022 at 18:25, Tilman Hausherr wrote:
Only

commoncrawl3/7L/7LRS5U6CAFMN2P6JPTZVNBUW6XOFYH4M
commoncrawl3/HO/HOAZTST4E26NPA7HL72WCIVMNRQ3E4M5

have a different text extraction. With the other two it's attachment file names or doc info.

Tilman

On 12.04.2022 at 08:16, Tilman Hausherr wrote:
After having looked at the content differences and trying to rule out the /Names differences, there are 4 files with content in TOP_10_MORE_IN_A that feel suspicious and IMHO need investigation:

commoncrawl3/7L/7LRS5U6CAFMN2P6JPTZVNBUW6XOFYH4M
govdocs1/365/365260.pdf
commoncrawl3/HO/HOAZTST4E26NPA7HL72WCIVMNRQ3E4M5
govdocs1/150/150282.pdf

Tilman

On 12.04.22 at 08:09, Andreas Lehmkuehler wrote:
Thanks Tim!

Looks like there are 5 new exceptions left. I'm going to check the first two:

commoncrawl3/ZC/ZCY5MCL7KI6QXVMXUZ2AJKXICQIT4TL4
commoncrawl3/WY/WYPJNTD5KQNODSXWK4GABURXRTTD5P4H

The others are thrown within Jempbox.

Andreas

On 11.04.22 at 12:40, Tim Allison wrote:
https://corpora.tika.apache.org/base/reports/tika-2.4-20220410.tgz

Haven't had a chance to review. Hot off the vm.

On Sun, Apr 10, 2022 at 9:58 AM Tim Allison wrote:
Will try to kick off today…first thing Monday morning (EDT) at the latest.

On Sun, Apr 10, 2022 at 9:05 AM Andreas Lehmkuehler wrote:
On 09.04.22 at 19:00, Tilman Hausherr wrote:
> testFlattenPDFBOX2469Filled also fails in 2.0 (it is disabled by default).

I've fixed all new tickets. PDFBOX-5413 fixes the issue with the disabled flatten test.

@Tim Is there any chance to re-run the tests?

Andreas

> testFlattenPDFBOX2469Filled(org.apache.pdfbox.pdmodel.interactive.form.PDAcroFormFlattenTest) Time elapsed: 1.083 s <<< ERROR!
> java.io.IOException: javax.crypto.BadPaddingException: Given final block not properly padded. Such issues can arise if a bad key is used during decryption.
>     at org.apache.pdfbox.pdmodel.interactive.form.PDAcroFormFlattenTest.generateSamples(PDAcroFormFlattenTest.java:345)
>     at org.apache.pdfbox.pdmodel.interactive.form.PDAcroFormFlattenTest.flattenAndCompare(PDAcroFormFlattenTest.java:309)
>     at org.apache.pdfbox.pdmodel.interactive.form.PDAcroFormFlattenTest.testFlattenPDFBOX2469Filled(PDAcroFormFlattenTest.java:105)
> Caused by: javax.crypto.BadPaddingException: Given final block not properly padded. Such issues can arise if a bad key is used during decryption.
>     at org.apache.pdfbox.pdmodel.interactive.form.PDAcroFormFlattenTest.generateSamples(PDAcroFormFlattenTest.java:345)
>     at org.apache.pdfbox.pdmodel.interactive.form.PDAcroFormFlattenTest.flattenAndCompare(PDAcroFormFlattenTest.java:309)
>     at org.apache.pdfbox.pdmodel.interactive.form.PDAcroFormFlattenTest.testFlattenPDFBOX2469Filled(PDAcroFormFlattenTest.java:105)
>
> I'm not creating an issue this time in case this is also related to another known problem.
>
> Tilman
[jira] [Commented] (PDFBOX-5413) Field text missing
[ https://issues.apache.org/jira/browse/PDFBOX-5413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522073#comment-17522073 ]

Andreas Lehmkühler commented on PDFBOX-5413:
--------------------------------------------

Looks like all regressions are solved, set to resolved

> Field text missing
> ------------------
>
> Key: PDFBOX-5413
> URL: https://issues.apache.org/jira/browse/PDFBOX-5413
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.0.26, 3.0.0 PDFBox
> Reporter: Tilman Hausherr
> Assignee: Andreas Lehmkühler
> Priority: Major
> Labels: regression
> Fix For: 2.0.26, 3.0.0 PDFBox
> Attachments: CZIB6B5RY5HQDSEXXWSGUHSAP75CAI7Q.pdf, WYPJNTD5KQNODSXWK4GABURXRTTD5P4H.pdf, ZCY5MCL7KI6QXVMXUZ2AJKXICQIT4TL4.pdf
>
> The bottom field on page 2 ("AREA OF CONSIDERATION") is missing.
> This worked in 2.0.25. This is a weird case: incrementally written object 11
> points to 102796. However there is a "5" just before the 11.
[jira] [Resolved] (PDFBOX-5413) Field text missing
[ https://issues.apache.org/jira/browse/PDFBOX-5413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andreas Lehmkühler resolved PDFBOX-5413.
---------------------------------------
Resolution: Fixed