[jira] [Commented] (PDFBOX-5415) Infinite loop in ExtractText in 2.x branch on a specific pdf

2022-04-14 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522531#comment-17522531
 ] 

Tim Allison commented on PDFBOX-5415:
-

An answer from the Tika side: yes, parsing is dangerous, and you'll need to 
isolate it at the process level; thread-level isolation is not enough. See what 
we offer in Tika for robustness: 
https://cwiki.apache.org/confluence/plugins/servlet/mobile?contentId=148647830#content/view/148647830
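
As a rough illustration of process-level isolation (a minimal sketch, not what Tika itself ships): the parse runs in a child process that the parent kills once a time budget is exceeded. The class name IsolatedRunner and the method runWithTimeout are hypothetical; only standard java.lang.ProcessBuilder and Process calls are used, and Redirect.DISCARD assumes Java 9 or later.

```java
import java.io.IOException;
import java.util.List;
import java.util.concurrent.TimeUnit;

// Hypothetical helper, not part of Tika or PDFBox.
final class IsolatedRunner {

    /**
     * Runs a command in a child process.
     * Returns true if it exited within the timeout, false if it had to be killed.
     */
    static boolean runWithTimeout(List<String> command, long timeout, TimeUnit unit)
            throws IOException, InterruptedException {
        ProcessBuilder pb = new ProcessBuilder(command);
        // Discard the child's output so it cannot block on a full pipe buffer.
        pb.redirectErrorStream(true);
        pb.redirectOutput(ProcessBuilder.Redirect.DISCARD);
        Process process = pb.start();

        if (process.waitFor(timeout, unit)) {
            return true; // child finished on its own
        }
        // Time budget exceeded: kill the whole child process and reap it.
        process.destroyForcibly();
        process.waitFor();
        return false;
    }
}
```

A caller could, for example, pass a command such as List.of("java", "-jar", "pdfbox-app-2.0.26.jar", "ExtractText", "suspect.pdf", "out.txt") with a 60-second budget; the jar and file paths here are placeholders. Because the work happens in a separate JVM, a runaway parse cannot exhaust the heap or CPU of the calling application beyond the time allowed.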

> Infinite loop in ExtractText in 2.x branch on a specific pdf
> 
>
> Key: PDFBOX-5415
> URL: https://issues.apache.org/jira/browse/PDFBOX-5415
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.0.26
> Reporter: Tim Allison
> Priority: Major
> Attachments: PDFBOX-5415-TIKA-3718-p10.pdf
>
>
> [~DavidAvant] reported an infinite loop in Tika and provided an example file. 
>  I can reproduce this with the latest PDFBox app 2.0.26-SNAPSHOT's 
> ExtractText.
> File: https://issues.apache.org/jira/secure/attachment/13042292/map.pdf
> Adobe and a slightly out of date pdftotext also have problems with this file.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5415) Infinite loop in ExtractText in 2.x branch on a specific pdf

2022-04-14 Thread David Avant (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522515#comment-17522515
 ] 

David Avant commented on PDFBOX-5415:
-

To summarize the verdict from Michael Demey: this PDF is wack. :)
The loop is not infinite, but its exponential growth is so bad that our star 
may go supernova before Tika finishes parsing the file.

As ridiculous as this PDF might be, I suspect we need to defend against it. 
Otherwise it becomes a potential denial-of-service attack.

Given Michael's description of the issue, does it seem plausible that this can 
be fixed within the parser itself? Or do we need some external means of 
defense, like insulating the rest of the application by running Tika within 
its own thread?
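
For context on the thread option, here is a minimal sketch of what a thread-based timeout can and cannot do, assuming the parser is CPU-bound and never checks the interrupt flag. A hypothetical busy loop stands in for the parser; ThreadTimeoutDemo and its method are illustrative names, not Tika or PDFBox code.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;
import java.util.concurrent.atomic.AtomicBoolean;

// Hypothetical demo class; the busy loop stands in for a runaway parse.
final class ThreadTimeoutDemo {

    /** Returns true if the worker is still running after the caller timed out and cancelled it. */
    static boolean workerStillRunningAfterCancel() throws Exception {
        AtomicBoolean stop = new AtomicBoolean(false);   // only used to clean up the demo
        CountDownLatch exited = new CountDownLatch(1);
        ExecutorService pool = Executors.newSingleThreadExecutor();

        Future<?> task = pool.submit(() -> {
            try {
                while (!stop.get()) {
                    // CPU-bound work that never looks at Thread.interrupted()
                }
            } finally {
                exited.countDown();
            }
        });

        try {
            task.get(100, TimeUnit.MILLISECONDS);
        } catch (TimeoutException e) {
            // The caller gives up, but cancel(true) only sets the interrupt flag.
            task.cancel(true);
        }

        // Check whether the worker actually stopped shortly after the cancel.
        boolean stillRunning = !exited.await(200, TimeUnit.MILLISECONDS);

        stop.set(true);   // let the demo thread finish
        pool.shutdown();
        return stillRunning;
    }
}
```

In this sketch the caller regains control after the timeout, but the worker thread keeps consuming CPU until the loop ends by itself, because Java offers no safe way to force-stop a thread.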




[jira] [Commented] (PDFBOX-5416) PDType1Font Can't read the embedded Type1 font ArialMT

2022-04-14 Thread Tilman Hausherr (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522369#comment-17522369
 ] 

Tilman Hausherr commented on PDFBOX-5416:
-

That's because we catch it. And then we choose a font that comes close to the 
broken one.
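
The catch-and-substitute behaviour described above can be pictured with a small self-contained sketch. This is not the PDFBox source; FontFallbackDemo, parseEmbeddedFont, loadFont and callerSees are hypothetical names, and the "parser" is a stand-in that fails on short input.

```java
import java.io.EOFException;
import java.io.IOException;

// Hypothetical illustration of a library that catches a parse error internally.
final class FontFallbackDemo {

    /** Stand-in for an embedded-font parser that fails on truncated data. */
    static String parseEmbeddedFont(byte[] data) throws IOException {
        if (data.length < 6) {
            throw new EOFException("attempted to read past EOF");
        }
        return "embedded";
    }

    /** Library-style loader: logs the failure and substitutes a fallback font. */
    static String loadFont(byte[] data) {
        try {
            return parseEmbeddedFont(data);
        } catch (IOException e) {
            // The exception is logged here and never propagates to the caller.
            System.err.println("Can't read the embedded font: " + e);
            return "fallback";
        }
    }

    /** Caller-side try/catch: it has nothing to catch for the damaged font. */
    static String callerSees(byte[] data) {
        try {
            return loadFont(data);
        } catch (Exception e) {
            return "caught";
        }
    }
}
```

With a truncated input such as new byte[0], callerSees returns "fallback" rather than "caught": the stack trace shows up in the log output, but the caller's catch blocks never run.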

> PDType1Font  Can't read the embedded Type1 font ArialMT
> -
>
> Key: PDFBOX-5416
> URL: https://issues.apache.org/jira/browse/PDFBOX-5416
> Project: PDFBox
> Issue Type: Bug
> Components: FontBox
> Affects Versions: 2.0.25
> Environment: Windows 10, JDK15, Gradle 6.7, Spring Boot 2.3.4
> Reporter: Artur
> Priority: Blocker
> Labels: pdtype1font, type1, type1font
> Attachments: label.pdf
>
>
> Hello,
> I get this error when I send a file to print:
> PDType1Font  Can't read the embedded Type1 font ArialMT
> java.io.EOFException: attempted to read past EOF
>     at org.apache.fontbox.pfb.PfbParser.parsePfb(PfbParser.java:163)
>     at org.apache.fontbox.pfb.PfbParser.<init>(PfbParser.java:125)
>     at org.apache.fontbox.type1.Type1Font.createWithPFB(Type1Font.java:69)
>     at org.apache.pdfbox.pdmodel.font.PDType1Font.<init>(PDType1Font.java:247)
>     at org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont(PDFontFactory.java:76)
>     at org.apache.pdfbox.pdmodel.PDResources.getFont(PDResources.java:146)
>     at org.apache.pdfbox.contentstream.operator.text.SetFontAndSize.process(SetFontAndSize.java:66)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:939)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:514)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:492)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:155)
>     at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:282)
>     at org.apache.pdfbox.rendering.PDFRenderer.renderPageToGraphics(PDFRenderer.java:466)
>     at org.apache.pdfbox.printing.PDFPrintable.print(PDFPrintable.java:264)
>     at java.desktop/sun.print.RasterPrinterJob.printPage(RasterPrinterJob.java:2275)
>     at java.desktop/sun.print.RasterPrinterJob.print(RasterPrinterJob.java:1654)
>
> Thread PDType1Font  Can't read the embedded Type1 font Arial-BoldMT
> java.io.EOFException: attempted to read past EOF
>     at org.apache.fontbox.pfb.PfbParser.parsePfb(PfbParser.java:163)
>     at org.apache.fontbox.pfb.PfbParser.<init>(PfbParser.java:125)
>     at org.apache.fontbox.type1.Type1Font.createWithPFB(Type1Font.java:69)
>     at org.apache.pdfbox.pdmodel.font.PDType1Font.<init>(PDType1Font.java:247)
>     at org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont(PDFontFactory.java:76)
>     at org.apache.pdfbox.pdmodel.PDResources.getFont(PDResources.java:146)
>     at org.apache.pdfbox.contentstream.operator.text.SetFontAndSize.process(SetFontAndSize.java:66)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:939)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:514)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:492)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:155)
>     at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:282)
>     at org.apache.pdfbox.rendering.PDFRenderer.renderPageToGraphics(PDFRenderer.java:466)
>     at org.apache.pdfbox.printing.PDFPrintable.print(PDFPrintable.java:264)
>     at java.desktop/sun.print.RasterPrinterJob.printPage(RasterPrinterJob.java:2275)
>     at java.desktop/sun.print.RasterPrinterJob.print(RasterPrinterJob.java:1654)






[jira] [Updated] (PDFBOX-5418) NPE during page render

2022-04-14 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated PDFBOX-5418:

Summary: NPE during page render  (was: NPE dunring page render)

> NPE during page render
> --
>
> Key: PDFBOX-5418
> URL: https://issues.apache.org/jira/browse/PDFBOX-5418
> Project: PDFBox
> Issue Type: Bug
> Components: FontBox, Rendering
> Affects Versions: 2.0.25
> Environment: openjdk 11.0.14 2022-01-18 LTS
> Reporter: Maksym
> Assignee: Tilman Hausherr
> Priority: Major
> Fix For: 2.0.26, 3.0.0 PDFBox
>
> Attachments: font_issue.pdf
>
>
>  
> Rendering page causes NPE, opening in debugger gives this stacktrace:
> {code:java}
> java.lang.RuntimeException: java.util.concurrent.ExecutionException: java.lang.NullPointerException
>     org.apache.pdfbox.debugger.pagepane.PagePane$RenderWorker.done(PagePane.java:481)
>     java.desktop/sun.swing.AccumulativeRunnable.run(AccumulativeRunnable.java:112)
>     java.base/java.security.AccessController.doPrivileged(Native Method)
>     java.base/java.security.ProtectionDomain$JavaSecurityAccessImpl.doIntersectionPrivilege(ProtectionDomain.java:85)
> Caused by: java.util.concurrent.ExecutionException: java.lang.NullPointerException
>     org.apache.pdfbox.debugger.pagepane.PagePane$RenderWorker.done(PagePane.java:461)
>     java.desktop/sun.swing.AccumulativeRunnable.run(AccumulativeRunnable.java:112)
>     java.base/java.security.AccessController.doPrivileged(Native Method)
>     java.base/java.security.ProtectionDomain$JavaSecurityAccessImpl.doIntersectionPrivilege(ProtectionDomain.java:85)
> Caused by: java.lang.NullPointerException: null
>     org.apache.fontbox.ttf.GlyfCompositeDescript.getContourCount(GlyfCompositeDescript.java:239)
>     org.apache.fontbox.ttf.GlyfCompositeDescript.resolve(GlyfCompositeDescript.java:113)
>     org.apache.fontbox.ttf.GlyphTable.getGlyphData(GlyphTable.java:216)
>     org.apache.fontbox.ttf.GlyphTable.getGlyph(GlyphTable.java:191)
>     org.apache.pdfbox.pdmodel.font.PDCIDFontType2.getPath(PDCIDFontType2.java:418)
>     org.apache.pdfbox.pdmodel.font.PDType0Font.getPath(PDType0Font.java:620)
>     org.apache.pdfbox.rendering.TTFGlyph2D.getPathForGID(TTFGlyph2D.java:144)
>     org.apache.pdfbox.rendering.TTFGlyph2D.getPathForCharacterCode(TTFGlyph2D.java:93)
>     org.apache.pdfbox.rendering.PageDrawer.drawGlyph2D(PageDrawer.java:514)
>     org.apache.pdfbox.rendering.PageDrawer.showFontGlyph(PageDrawer.java:495)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.showGlyph(PDFStreamEngine.java:793)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.showGlyph(PDFStreamEngine.java:811)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.showText(PDFStreamEngine.java:749)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.showTextStrings(PDFStreamEngine.java:654)
>     org.apache.pdfbox.contentstream.operator.text.ShowTextAdjusted.process(ShowTextAdjusted.java:53)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:939)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:514)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:492)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.showForm(PDFStreamEngine.java:186)
>     org.apache.pdfbox.rendering.PageDrawer.showForm(PageDrawer.java:1558)
>     org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:85)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:939)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:514)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:492)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.showForm(PDFStreamEngine.java:186)
>     org.apache.pdfbox.rendering.PageDrawer.showForm(PageDrawer.java:1558)
>     org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:85)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:939)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:514)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:492)
>     org.apache.pdfbox.contentstream.PDFStreamEngine.showForm(PDFStreamEngine.java:186)
>     org.apache.pdfbox.rendering.PageDrawer.showForm(PageDrawer.java:1558)
>     org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:85)
>     
> 

[jira] [Updated] (PDFBOX-5418) NPE dunring page render

2022-04-14 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated PDFBOX-5418:

Component/s: FontBox


[jira] [Resolved] (PDFBOX-5418) NPE dunring page render

2022-04-14 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr resolved PDFBOX-5418.
-
Fix Version/s: 2.0.26
   3.0.0 PDFBox
 Assignee: Tilman Hausherr
   Resolution: Fixed

Thanks; a snapshot build will be available at
https://repository.apache.org/content/groups/snapshots/org/apache/pdfbox/pdfbox-app/2.0.26-SNAPSHOT/
within a few hours.


[jira] [Commented] (PDFBOX-5418) NPE dunring page render

2022-04-14 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522361#comment-17522361
 ] 

ASF subversion and git services commented on PDFBOX-5418:
-

Commit 1899860 from Tilman Hausherr in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1899860 ]

PDFBOX-5418: avoid NPE


[jira] [Commented] (PDFBOX-5418) NPE dunring page render

2022-04-14 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522360#comment-17522360
 ] 

ASF subversion and git services commented on PDFBOX-5418:
-

Commit 1899859 from Tilman Hausherr in branch 'pdfbox/branches/2.0'
[ https://svn.apache.org/r1899859 ]

PDFBOX-5418: avoid NPE


[jira] [Commented] (PDFBOX-5416) PDType1Font Can't read the embedded Type1 font ArialMT

2022-04-14 Thread Artur (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522278#comment-17522278
 ] 

Artur commented on PDFBOX-5416:
---

Thank you for your answer. I have one more question. Maybe I'm missing 
something or I'm not educated enough :) but why can't I catch the EOFException 
in this code?

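// Imports needed by the snippet (PDFBox 2.0.x):
import java.io.EOFException;
import java.io.File;
import java.io.IOException;
import org.apache.pdfbox.cos.COSName;
import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.pdmodel.PDPage;
import org.apache.pdfbox.pdmodel.PDResources;
import org.apache.pdfbox.pdmodel.font.PDFont;

// Inside a method; try-with-resources closes the document afterwards.
try (PDDocument pddDocument = PDDocument.load(new File("D:\\label.pdf"))) {
    for (int i = 0; i < pddDocument.getNumberOfPages(); i++) {
        PDPage page = pddDocument.getPage(i);
        PDResources res = page.getResources();
        for (COSName fontName : res.getFontNames()) {
            // Loading the font is where the embedded Type1 data gets parsed.
            PDFont font = res.getFont(fontName);
        }
    }
} catch (EOFException e) {
    System.out.println("Error");
} catch (IOException e) {
    System.out.println("Second error");
} catch (Exception e) {
    System.out.println("Third error");
}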
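
A likely explanation, going by the report quoted below: the EOFException shows 
up in the log behind the message "Can't read the embedded Type1 font", which 
suggests it is caught and logged inside the library rather than rethrown, so 
the caller's catch blocks never see it. The following is a minimal, 
stdlib-only sketch of that pattern. LoggingLoader, parse and loadFont are 
hypothetical stand-ins invented for illustration; they are not PDFBox classes 
or methods.

```java
import java.io.EOFException;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a library that swallows parse errors.
// This is NOT PDFBox code; it only illustrates the catch-and-log pattern.
class LoggingLoader {
    // Collected log lines, so the caller can inspect what was "logged".
    static final List<String> LOG = new ArrayList<>();

    // Simulates a parser that hits the end of the data too early.
    static byte[] parse(String name) throws IOException {
        throw new EOFException("attempted to read past EOF");
    }

    // Catches the failure internally, logs it, and returns a fallback name.
    static String loadFont(String name) {
        try {
            parse(name);
            return name;
        } catch (IOException e) {
            LOG.add("Can't read the embedded font " + name + ": " + e);
            return "fallback";
        }
    }

    // Caller side: the catch block is never reached, because loadFont
    // handles the exception itself and does not rethrow it.
    static String callerSees() {
        try {
            return loadFont("ArialMT");
        } catch (Exception e) {
            return "caught";
        }
    }

    public static void main(String[] args) {
        System.out.println(callerSees());
        System.out.println(LOG);
    }
}
```

If that is what happens here, the calling code has to look at the loaded font 
object or at the log instead of relying on a catch block. In PDFBox 2.0.x, 
PDFont appears to offer isDamaged() for this purpose; check the Javadoc of the 
version in use before depending on it.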

> PDType1Font  Can't read the embedded Type1 font ArialMT
> -
>
> Key: PDFBOX-5416
> URL: https://issues.apache.org/jira/browse/PDFBOX-5416
> Project: PDFBox
>  Issue Type: Bug
>  Components: FontBox
>Affects Versions: 2.0.25
> Environment: Windows 10, JDK15, Gradle 6.7, Spring Boot 2.3.4
>Reporter: Artur
>Priority: Blocker
>  Labels: pdtype1font, type1, type1font
> Attachments: label.pdf
>
>
> Hello,
> I have this error when I send a file to print:
> PDType1Font  Can't read the embedded Type1 font ArialMT
> java.io.EOFException: attempted to read past EOF
>     at org.apache.fontbox.pfb.PfbParser.parsePfb(PfbParser.java:163)
>     at org.apache.fontbox.pfb.PfbParser.<init>(PfbParser.java:125)
>     at org.apache.fontbox.type1.Type1Font.createWithPFB(Type1Font.java:69)
>     at org.apache.pdfbox.pdmodel.font.PDType1Font.<init>(PDType1Font.java:247)
>     at org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont(PDFontFactory.java:76)
>     at org.apache.pdfbox.pdmodel.PDResources.getFont(PDResources.java:146)
>     at org.apache.pdfbox.contentstream.operator.text.SetFontAndSize.process(SetFontAndSize.java:66)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:939)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:514)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:492)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:155)
>     at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:282)
>     at org.apache.pdfbox.rendering.PDFRenderer.renderPageToGraphics(PDFRenderer.java:466)
>     at org.apache.pdfbox.printing.PDFPrintable.print(PDFPrintable.java:264)
>     at java.desktop/sun.print.RasterPrinterJob.printPage(RasterPrinterJob.java:2275)
>     at java.desktop/sun.print.RasterPrinterJob.print(RasterPrinterJob.java:1654)
>  
> Thread PDType1Font  Can't read the embedded Type1 font Arial-BoldMT
> java.io.EOFException: attempted to read past EOF
>     at org.apache.fontbox.pfb.PfbParser.parsePfb(PfbParser.java:163)
>     at org.apache.fontbox.pfb.PfbParser.<init>(PfbParser.java:125)
>     at org.apache.fontbox.type1.Type1Font.createWithPFB(Type1Font.java:69)
>     at org.apache.pdfbox.pdmodel.font.PDType1Font.<init>(PDType1Font.java:247)
>     at org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont(PDFontFactory.java:76)
>     at org.apache.pdfbox.pdmodel.PDResources.getFont(PDResources.java:146)
>     at org.apache.pdfbox.contentstream.operator.text.SetFontAndSize.process(SetFontAndSize.java:66)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:939)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:514)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:492)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:155)
>     at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:282)
>     at org.apache.pdfbox.rendering.PDFRenderer.renderPageToGraphics(PDFRenderer.java:466)
>     at org.apache.pdfbox.printing.PDFPrintable.print(PDFPrintable.java:264)
>     at java.desktop/sun.print.RasterPrinterJob.printPage(RasterPrinterJob.java:2275)
>     at java.desktop/sun.print.RasterPrinterJob.print(RasterPrinterJob.java:1654)






Re: 2.0.26 release

2022-04-14 Thread Tilman Hausherr

On 14.04.2022 at 08:13, Andreas Lehmkuehler wrote:

Cool, thanks for the feedback. I've set the ticket to resolved.

Do we need to re-run the tests?

BTW, what about PDFBOX-5394? Is there anything left to do? Do we have 
to wait for the feedback of the user? 


I've set that one to resolved.

I will rerun the tests myself on the long weekend, if I have the time.

Tilman





[jira] [Resolved] (PDFBOX-5394) Render symbol for file attachment annotations

2022-04-14 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr resolved PDFBOX-5394.
-
Resolution: Fixed

Set to resolved despite lack of user feedback. I'm confident that this works.

> Render symbol for file attachment annotations
> -
>
> Key: PDFBOX-5394
> URL: https://issues.apache.org/jira/browse/PDFBOX-5394
> Project: PDFBox
>  Issue Type: Improvement
>  Components: Rendering
>Affects Versions: 2.0.25
>Reporter: John Smith
>Assignee: Tilman Hausherr
>Priority: Minor
>  Labels: Annotations, annotation
> Fix For: 2.0.26, 3.0.0 PDFBox
>
> Attachments: paperclip.pdf
>
>
> Hi, I need PDFBox to render a paper clip symbol for file attachments, like 
> the one shown when you open a PDF with an attachment in Adobe PDF Reader. 
> PDFBox currently does not render any icon for this.
> I suggest using e.g. this MIT licensed icon:
> [Paper Clip 
> Icon|https://raw.githubusercontent.com/Iconscout/unicons/master/svg/line/paperclip.svg]






[jira] [Commented] (PDFBOX-5416) PDType1Font Can't read the embedded Type1 font ArialMT

2022-04-14 Thread Tilman Hausherr (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522098#comment-17522098
 ] 

Tilman Hausherr commented on PDFBOX-5416:
-

Then you need to reproduce this freeze effect. The problem you mention is in 
your file, and its only effect is that your log files get cluttered.

I don't understand why they used embedded fonts at all. This is just a label, 
not a long-term PDF; they could use the standard 14 fonts.
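
One way to gather evidence for the freeze mentioned above is to run the 
suspect call with a time limit and record what every thread is doing when the 
limit expires. The sketch below uses only the Java standard library. 
FreezeProbe, runWithTimeout and the "probe-worker" thread name are 
hypothetical, and the sleeping task in main is a stand-in for the real print 
call.

```java
import java.util.Map;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

// Hypothetical helper: runs a task with a time limit and, if the limit is
// exceeded, returns the stack traces of all live threads as text.
class FreezeProbe {
    // Returns null if the task finished in time, otherwise a thread dump.
    static String runWithTimeout(Runnable task, long timeoutMillis) throws Exception {
        ExecutorService pool = Executors.newSingleThreadExecutor(r -> {
            Thread t = new Thread(r, "probe-worker");
            t.setDaemon(true); // do not keep the JVM alive if the task hangs
            return t;
        });
        Future<?> future = pool.submit(task);
        try {
            future.get(timeoutMillis, TimeUnit.MILLISECONDS);
            return null;
        } catch (TimeoutException e) {
            String dump = dumpAllThreads(); // capture before interrupting
            future.cancel(true);
            return dump;
        } finally {
            pool.shutdownNow();
        }
    }

    // Formats the current stack trace of every live thread.
    static String dumpAllThreads() {
        StringBuilder sb = new StringBuilder();
        for (Map.Entry<Thread, StackTraceElement[]> e : Thread.getAllStackTraces().entrySet()) {
            sb.append(e.getKey().getName()).append('\n');
            for (StackTraceElement frame : e.getValue()) {
                sb.append("    at ").append(frame).append('\n');
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) throws Exception {
        // Stand-in for the print call; replace with the real job.
        Runnable stuck = () -> {
            try { Thread.sleep(60_000); } catch (InterruptedException ignored) { }
        };
        System.out.println(runWithTimeout(stuck, 200));
    }
}
```

A dump taken this way shows which frames the worker is stuck in, which is the 
kind of detail that makes a freeze report reproducible. Running jstack against 
the frozen process gives similar information without code changes.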




[jira] [Commented] (PDFBOX-5416) PDType1Font Can't read the embedded Type1 font ArialMT

2022-04-14 Thread Artur (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522093#comment-17522093
 ] 

Artur commented on PDFBOX-5416:
---

Yes. Most of the time I get a decent, correct print, but once in a while the 
program freezes.




Re: 2.0.26 release

2022-04-14 Thread Andreas Lehmkuehler

Cool, thanks for the feedback. I've set the ticket to resolved.

Do we need to re-run the tests?

BTW, what about PDFBOX-5394? Is there anything left to do? Do we have to wait 
for the feedback of the user?


Andreas

On 13.04.22 at 08:29, Tilman Hausherr wrote:

Yeah, PDFBOX-5413 fixes that one as well. 

Tilman

On 12.04.2022 at 19:26, Tilman Hausherr wrote:

Only one left: 7LRS5U6CAFMN2P6JPTZVNBUW6XOFYH4M.pdf.

There is some sort of problem with an incremental save: a part of the 
multi-content stream is missing or has a new object number. Let's wait and see 
whether it is related to PDFBOX-5413.


(The other one, HOAZTST4E26NPA7HL72WCIVMNRQ3E4M5.pdf, is an improvement; I'll 
add it to my own tests.)


Tilman

On 12.04.2022 at 18:25, Tilman Hausherr wrote:

Only these two have a different text extraction:
commoncrawl3/7L/7LRS5U6CAFMN2P6JPTZVNBUW6XOFYH4M
commoncrawl3/HO/HOAZTST4E26NPA7HL72WCIVMNRQ3E4M5

With the other two, the differences are in attachment file names or doc info.

Tilman

On 12.04.2022 at 08:16, Tilman Hausherr wrote:
After having looked at the content differences and trying to rule out the 
/Names differences, there are 4 files with content in TOP_10_MORE_IN_A that 
feel suspicious and IMHO need investigation.


commoncrawl3/7L/7LRS5U6CAFMN2P6JPTZVNBUW6XOFYH4M
govdocs1/365/365260.pdf
commoncrawl3/HO/HOAZTST4E26NPA7HL72WCIVMNRQ3E4M5
govdocs1/150/150282.pdf

Tilman



On 12.04.2022 at 08:09, Andreas Lehmkuehler wrote:

Thanks Tim!

Looks like there are 5 new exceptions left.

I'm going to check the first two:

commoncrawl3/ZC/ZCY5MCL7KI6QXVMXUZ2AJKXICQIT4TL4
commoncrawl3/WY/WYPJNTD5KQNODSXWK4GABURXRTTD5P4H

The others are thrown within Jempbox 


Andreas

On 11.04.22 at 12:40, Tim Allison wrote:

https://corpora.tika.apache.org/base/reports/tika-2.4-20220410.tgz

Haven't had a chance to review.  Hot off the vm.

On Sun, Apr 10, 2022 at 9:58 AM Tim Allison wrote:


Will try to kick off today…first thing Monday morning (EDT) at the latest.

On Sun, Apr 10, 2022 at 9:05 AM Andreas Lehmkuehler wrote:


On 09.04.22 at 19:00, Tilman Hausherr wrote:

testFlattenPDFBOX2469Filled also fails in 2.0 (it is disabled by default).

I've fixed all new tickets. PDFBOX-5413 fixes the issue with the disabled
flatten test.

@Tim Is there any chance to re-run the tests?

Andreas



testFlattenPDFBOX2469Filled(org.apache.pdfbox.pdmodel.interactive.form.PDAcroFormFlattenTest)
Time elapsed: 1.083 s  <<< ERROR!
java.io.IOException: javax.crypto.BadPaddingException: Given final block not properly padded. Such issues can arise if a bad key is used during decryption.
  at org.apache.pdfbox.pdmodel.interactive.form.PDAcroFormFlattenTest.generateSamples(PDAcroFormFlattenTest.java:345)
  at org.apache.pdfbox.pdmodel.interactive.form.PDAcroFormFlattenTest.flattenAndCompare(PDAcroFormFlattenTest.java:309)
  at org.apache.pdfbox.pdmodel.interactive.form.PDAcroFormFlattenTest.testFlattenPDFBOX2469Filled(PDAcroFormFlattenTest.java:105)
Caused by: javax.crypto.BadPaddingException: Given final block not properly padded. Such issues can arise if a bad key is used during decryption.
  at org.apache.pdfbox.pdmodel.interactive.form.PDAcroFormFlattenTest.generateSamples(PDAcroFormFlattenTest.java:345)
  at org.apache.pdfbox.pdmodel.interactive.form.PDAcroFormFlattenTest.flattenAndCompare(PDAcroFormFlattenTest.java:309)
  at org.apache.pdfbox.pdmodel.interactive.form.PDAcroFormFlattenTest.testFlattenPDFBOX2469Filled(PDAcroFormFlattenTest.java:105)

I'm not creating an issue this time in case this is also related to another 
known problem.

Tilman







[jira] [Commented] (PDFBOX-5413) Field text missing

2022-04-14 Thread Jira


[ 
https://issues.apache.org/jira/browse/PDFBOX-5413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522073#comment-17522073
 ] 

Andreas Lehmkühler commented on PDFBOX-5413:


Looks like all regressions are solved; set to resolved.


> Field text missing
> --
>
> Key: PDFBOX-5413
> URL: https://issues.apache.org/jira/browse/PDFBOX-5413
> Project: PDFBox
>  Issue Type: Bug
>  Components: Parsing
>Affects Versions: 2.0.26, 3.0.0 PDFBox
>Reporter: Tilman Hausherr
>Assignee: Andreas Lehmkühler
>Priority: Major
>  Labels: regression
> Fix For: 2.0.26, 3.0.0 PDFBox
>
> Attachments: CZIB6B5RY5HQDSEXXWSGUHSAP75CAI7Q.pdf, 
> WYPJNTD5KQNODSXWK4GABURXRTTD5P4H.pdf, ZCY5MCL7KI6QXVMXUZ2AJKXICQIT4TL4.pdf
>
>
> The bottom field on page 2 ("AREA OF CONSIDERATION") is missing.
> This worked in 2.0.25. This is a weird case: incrementally written object 11 
> points to 102796. However there is a "5" just before the 11.






[jira] [Resolved] (PDFBOX-5413) Field text missing

2022-04-14 Thread Jira


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler resolved PDFBOX-5413.

Resolution: Fixed
