Re: 2.0.26 release

2022-04-07 Thread Andreas Lehmkuehler

Thanks Tim!

I've checked the first files of the new exceptions and there seems to be at 
least one new regression:

commoncrawl3/ZC/ZCY5MCL7KI6QXVMXUZ2AJKXICQIT4TL4
commoncrawl3/WY/WYPJNTD5KQNODSXWK4GABURXRTTD5P4H
commoncrawl3/YI/YIEMGIQYGXCQ5AZOE35ESXYCZHWR3V57
commoncrawl3_refetched/5C/5CWAUHFCZMK42IHSMSKNIR3MHXHR4IRN

All render fine using 2.0.25 but throw an exception using 2.0.26.

I'm going to have a deeper look later
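
The comparison described above (files that were fine with 2.0.25 but throw with 2.0.26) boils down to a set difference over the failing file IDs of two runs. A minimal sketch in plain Java, assuming the failing IDs of each run have already been collected into sets; the class and method names are hypothetical and not part of PDFBox or the regression tooling:

```java
import java.util.Set;
import java.util.TreeSet;

// Hypothetical helper for illustration only; not part of PDFBox or Tika.
final class RegressionDiff {
    private RegressionDiff() {
    }

    /**
     * Returns the IDs that fail with the new version but did not fail
     * with the old one, i.e. the candidate regressions.
     */
    static Set<String> newFailures(Set<String> failedOld, Set<String> failedNew) {
        // Copy first so the caller's set is left untouched; TreeSet keeps the report sorted.
        Set<String> result = new TreeSet<>(failedNew);
        result.removeAll(failedOld);
        return result;
    }
}
```

Files that already failed with the old version drop out, so only candidates for a closer look remain.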

On 07.04.22 at 20:27, Tim Allison wrote:

> https://corpora.tika.apache.org/base/reports/pdfbox-2.0.26-snapshot-reports.tgz
>
> I haven't had a chance to look at them yet.
>
> On Thu, Apr 7, 2022 at 9:07 AM Andreas Lehmkühler wrote:
>
>> Yes, please
>>
>> Thanks in advance
>> Andreas
>>
>> 07.04.2022 11:44:38 Tim Allison:
>>
>>> Sounds great! Should I rerun the regression tests today?
>>>
>>> On Thu, Apr 7, 2022 at 1:41 AM Andreas Lehmkuehler wrote:
>>>
>>>> Hi,
>>>>
>>>> sorry for the delay. I'm planning to cut the 2.0.26 release next
>>>> Saturday, the day after tomorrow, if nobody objects.
>>>>
>>>> Andreas
>>>>
>>>> P.S.: I'm targeting a new 3.0.0 alpha release once the 2.0.26 release
>>>> is out

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org







[jira] [Commented] (PDFBOX-5403) Blurry / distorted rendering

2022-04-07 Thread Tilman Hausherr (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17519114#comment-17519114
 ] 

Tilman Hausherr commented on PDFBOX-5403:
-

Ah, you improved the code I proposed. (Btw, there's an X-Y mismatch bug in the 
"imageTransform.scale(" line.)
This works really nicely with your file. Yeah, the "Schleuse" file is flawed. 
And PDFBOX-2171-002-002710-p14.pdf ("New Harmony Consolidated" and "Sailor 
Springs" patterns).

> Blurry / distorted rendering
> 
>
> Key: PDFBOX-5403
> URL: https://issues.apache.org/jira/browse/PDFBOX-5403
> Project: PDFBox
>  Issue Type: Bug
>  Components: Rendering
>Affects Versions: 2.0.25
>Reporter: Oliver Schmidtmer
>Priority: Major
> Attachments: alpha_interpolation.patch, bad rendering.pdf, 
> image-2022-03-29-18-29-03-860.png, image-2022-03-30-14-39-12-855.png, 
> image-2022-03-31-11-55-36-061.png, inver-smooth-scale.patch
>
>
> The attached PDF uses many stripes with an image and a mask.
> There seems to be an issue when mask and image are combined.
> !image-2022-03-29-18-29-03-860.png|width=518,height=91!



--
This message was sent by Atlassian Jira
(v8.20.1#820001)




Re: 2.0.26 release

2022-04-07 Thread Tim Allison
https://corpora.tika.apache.org/base/reports/pdfbox-2.0.26-snapshot-reports.tgz

I haven't had a chance to look at them yet.

On Thu, Apr 7, 2022 at 9:07 AM Andreas Lehmkühler  wrote:
>
> Yes, please
>
> Thanks in advance
> Andreas
>
> 07.04.2022 11:44:38 Tim Allison :
>
> > Sounds great! Should I rerun the regression tests today?
> >
> > On Thu, Apr 7, 2022 at 1:41 AM Andreas Lehmkuehler  wrote:
> >
> >> Hi,
> >>
> >> sorry for the delay.  I'm planning to cut the 2.0.26 release next
> >> Saturday, the
> >> day after tomorrow, if nobody objects.
> >>
> >> Andreas
> >>
> >> P.S.: I'm targeting a new 3.0.0 alpha release once the 2.0.26 release is
> >> out
> >>



[jira] [Updated] (PDFBOX-5409) Image/figure Bounding Box

2022-04-07 Thread Jake Marasovic (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jake Marasovic updated PDFBOX-5409:
---
Labels: beginner  (was: )

> Image/figure Bounding Box
> -
>
> Key: PDFBOX-5409
> URL: https://issues.apache.org/jira/browse/PDFBOX-5409
> Project: PDFBox
>  Issue Type: Wish
>Reporter: Jake Marasovic
>Priority: Major
>  Labels: beginner
>
> Hi, I'm trying to make a PDF that passes PAC 3 (PDF accessibility checker) 
> tests and their report is saying I need to set a bounding box for images. Is 
> this already a feature? I've looked through all the methods related to images 
> and haven't been able to find one. 






[jira] [Created] (PDFBOX-5409) Image/figure Bounding Box

2022-04-07 Thread Jake Marasovic (Jira)
Jake Marasovic created PDFBOX-5409:
--

 Summary: Image/figure Bounding Box
 Key: PDFBOX-5409
 URL: https://issues.apache.org/jira/browse/PDFBOX-5409
 Project: PDFBox
  Issue Type: Wish
Reporter: Jake Marasovic


Hi, I'm trying to make a PDF that passes PAC 3 (PDF accessibility checker) 
tests and their report is saying I need to set a bounding box for images. Is 
this already a feature? I've looked through all the methods related to images 
and haven't been able to find one. 






[jira] [Resolved] (PDFBOX-5396) Add maven enforcer rule to ensure that JAVA_HOME is set

2022-04-07 Thread Tim Allison (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison resolved PDFBOX-5396.
-
Fix Version/s: 2.0.26
   Resolution: Fixed

> Add maven enforcer rule to ensure that JAVA_HOME is set
> ---
>
> Key: PDFBOX-5396
> URL: https://issues.apache.org/jira/browse/PDFBOX-5396
> Project: PDFBox
>  Issue Type: Task
>Affects Versions: 2.0.25
>Reporter: Tim Allison
>Priority: Trivial
> Fix For: 2.0.26
>
>
> I recently stubbed my toe on this one again.  At least in the 2.x branch, the 
> module fontbox requires that the JAVA_HOME variable be set.  If it isn't set, 
> the project build fails in fontbox without any meaningful indication as to 
> why, even with the -X option set in maven.
> {noformat}
> (default-compile) on project fontbox: Compilation failure -> [Help 1]
> org.apache.maven.lifecycle.LifecycleExecutionException: Failed to
> execute goal org.apache.maven.plugins:maven-compiler-plugin:3.6.0:compile
> (default-compile) on project fontbox: Compilation failure
> at org.apache.maven.lifecycle.internal.MojoExecutor.execute
> {noformat}
> Also, on our website, there's no mention that JAVA_HOME should be set.  And, 
> yes, I realize that it is set on most developers' systems. :D
> One solution would be to add this rule to the maven-enforcer-plugin 
> configuration in the parent pom:
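> {code:java}
> <requireEnvironmentVariable>
>   <variableName>JAVA_HOME</variableName>
>   <message>The JAVA_HOME environment variable must be set!</message>
> </requireEnvironmentVariable>
> {code}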
> If this is ok, I'll add this rule in 2.x and see if I get the same behavior 
> in trunk.
> Side note: This was probably the cause of: 
> https://www.mail-archive.com/users@pdfbox.apache.org/msg11423.html and a few 
> other issues.
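
The effect of such a rule is to fail fast with a clear message instead of an opaque compilation failure. As a rough illustration of that behaviour outside Maven, here is a minimal sketch in plain Java. The class and method names are hypothetical, and treating a blank value as "not set" is an assumption of this sketch, not a documented property of the enforcer rule:

```java
import java.util.Map;
import java.util.Optional;

// Hypothetical illustration; in the build itself the check is done by the maven-enforcer-plugin.
final class JavaHomeCheck {
    static final String MESSAGE = "The JAVA_HOME environment variable must be set!";

    private JavaHomeCheck() {
    }

    /**
     * Returns the failure message if JAVA_HOME is missing or blank in the
     * given environment, otherwise an empty Optional.
     */
    static Optional<String> check(Map<String, String> env) {
        String value = env.get("JAVA_HOME");
        if (value == null || value.trim().isEmpty()) {
            return Optional.of(MESSAGE);
        }
        return Optional.empty();
    }
}
```

Calling `JavaHomeCheck.check(System.getenv())` at startup would report the missing variable before any later step fails for a less obvious reason.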






[jira] [Commented] (PDFBOX-5396) Add maven enforcer rule to ensure that JAVA_HOME is set

2022-04-07 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17518895#comment-17518895
 ] 

ASF subversion and git services commented on PDFBOX-5396:
-

Commit 1899651 from Tim Allison in branch 'pdfbox/branches/2.0'
[ https://svn.apache.org/r1899651 ]

PDFBOX-5396 -- add meaningful message when JAVA_HOME is not set.

> Add maven enforcer rule to ensure that JAVA_HOME is set
> ---
>
> Key: PDFBOX-5396
> URL: https://issues.apache.org/jira/browse/PDFBOX-5396
> Project: PDFBox
>  Issue Type: Task
>Affects Versions: 2.0.25
>Reporter: Tim Allison
>Priority: Trivial
>
> I recently stubbed my toe on this one again.  At least in the 2.x branch, the 
> module fontbox requires that the JAVA_HOME variable be set.  If it isn't set, 
> the project build fails in fontbox without any meaningful indication as to 
> why, even with the -X option set in maven.
> {noformat}
> (default-compile) on project fontbox: Compilation failure -> [Help 1]
> org.apache.maven.lifecycle.LifecycleExecutionException: Failed to
> execute goal org.apache.maven.plugins:maven-compiler-plugin:3.6.0:compile
> (default-compile) on project fontbox: Compilation failure
> at org.apache.maven.lifecycle.internal.MojoExecutor.execute
> {noformat}
> Also, on our website, there's no mention that JAVA_HOME should be set.  And, 
> yes, I realize that it is set on most developers' systems. :D
> One solution would be to add this rule to the maven-enforcer-plugin 
> configuration in the parent pom:
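> {code:java}
> <requireEnvironmentVariable>
>   <variableName>JAVA_HOME</variableName>
>   <message>The JAVA_HOME environment variable must be set!</message>
> </requireEnvironmentVariable>
> {code}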
> If this is ok, I'll add this rule in 2.x and see if I get the same behavior 
> in trunk.
> Side note: This was probably the cause of: 
> https://www.mail-archive.com/users@pdfbox.apache.org/msg11423.html and a few 
> other issues.






Re: 2.0.26 release

2022-04-07 Thread Andreas Lehmkühler
Yes, please

Thanks in advance
Andreas

07.04.2022 11:44:38 Tim Allison :

> Sounds great! Should I rerun the regression tests today?
> 
> On Thu, Apr 7, 2022 at 1:41 AM Andreas Lehmkuehler  wrote:
> 
>> Hi,
>> 
>> sorry for the delay.  I'm planning to cut the 2.0.26 release next
>> Saturday, the
>> day after tomorrow, if nobody objects.
>> 
>> Andreas
>> 
>> P.S.: I'm targeting a new 3.0.0 alpha release once the 2.0.26 release is
>> out
>> 



Re: 2.0.26 release

2022-04-07 Thread Tim Allison
Sounds great! Should I rerun the regression tests today?

On Thu, Apr 7, 2022 at 1:41 AM Andreas Lehmkuehler  wrote:

> Hi,
>
> sorry for the delay.  I'm planning to cut the 2.0.26 release next
> Saturday, the
> day after tomorrow, if nobody objects.
>
> Andreas
>
> P.S.: I'm targeting a new 3.0.0 alpha release once the 2.0.26 release is
> out
>


[jira] [Commented] (PDFBOX-5407) Fields visible on click if NeedAppearances = false

2022-04-07 Thread Maruan Sahyoun (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17518634#comment-17518634
 ] 

Maruan Sahyoun commented on PDFBOX-5407:


I've tested the appearance generation using the upcoming 2.0 version. The 
results look fine to me.

Generation was done on my Linux workstation. Viewing was done on a macOS 12 
desktop. I also installed Roboto on the Linux workstation to be able to use 
that font.

 

> Fields visible on click if NeedAppearances = false
> --
>
> Key: PDFBOX-5407
> URL: https://issues.apache.org/jira/browse/PDFBOX-5407
> Project: PDFBox
>  Issue Type: Bug
>  Components: AcroForm, Rendering
>Affects Versions: 2.0.24
>Reporter: Dmitry Betanov
>Priority: Minor
>  Labels: Appearance
> Attachments: Main.java, Safari.png, input.pdf, invisible_fields.mov, 
> macOS-Preview.png, output-acrobat.pdf, output-pdfbox-macOS-preview.png, 
> output-pdfbox-safari.png, output-pdfbox.pdf, output.pdf
>
>
> We have an issue: if we use NeedAppearances = false, some of the input 
> field values are only visible on click.
> This happens only in a few viewers, such as Safari and the macOS default PDF 
> viewer.
> The problem is that we cannot use NeedAppearances = true, which is what 
> similar issues suggest.
> PDFBox version: 2.0.24, JDK: 11.
> The video shows an example of invisible values in the macOS default PDF 
> viewer and visible values in Chrome.






[jira] [Updated] (PDFBOX-5407) Fields visible on click if NeedAppearances = false

2022-04-07 Thread Maruan Sahyoun (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maruan Sahyoun updated PDFBOX-5407:
---
Attachment: output-pdfbox-macOS-preview.png
output-pdfbox-safari.png

> Fields visible on click if NeedAppearances = false
> --
>
> Key: PDFBOX-5407
> URL: https://issues.apache.org/jira/browse/PDFBOX-5407
> Project: PDFBox
>  Issue Type: Bug
>  Components: AcroForm, Rendering
>Affects Versions: 2.0.24
>Reporter: Dmitry Betanov
>Priority: Minor
>  Labels: Appearance
> Attachments: Main.java, Safari.png, input.pdf, invisible_fields.mov, 
> macOS-Preview.png, output-acrobat.pdf, output-pdfbox-macOS-preview.png, 
> output-pdfbox-safari.png, output-pdfbox.pdf, output.pdf
>
>
> We have an issue: if we use NeedAppearances = false, some of the input 
> field values are only visible on click.
> This happens only in a few viewers, such as Safari and the macOS default PDF 
> viewer.
> The problem is that we cannot use NeedAppearances = true, which is what 
> similar issues suggest.
> PDFBox version: 2.0.24, JDK: 11.
> The video shows an example of invisible values in the macOS default PDF 
> viewer and visible values in Chrome.






[jira] [Updated] (PDFBOX-5407) Fields visible on click if NeedAppearances = false

2022-04-07 Thread Maruan Sahyoun (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maruan Sahyoun updated PDFBOX-5407:
---
Attachment: output-pdfbox.pdf

> Fields visible on click if NeedAppearances = false
> --
>
> Key: PDFBOX-5407
> URL: https://issues.apache.org/jira/browse/PDFBOX-5407
> Project: PDFBox
>  Issue Type: Bug
>  Components: AcroForm, Rendering
>Affects Versions: 2.0.24
>Reporter: Dmitry Betanov
>Priority: Minor
>  Labels: Appearance
> Attachments: Main.java, Safari.png, input.pdf, invisible_fields.mov, 
> macOS-Preview.png, output-acrobat.pdf, output-pdfbox.pdf, output.pdf
>
>
> We have an issue: if we use NeedAppearances = false, some of the input 
> field values are only visible on click.
> This happens only in a few viewers, such as Safari and the macOS default PDF 
> viewer.
> The problem is that we cannot use NeedAppearances = true, which is what 
> similar issues suggest.
> PDFBox version: 2.0.24, JDK: 11.
> The video shows an example of invisible values in the macOS default PDF 
> viewer and visible values in Chrome.






[jira] [Updated] (PDFBOX-5407) Fields visible on click if NeedAppearances = false

2022-04-07 Thread Maruan Sahyoun (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maruan Sahyoun updated PDFBOX-5407:
---
Attachment: Safari.png
macOS-Preview.png

> Fields visible on click if NeedAppearances = false
> --
>
> Key: PDFBOX-5407
> URL: https://issues.apache.org/jira/browse/PDFBOX-5407
> Project: PDFBox
>  Issue Type: Bug
>  Components: AcroForm, Rendering
>Affects Versions: 2.0.24
>Reporter: Dmitry Betanov
>Priority: Minor
>  Labels: Appearance
> Attachments: Main.java, Safari.png, input.pdf, invisible_fields.mov, 
> macOS-Preview.png, output-acrobat.pdf, output.pdf
>
>
> We have an issue: if we use NeedAppearances = false, some of the input 
> field values are only visible on click.
> This happens only in a few viewers, such as Safari and the macOS default PDF 
> viewer.
> The problem is that we cannot use NeedAppearances = true, which is what 
> similar issues suggest.
> PDFBox version: 2.0.24, JDK: 11.
> The video shows an example of invisible values in the macOS default PDF 
> viewer and visible values in Chrome.






[jira] [Updated] (PDFBOX-5407) Fields visible on click if NeedAppearances = false

2022-04-07 Thread Maruan Sahyoun (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maruan Sahyoun updated PDFBOX-5407:
---
Attachment: output-acrobat.pdf

> Fields visible on click if NeedAppearances = false
> --
>
> Key: PDFBOX-5407
> URL: https://issues.apache.org/jira/browse/PDFBOX-5407
> Project: PDFBox
>  Issue Type: Bug
>  Components: AcroForm, Rendering
>Affects Versions: 2.0.24
>Reporter: Dmitry Betanov
>Priority: Minor
>  Labels: Appearance
> Attachments: Main.java, input.pdf, invisible_fields.mov, 
> output-acrobat.pdf, output.pdf
>
>
> We have an issue: if we use NeedAppearances = false, some of the input 
> field values are only visible on click.
> This happens only in a few viewers, such as Safari and the macOS default PDF 
> viewer.
> The problem is that we cannot use NeedAppearances = true, which is what 
> similar issues suggest.
> PDFBox version: 2.0.24, JDK: 11.
> The video shows an example of invisible values in the macOS default PDF 
> viewer and visible values in Chrome.






[jira] [Resolved] (PDFBOX-5398) Parsing fails in 2.0.26 that worked in 2.0.25

2022-04-07 Thread Jira


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler resolved PDFBOX-5398.

Resolution: Fixed

[~tilman] thanks for the analysis

> Parsing fails in 2.0.26 that worked in 2.0.25
> -
>
> Key: PDFBOX-5398
> URL: https://issues.apache.org/jira/browse/PDFBOX-5398
> Project: PDFBox
>  Issue Type: Bug
>  Components: Parsing
>Affects Versions: 2.0.26, 3.0.0 PDFBox
>Reporter: Tilman Hausherr
>Assignee: Andreas Lehmkühler
>Priority: Major
>  Labels: regression
> Fix For: 2.0.26, 3.0.0 PDFBox
>
> Attachments: 077867.pdf, 392443.pdf, 
> crash-024bde7e01045bb3a6ab9d86b13cf411bc35.pdf
>
>
> {noformat}
> March 23, 2022 4:14:13 AM org.apache.pdfbox.pdfparser.BaseParser 
> parseCOSDictionaryNameValuePair
> WARNING: Empty COSName at offset 12313
> Exception in thread "main" java.io.IOException: Unknown dir object c='>' 
> cInt=62 peek='>' peekInt=62 at offset 12326 (start offset: 12326)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:928)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:154)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:303)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:228)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:872)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:154)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:303)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:228)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:872)
> at 
> org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:916)
> at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:883)
> at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:796)
> at 
> org.apache.pdfbox.pdfparser.COSParser.parseDictObjects(COSParser.java:756)
> at 
> org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:187)
> at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:226)
> {noformat}
> The cause is not PDFBOX-5283.






[jira] [Updated] (PDFBOX-5339) A list of bugs found (70 bugs in total)

2022-04-07 Thread Jira


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-5339:
---
Description: 
1. Unique Bugs Found
Recently we (Zhang Cen [https://github.com/occia], Huang Wenjie 
[https://github.com/ZanderHuang] and Zhang Xiaohan 
[https://github.com/Han0nly]) discovered a series of bugs in the latest pdfbox 
(3.0.0-alpha2).
Every bug reported below is unique and reproducible. Furthermore, they have 
been manually analyzed and triaged to remove duplicates.
Because we lack contextual knowledge of the pdfbox library, we cannot 
thoroughly fix some of the bugs, so we look forward to any plan the developers 
propose for fixing them.

2. Bug Report and Crash Seeds
The bug report folder can be downloaded from 
[https://drive.google.com/drive/folders/1TMOzudQOVXPKdZ1--NyusyV7kHRA2MSE?usp=sharing]
It contains both reports and crash seeds.

3. Test Program to Reproduce Crashes
The test program can be downloaded from 
[https://drive.google.com/file/d/1r0OsDC0vg8Qc-XtGg0XDKbxubaPozcBj/view?usp=sharing]

A total of 70 bugs are reported in this issue.
A full list is provided below.

4. Folder structure
 - Level 1 (folder): exception type
 - Level 2 (folder): error location
 - Level 3 (files): POC file and report.txt including reproducing steps
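
The level 2 folder names in the list further down appear to follow the pattern `<qualified method>--<source file>-<line>`, for example `org.apache.fontbox.cff.CFFParser.readString--CFFParser.java-781`. A minimal sketch of splitting such a name into its parts, assuming exactly that pattern; the class below is hypothetical and not part of the reporters' test program or of PDFBox:

```java
// Hypothetical parser for the "error location" folder names; for illustration only.
final class CrashLocation {
    final String method;     // e.g. org.apache.fontbox.cff.CFFParser.readString
    final String sourceFile; // e.g. CFFParser.java
    final int line;          // e.g. 781

    private CrashLocation(String method, String sourceFile, int line) {
        this.method = method;
        this.sourceFile = sourceFile;
        this.line = line;
    }

    /** Splits "<method>--<file>-<line>" at the last "--" and the last "-". */
    static CrashLocation parse(String entry) {
        int sep = entry.lastIndexOf("--");
        int dash = entry.lastIndexOf('-');
        // The last single dash must come after the "--" separator.
        if (sep < 0 || dash <= sep + 1) {
            throw new IllegalArgumentException("Unexpected entry format: " + entry);
        }
        String method = entry.substring(0, sep);
        String sourceFile = entry.substring(sep + 2, dash);
        int line = Integer.parseInt(entry.substring(dash + 1));
        return new CrashLocation(method, sourceFile, line);
    }
}
```

The tree-drawing prefix (for example `├─= `) has to be stripped before calling `parse`. Grouping the parsed entries by `sourceFile` would show which classes account for most of the reported crashes.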

5. report.txt content:
      1. Exception type
      2. Error location
      3. Bug cause and impact
      4. Crash thread's stacks
      5. Steps to reproduce

 

6. Bug full list (crashes under java.lang.IllegalArgumentException and 
IllegalStateException should be wrapped instead of using the common exception 
types)

pdfbox_reported_crashes
├── java.lang.ArrayIndexOutOfBoundsException
│   ├─= org.apache.fontbox.cff.CFFParser.readString--CFFParser.java-781
│   ├─= org.apache.fontbox.cff.Type1CharString.seac--Type1CharString.java-484
│   ├─= org.apache.fontbox.ttf.HorizontalMetricsTable.getAdvanceWidth--HorizontalMetricsTable.java-113
│   ├─= org.apache.pdfbox.filter.CCITTFaxDecoderStream.decode2D--CCITTFaxDecoderStream.java-218
│   └─= org.apache.pdfbox.pdfparser.PDFXrefStreamParser=ObjectNumbers.--PDFXrefStreamParser.java-202
├── java.lang.ClassCastException
│   ├─= org.apache.fontbox.cff.CFFParser.parseType1Dicts--CFFParser.java-765
│   ├─= org.apache.fontbox.cmap.CMapParser.parseBeginbfrange--CMapParser.java-377
│   ├─= org.apache.pdfbox.contentstream.operator.text.SetTextLeading.process--SetTextLeading.java-37
│   ├─= org.apache.pdfbox.pdmodel.font.PDFont.getAverageFontWidth--PDFont.java-402
│   ├─= org.apache.pdfbox.pdmodel.font.PDType1CFont.--PDType1CFont.java-101
│   └─= org.apache.pdfbox.util.Matrix.--Matrix.java-70
├── java.lang.IllegalArgumentException
│   ├─= org.apache.fontbox.cff.CFFParser=DictData=Entry.getBoolean--CFFParser.java-1247
│   ├─= org.apache.fontbox.cff.CFFParser.readCharset--CFFParser.java-1042
│   ├─= org.apache.fontbox.cff.CFFParser.readEncoding--CFFParser.java-808
│   ├─= org.apache.fontbox.cff.Type1CharString.callothersubr--Type1CharString.java-383
│   ├─= org.apache.fontbox.cff.Type1CharString.handleType1Command--Type1CharString.java-319
│   ├─= org.apache.pdfbox.cos.COSObjectKey.--COSObjectKey.java-54
│   ├─= org.apache.pdfbox.cos.COSObjectKey.--COSObjectKey.java-58
│   ├─= org.apache.pdfbox.pdmodel.font.PDFontFactory.createDescendantFont--PDFontFactory.java-128
│   ├─= org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont--PDFontFactory.java-100
│   ├─= org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont--PDFontFactory.java-104
│   ├─= org.apache.pdfbox.pdmodel.font.PDType1Font.--PDType1Font.java-202
│   └── org.apache.pdfbox.util.Matrix.checkFloatValues--Matrix.java-300
├── java.lang.IllegalStateException
│   ├─= org.apache.fontbox.cff.CFFCharsetCID.getSIDForGID--CFFCharsetCID.java-59
│   └─= org.apache.pdfbox.pdmodel.PDPageTree.sanitizeType--PDPageTree.java-261
├── java.lang.IndexOutOfBoundsException
│   ├─= org.apache.fontbox.cff.CFFParser=DictData=Entry.getNumber--CFFParser.java-1229
│   ├─= org.apache.fontbox.cff.Type1CharString.handleType1Command--Type1CharString.java-292
│   ├─= org.apache.fontbox.cff.Type2CharString.handleType2Command--Type2CharString.java-146
│   ├─= org.apache.fontbox.util.BoundingBox.--BoundingBox.java-65
│   ├─= org.apache.pdfbox.contentstream.operator.text.SetTextLeading.process--SetTextLeading.java-37
│   └─= org.apache.pdfbox.cos.COSArray.getObject--COSArray.java-205
├── java.lang.NegativeArraySizeException
│   └─= org.apache.pdfbox.pdfparser.PDFXrefStreamParser.parse--PDFXrefStreamParser.java-123
├── java.lang.NullPointerException
│   ├─= org.apache.fontbox.cff.CFFParser.parseFont--CFFParser.java-486
│   ├─= org.apache.fontbox.cff.CFFParser.readString--CFFParser.java-779
│   ├─= org.apache.fontbox.cmap.CMap.toInt--CMap.java-207
│   ├─= org.apache.fontbox.type1.Token.intValue--Token.java-107
│   ├─= 

[jira] [Commented] (PDFBOX-5398) Parsing fails in 2.0.26 that worked in 2.0.25

2022-04-07 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17518597#comment-17518597
 ] 

ASF subversion and git services commented on PDFBOX-5398:
-

Commit 1899641 from le...@apache.org in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1899641 ]

PDFBOX-5398, PDFBOX-5339: revert changes from 1897171 as they were too strict 
and led to unwanted parsing issues. This brings back an issue with a really bad 
PDF which ends up in a StackOverflow

> Parsing fails in 2.0.26 that worked in 2.0.25
> -
>
> Key: PDFBOX-5398
> URL: https://issues.apache.org/jira/browse/PDFBOX-5398
> Project: PDFBox
>  Issue Type: Bug
>  Components: Parsing
>Affects Versions: 2.0.26, 3.0.0 PDFBox
>Reporter: Tilman Hausherr
>Assignee: Andreas Lehmkühler
>Priority: Major
>  Labels: regression
> Fix For: 2.0.26, 3.0.0 PDFBox
>
> Attachments: 077867.pdf, 392443.pdf, 
> crash-024bde7e01045bb3a6ab9d86b13cf411bc35.pdf
>
>
> {noformat}
> March 23, 2022 4:14:13 AM org.apache.pdfbox.pdfparser.BaseParser 
> parseCOSDictionaryNameValuePair
> WARNING: Empty COSName at offset 12313
> Exception in thread "main" java.io.IOException: Unknown dir object c='>' 
> cInt=62 peek='>' peekInt=62 at offset 12326 (start offset: 12326)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:928)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:154)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:303)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:228)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:872)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:154)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:303)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:228)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:872)
> at 
> org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:916)
> at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:883)
> at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:796)
> at 
> org.apache.pdfbox.pdfparser.COSParser.parseDictObjects(COSParser.java:756)
> at 
> org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:187)
> at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:226)
> {noformat}
> The cause is not PDFBOX-5283.






[jira] [Commented] (PDFBOX-5339) A list of bugs found (70 bugs in total)

2022-04-07 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17518596#comment-17518596
 ] 

ASF subversion and git services commented on PDFBOX-5339:
-

Commit 1899640 from le...@apache.org in branch 'pdfbox/branches/2.0'
[ https://svn.apache.org/r1899640 ]

PDFBOX-5398, PDFBOX-5339: revert changes from 1897171 as they were too strict 
and led to unwanted parsing issues. This brings back an issue with a really bad 
PDF which ends up in a StackOverflow

> A list of bugs found (70 bugs in total)
> ---
>
> Key: PDFBOX-5339
> URL: https://issues.apache.org/jira/browse/PDFBOX-5339
> Project: PDFBox
>  Issue Type: Bug
>Affects Versions: 2.0.25, 3.0.0 PDFBox
>Reporter: Huang Wenjie
>Priority: Minor
> Fix For: 2.0.26, 3.0.0 PDFBox
>
> Attachments: crash-4698e0dc7833a3f959d06707e01d03cda52a83f4
>
>
> 1. Unique Bugs Found
> Recently we (Zhang Cen [https://github.com/occia], Huang Wenjie 
> [https://github.com/ZanderHuang] and Zhang Xiaohan 
> [https://github.com/Han0nly]) discovered a series of bugs in the latest pdfbox 
> (3.0.0-alpha2).
> Every bug reported below is unique and reproducible. Furthermore, they have 
> been manually analyzed and triaged to remove duplicates.
> Because we lack contextual knowledge of the pdfbox library, we cannot 
> thoroughly fix some of the bugs, so we look forward to any plan the developers 
> propose for fixing them.
> 2. Bug Report and Crash Seeds
> The bug report folder can be downloaded from 
> [https://drive.google.com/drive/folders/1TMOzudQOVXPKdZ1--NyusyV7kHRA2MSE?usp=sharing]
> It contains both reports and crash seeds.
> 3. Test Program to Reproduce Crashes
> The test program can be downloaded from 
> [https://drive.google.com/file/d/1r0OsDC0vg8Qc-XtGg0XDKbxubaPozcBj/view?usp=sharing]
> A total of 70 bugs are reported in this issue.
> A full list is provided below.
> 4. Folder structure
>  - Level 1 (folder): exception type
>  - Level 2 (folder): error location
>  - Level 3 (files): POC file and report.txt including reproducing steps
> 5. report.txt content:
>       1. Exception type
>       2. Error location
>       3. Bug cause and impact
>       4. Crash thread's stacks
>       5. Steps to reproduce
>  
> 6. Bug full list (crashes under java.lang.IllegalArgumentException and 
> IllegalStateException should be wrapped instead of using the common exception 
> types)
> pdfbox_reported_crashes
> ├── java.lang.ArrayIndexOutOfBoundsException
> │   ├─= org.apache.fontbox.cff.CFFParser.readString--CFFParser.java-781
> │   ├─= org.apache.fontbox.cff.Type1CharString.seac--Type1CharString.java-484
> │   ├─= org.apache.fontbox.ttf.HorizontalMetricsTable.getAdvanceWidth--HorizontalMetricsTable.java-113
> │   ├─= org.apache.pdfbox.filter.CCITTFaxDecoderStream.decode2D--CCITTFaxDecoderStream.java-218
> │   └─= org.apache.pdfbox.pdfparser.PDFXrefStreamParser=ObjectNumbers.--PDFXrefStreamParser.java-202
> ├── java.lang.ClassCastException
> │   ├─= org.apache.fontbox.cff.CFFParser.parseType1Dicts--CFFParser.java-765
> │   ├─= org.apache.fontbox.cmap.CMapParser.parseBeginbfrange--CMapParser.java-377
> │   ├─= org.apache.pdfbox.contentstream.operator.text.SetTextLeading.process--SetTextLeading.java-37
> │   ├─= org.apache.pdfbox.pdmodel.font.PDFont.getAverageFontWidth--PDFont.java-402
> │   ├─= org.apache.pdfbox.pdmodel.font.PDType1CFont.--PDType1CFont.java-101
> │   └─= org.apache.pdfbox.util.Matrix.--Matrix.java-70
> ├── java.lang.IllegalArgumentException
> │   ├─= org.apache.fontbox.cff.CFFParser=DictData=Entry.getBoolean--CFFParser.java-1247
> │   ├─= org.apache.fontbox.cff.CFFParser.readCharset--CFFParser.java-1042
> │   ├─= org.apache.fontbox.cff.CFFParser.readEncoding--CFFParser.java-808
> │   ├─= org.apache.fontbox.cff.Type1CharString.callothersubr--Type1CharString.java-383
> │   ├─= org.apache.fontbox.cff.Type1CharString.handleType1Command--Type1CharString.java-319
> │   ├─= org.apache.pdfbox.cos.COSObjectKey.--COSObjectKey.java-54
> │   ├─= org.apache.pdfbox.cos.COSObjectKey.--COSObjectKey.java-58
> │   ├─= org.apache.pdfbox.pdmodel.font.PDFontFactory.createDescendantFont--PDFontFactory.java-128
> │   ├─= org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont--PDFontFactory.java-100
> │   ├─= org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont--PDFontFactory.java-104
> │   ├─= org.apache.pdfbox.pdmodel.font.PDType1Font.--PDType1Font.java-202
> │   └── org.apache.pdfbox.util.Matrix.checkFloatValues--Matrix.java-300
> ├── java.lang.IllegalStateException
> │   ├─= org.apache.fontbox.cff.CFFCharsetCID.getSIDForGID--CFFCharsetCID.java-59
> │   └─= org.apache.pdfbox.pdmodel.PDPageTree.sanitizeType--PDPageTree.java-261
> ├── java.lang.IndexOutOfBoundsException

[jira] [Commented] (PDFBOX-5398) Parsing fails in 2.0.26 that worked in 2.0.25

2022-04-07 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17518595#comment-17518595
 ] 

ASF subversion and git services commented on PDFBOX-5398:
-

Commit 1899640 from le...@apache.org in branch 'pdfbox/branches/2.0'
[ https://svn.apache.org/r1899640 ]

PDFBOX-5398, PDFBOX-5339: revert changes from 1897171 as they were too strict 
and led to unwanted parsing issues. This brings back an issue with a really bad 
pdf which ends up in a StackOverflow

> Parsing fails in 2.0.26 that worked in 2.0.25
> -
>
> Key: PDFBOX-5398
> URL: https://issues.apache.org/jira/browse/PDFBOX-5398
> Project: PDFBox
>  Issue Type: Bug
>  Components: Parsing
>Affects Versions: 2.0.26, 3.0.0 PDFBox
>Reporter: Tilman Hausherr
>Assignee: Andreas Lehmkühler
>Priority: Major
>  Labels: regression
> Fix For: 2.0.26, 3.0.0 PDFBox
>
> Attachments: 077867.pdf, 392443.pdf, 
> crash-024bde7e01045bb3a6ab9d86b13cf411bc35.pdf
>
>
> {noformat}
> März 23, 2022 4:14:13 AM org.apache.pdfbox.pdfparser.BaseParser 
> parseCOSDictionaryNameValuePair
> WARNUNG: Empty COSName at offset 12313
> Exception in thread "main" java.io.IOException: Unknown dir object c='>' 
> cInt=62 peek='>' peekInt=62 at offset 12326 (start offset: 12326)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:928)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:154)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:303)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:228)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:872)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:154)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:303)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:228)
> at 
> org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:872)
> at 
> org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:916)
> at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:883)
> at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:796)
> at 
> org.apache.pdfbox.pdfparser.COSParser.parseDictObjects(COSParser.java:756)
> at 
> org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:187)
> at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:226)
> {noformat}
> The cause is not PDFBOX-5283.






[jira] [Commented] (PDFBOX-5339) A list of bugs found (70 bugs in total)

2022-04-07 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17518598#comment-17518598
 ] 

ASF subversion and git services commented on PDFBOX-5339:
-

Commit 1899641 from le...@apache.org in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1899641 ]

PDFBOX-5398, PDFBOX-5339: revert changes from 1897171 as they were too strict 
and led to unwanted parsing issues. This brings back an issue with a really bad 
pdf which ends up in a StackOverflow


[jira] [Updated] (PDFBOX-5398) Parsing fails in 2.0.26 that worked in 2.0.25

2022-04-07 Thread Jira


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-5398:
---
Fix Version/s: 2.0.26
   3.0.0 PDFBox



