[jira] [Closed] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr closed PDFBOX-5791.
---
Resolution: Not A Problem

> PDFBox CLI unable to read JPEG Image
> 
>
> Key: PDFBOX-5791
> URL: https://issues.apache.org/jira/browse/PDFBOX-5791
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 2.0.27
>Reporter: Kabir Soneja
>Priority: Major
> Attachments: original_doc-11.jpg, screenshot-1.png
>
>
> Hi,
> I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I 
> am using the CLI to convert PDF to Images. While doing so, for some documents 
> PDFBox app jar is unable to read JPEG2000 Image.
> Commands I executed:
> {code:java}
> java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
> I realized that PDFBox needs additional jars like jai-imageio-core and 
> jai-imageio-jpeg2000 and I tried including these jars in the class path while 
> executing the pdfbox command but I am still running into the same issue. Is 
> there a specific way to ensure that pdfbox-app jar is able to reference the 
> dependencies it needs when I am executing through the CLI?
> I have all jars present in lib directory and upon executing this command I am 
> getting an error indicating:
> Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
> Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox
> Command:
> {code:java}
> java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
> PDFToImage og-color-doc.pdf {code}
> Can you please help me understand how to pass the specific jars/libraries 
> that pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Kabir Soneja (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830780#comment-17830780
 ] 

Kabir Soneja commented on PDFBOX-5791:
--

Thanks [~tilman] 
Yes, I re-downloaded both libraries and then it works.

> PDFBox CLI unable to read JPEG Image
> 
>
> Key: PDFBOX-5791
> URL: https://issues.apache.org/jira/browse/PDFBOX-5791
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 2.0.27
>Reporter: Kabir Soneja
>Priority: Major
> Attachments: original_doc-11.jpg, screenshot-1.png
>
>
> Hi,
> I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I 
> am using the CLI to convert PDF to Images. While doing so, for some documents 
> PDFBox app jar is unable to read JPEG2000 Image.
> Commands I executed:
> {code:java}
> java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
> I realized that PDFBox needs additional jars like jai-imageio-core and 
> jai-imageio-jpeg2000 and I tried including these jars in the class path while 
> executing the pdfbox command but I am still running into the same issue. Is 
> there a specific way to ensure that pdfbox-app jar is able to reference the 
> dependencies it needs when I am executing through the CLI?
> I have all jars present in lib directory and upon executing this command I am 
> getting an error indicating:
> Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
> Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox
> Command:
> {code:java}
> java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
> PDFToImage og-color-doc.pdf {code}
> Can you please help me understand how to pass the specific jars/libraries 
> that pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Comment Edited] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Tilman Hausherr (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830778#comment-17830778
 ] 

Tilman Hausherr edited comment on PDFBOX-5791 at 3/26/24 4:28 AM:
--

I don't know if relevant, but you should use 1.4.0, not 1.3.0. One of the two 
you mentioned is outdated.

Please retry and download both and do this in a separate directory so to avoid 
using old files
https://github.com/jai-imageio/jai-imageio-core/releases/tag/jai-imageio-core-1.4.0
https://github.com/jai-imageio/jai-imageio-jpeg2000/releases/tag/jai-imageio-jpeg2000-1.4.0

I downloaded both now and really tested from scratch with 2.0.31 and the 
attached JPEG is what I got:
!original_doc-11.jpg! 


was (Author: tilman):
I don't know if relevant, but you should use 1.4.0, not 1.3.0.
https://github.com/jai-imageio/jai-imageio-core/releases/tag/jai-imageio-core-1.4.0
https://github.com/jai-imageio/jai-imageio-jpeg2000/releases/tag/jai-imageio-jpeg2000-1.4.0

I downloaded both now and really tested from scratch with 2.0.31 and the 
attached JPEG is what I got:
!original_doc-11.jpg! 

> PDFBox CLI unable to read JPEG Image
> 
>
> Key: PDFBOX-5791
> URL: https://issues.apache.org/jira/browse/PDFBOX-5791
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 2.0.27
>Reporter: Kabir Soneja
>Priority: Major
> Attachments: original_doc-11.jpg, screenshot-1.png
>
>
> Hi,
> I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I 
> am using the CLI to convert PDF to Images. While doing so, for some documents 
> PDFBox app jar is unable to read JPEG2000 Image.
> Commands I executed:
> {code:java}
> java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
> I realized that PDFBox needs additional jars like jai-imageio-core and 
> jai-imageio-jpeg2000 and I tried including these jars in the class path while 
> executing the pdfbox command but I am still running into the same issue. Is 
> there a specific way to ensure that pdfbox-app jar is able to reference the 
> dependencies it needs when I am executing through the CLI?
> I have all jars present in lib directory and upon executing this command I am 
> getting an error indicating:
> Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
> Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox
> Command:
> {code:java}
> java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
> PDFToImage og-color-doc.pdf {code}
> Can you please help me understand how to pass the specific jars/libraries 
> that pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Tilman Hausherr (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830778#comment-17830778
 ] 

Tilman Hausherr commented on PDFBOX-5791:
-

I don't know if relevant, but you should use 1.4.0, not 1.3.0.
https://github.com/jai-imageio/jai-imageio-core/releases/tag/jai-imageio-core-1.4.0
https://github.com/jai-imageio/jai-imageio-jpeg2000/releases/tag/jai-imageio-jpeg2000-1.4.0

I downloaded both now and really tested from scratch with 2.0.31 and the 
attached JPEG is what I got:
!original_doc-11.jpg! 

> PDFBox CLI unable to read JPEG Image
> 
>
> Key: PDFBOX-5791
> URL: https://issues.apache.org/jira/browse/PDFBOX-5791
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 2.0.27
>Reporter: Kabir Soneja
>Priority: Major
> Attachments: original_doc-11.jpg, screenshot-1.png
>
>
> Hi,
> I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I 
> am using the CLI to convert PDF to Images. While doing so, for some documents 
> PDFBox app jar is unable to read JPEG2000 Image.
> Commands I executed:
> {code:java}
> java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
> I realized that PDFBox needs additional jars like jai-imageio-core and 
> jai-imageio-jpeg2000 and I tried including these jars in the class path while 
> executing the pdfbox command but I am still running into the same issue. Is 
> there a specific way to ensure that pdfbox-app jar is able to reference the 
> dependencies it needs when I am executing through the CLI?
> I have all jars present in lib directory and upon executing this command I am 
> getting an error indicating:
> Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
> Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox
> Command:
> {code:java}
> java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
> PDFToImage og-color-doc.pdf {code}
> Can you please help me understand how to pass the specific jars/libraries 
> that pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated PDFBOX-5791:

Attachment: original_doc-11.jpg

> PDFBox CLI unable to read JPEG Image
> 
>
> Key: PDFBOX-5791
> URL: https://issues.apache.org/jira/browse/PDFBOX-5791
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 2.0.27
>Reporter: Kabir Soneja
>Priority: Major
> Attachments: original_doc-11.jpg, screenshot-1.png
>
>
> Hi,
> I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I 
> am using the CLI to convert PDF to Images. While doing so, for some documents 
> PDFBox app jar is unable to read JPEG2000 Image.
> Commands I executed:
> {code:java}
> java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
> I realized that PDFBox needs additional jars like jai-imageio-core and 
> jai-imageio-jpeg2000 and I tried including these jars in the class path while 
> executing the pdfbox command but I am still running into the same issue. Is 
> there a specific way to ensure that pdfbox-app jar is able to reference the 
> dependencies it needs when I am executing through the CLI?
> I have all jars present in lib directory and upon executing this command I am 
> getting an error indicating:
> Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
> Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox
> Command:
> {code:java}
> java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
> PDFToImage og-color-doc.pdf {code}
> Can you please help me understand how to pass the specific jars/libraries 
> that pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Kabir Soneja (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830705#comment-17830705
 ] 

Kabir Soneja commented on PDFBOX-5791:
--

[~tilman] After adding the above two jars in classpath, I no longer see the 
error related to decoding the jpeg image but the image being generated has 
distorted colors for the first page.
For reference, I am using the same pdf as mentioned in PDFBOX-5615 and the 
image being generated for the first page is same as the image reported in the 
ticket. Classpath has jai-imageio-core-1.3.0 and jai-imageio-jpeg2000-1.4.0 
still the image being generated is having distorted colors. What could be 
missing in this case?

> PDFBox CLI unable to read JPEG Image
> 
>
> Key: PDFBOX-5791
> URL: https://issues.apache.org/jira/browse/PDFBOX-5791
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 2.0.27
>Reporter: Kabir Soneja
>Priority: Major
> Attachments: screenshot-1.png
>
>
> Hi,
> I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I 
> am using the CLI to convert PDF to Images. While doing so, for some documents 
> PDFBox app jar is unable to read JPEG2000 Image.
> Commands I executed:
> {code:java}
> java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
> I realized that PDFBox needs additional jars like jai-imageio-core and 
> jai-imageio-jpeg2000 and I tried including these jars in the class path while 
> executing the pdfbox command but I am still running into the same issue. Is 
> there a specific way to ensure that pdfbox-app jar is able to reference the 
> dependencies it needs when I am executing through the CLI?
> I have all jars present in lib directory and upon executing this command I am 
> getting an error indicating:
> Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
> Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox
> Command:
> {code:java}
> java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
> PDFToImage og-color-doc.pdf {code}
> Can you please help me understand how to pass the specific jars/libraries 
> that pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Kabir Soneja (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830703#comment-17830703
 ] 

Kabir Soneja commented on PDFBOX-5791:
--

Thanks [~tilman] That worked. I had to use ":" instead of ";".

> PDFBox CLI unable to read JPEG Image
> 
>
> Key: PDFBOX-5791
> URL: https://issues.apache.org/jira/browse/PDFBOX-5791
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 2.0.27
>Reporter: Kabir Soneja
>Priority: Major
> Attachments: screenshot-1.png
>
>
> Hi,
> I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I 
> am using the CLI to convert PDF to Images. While doing so, for some documents 
> PDFBox app jar is unable to read JPEG2000 Image.
> Commands I executed:
> {code:java}
> java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
> I realized that PDFBox needs additional jars like jai-imageio-core and 
> jai-imageio-jpeg2000 and I tried including these jars in the class path while 
> executing the pdfbox command but I am still running into the same issue. Is 
> there a specific way to ensure that pdfbox-app jar is able to reference the 
> dependencies it needs when I am executing through the CLI?
> I have all jars present in lib directory and upon executing this command I am 
> getting an error indicating:
> Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
> Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox
> Command:
> {code:java}
> java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
> PDFToImage og-color-doc.pdf {code}
> Can you please help me understand how to pass the specific jars/libraries 
> that pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



about Scripts.txt

2024-03-25 Thread Dieter von Holten
hi there



while browsing through the sources i came across  OpenTypeScript.java, which
loads the

resource-file Scripts.txt

The file contains a list of circa 2700 Unicode codepoints.

The file is version 10.0.0 of 2017-03-11 .



A reference points to a newer version of this file:



http://www.unicode.org/Public/UCD/latest/ucd/Scripts.txt



which is version 15.1.0 of 2023-07-28, it contains circa 3000 Unicode
codepoints.



i propose to investigate, if this newer file can be included in PdfBox and
works for older Jdk-versions, as supported for PdfBox 2.



MfG

DvH









[jira] [Resolved] (PDFBOX-5790) Don't use a predefined CMap if a ToUnicode CMap is present

2024-03-25 Thread Jira


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler resolved PDFBOX-5790.

Fix Version/s: 2.0.32
   4.0.0
   3.0.3 PDFBox
   Resolution: Fixed

> Don't use a predefined CMap if a ToUnicode CMap is present
> --
>
> Key: PDFBOX-5790
> URL: https://issues.apache.org/jira/browse/PDFBOX-5790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Text extraction
>Affects Versions: 2.0.31, 4.0.0, 3.0.3 PDFBox
>Reporter: Andreas Lehmkühler
>Assignee: Andreas Lehmkühler
>Priority: Major
> Fix For: 2.0.32, 4.0.0, 3.0.3 PDFBox
>
> Attachments: p4_fix.pdf
>
>
> The user Luiz Marcelo Modesto reported an issue with the text extraction of 
> the attached pdf  [^p4_fix.pdf] 
> {quote}
> Hi everyone,
> I'm not sure if this is the same as FAQ "How come I am getting 
> gibberish(G38G43G36G51G5) when extracting text?"...
> I'm using PDFBox version 3.0.1 and OpenJDK Runtime Environment (build 
> 11.0.22+7-post-Ubuntu-0ubuntu222.04.1).
> I'm trying to understand how this PDF chunk (from p4_fix.pdf attached)
>   BT
>   /G1F7 6.0 Tf
>   94.871 773.806 Td
>   <004200430044> Tj
>   ET
> becomes "BCD" on PDFBox Debugger (the same on qpdfview, Adobe Reader, 
> Chrome, ...) and becomes "abc" on PDFBox text extraction tool. 
> Using the Poppler pdftotext (version 22.02.0) gives me "BCD" too.
> The renders that allow me to copy the text give me "BCD" text.
> It seems that PDFBox extraction tool follows the item "9.10.2 Mapping 
> character codes to Unicode values" (ISO 32000-2:2020) but all the others 
> choose a different way.
>  Could you help me to understand if there is a problem with the PDF file, 
> with the renders or with the extract text tool? 
> Thank you!
> {quote}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Tilman Hausherr (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830644#comment-17830644
 ] 

Tilman Hausherr commented on PDFBOX-5791:
-

I'm on windows so I use ";" for the classpath. I tried to use ":" instead and I 
got the same error you got:

Fehler: Hauptklasse org.apache.pdfbox.tools.PDFBox konnte nicht gefunden oder 
geladen werden
Ursache: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox

If you're on Linux, use a colon. If you're on windows, use a semicolon in the 
classpath.
https://stackoverflow.com/a/4528456/535646

> PDFBox CLI unable to read JPEG Image
> 
>
> Key: PDFBOX-5791
> URL: https://issues.apache.org/jira/browse/PDFBOX-5791
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 2.0.27
>Reporter: Kabir Soneja
>Priority: Major
> Attachments: screenshot-1.png
>
>
> Hi,
> I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I 
> am using the CLI to convert PDF to Images. While doing so, for some documents 
> PDFBox app jar is unable to read JPEG2000 Image.
> Commands I executed:
> {code:java}
> java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
> I realized that PDFBox needs additional jars like jai-imageio-core and 
> jai-imageio-jpeg2000 and I tried including these jars in the class path while 
> executing the pdfbox command but I am still running into the same issue. Is 
> there a specific way to ensure that pdfbox-app jar is able to reference the 
> dependencies it needs when I am executing through the CLI?
> I have all jars present in lib directory and upon executing this command I am 
> getting an error indicating:
> Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
> Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox
> Command:
> {code:java}
> java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
> PDFToImage og-color-doc.pdf {code}
> Can you please help me understand how to pass the specific jars/libraries 
> that pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Kabir Soneja (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830639#comment-17830639
 ] 

Kabir Soneja commented on PDFBOX-5791:
--

Thanks, yes I will double check if there is any issue with the jar. But just 
wanted to confirm all the additional jars needed.

> PDFBox CLI unable to read JPEG Image
> 
>
> Key: PDFBOX-5791
> URL: https://issues.apache.org/jira/browse/PDFBOX-5791
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 2.0.27
>Reporter: Kabir Soneja
>Priority: Major
> Attachments: screenshot-1.png
>
>
> Hi,
> I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I 
> am using the CLI to convert PDF to Images. While doing so, for some documents 
> PDFBox app jar is unable to read JPEG2000 Image.
> Commands I executed:
> {code:java}
> java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
> I realized that PDFBox needs additional jars like jai-imageio-core and 
> jai-imageio-jpeg2000 and I tried including these jars in the class path while 
> executing the pdfbox command but I am still running into the same issue. Is 
> there a specific way to ensure that pdfbox-app jar is able to reference the 
> dependencies it needs when I am executing through the CLI?
> I have all jars present in lib directory and upon executing this command I am 
> getting an error indicating:
> Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
> Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox
> Command:
> {code:java}
> java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
> PDFToImage og-color-doc.pdf {code}
> Can you please help me understand how to pass the specific jars/libraries 
> that pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Tilman Hausherr (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830638#comment-17830638
 ] 

Tilman Hausherr commented on PDFBOX-5791:
-

Yes this is correct. But your call fails earlier, it's the "main" class that 
isn't found (org.apache.pdfbox.tools.PDFBox).

> PDFBox CLI unable to read JPEG Image
> 
>
> Key: PDFBOX-5791
> URL: https://issues.apache.org/jira/browse/PDFBOX-5791
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 2.0.27
>Reporter: Kabir Soneja
>Priority: Major
> Attachments: screenshot-1.png
>
>
> Hi,
> I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I 
> am using the CLI to convert PDF to Images. While doing so, for some documents 
> PDFBox app jar is unable to read JPEG2000 Image.
> Commands I executed:
> {code:java}
> java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
> I realized that PDFBox needs additional jars like jai-imageio-core and 
> jai-imageio-jpeg2000 and I tried including these jars in the class path while 
> executing the pdfbox command but I am still running into the same issue. Is 
> there a specific way to ensure that pdfbox-app jar is able to reference the 
> dependencies it needs when I am executing through the CLI?
> I have all jars present in lib directory and upon executing this command I am 
> getting an error indicating:
> Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
> Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox
> Command:
> {code:java}
> java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
> PDFToImage og-color-doc.pdf {code}
> Can you please help me understand how to pass the specific jars/libraries 
> that pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated PDFBOX-5791:

Attachment: screenshot-1.png

> PDFBox CLI unable to read JPEG Image
> 
>
> Key: PDFBOX-5791
> URL: https://issues.apache.org/jira/browse/PDFBOX-5791
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 2.0.27
>Reporter: Kabir Soneja
>Priority: Major
> Attachments: screenshot-1.png
>
>
> Hi,
> I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I 
> am using the CLI to convert PDF to Images. While doing so, for some documents 
> PDFBox app jar is unable to read JPEG2000 Image.
> Commands I executed:
> {code:java}
> java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
> I realized that PDFBox needs additional jars like jai-imageio-core and 
> jai-imageio-jpeg2000 and I tried including these jars in the class path while 
> executing the pdfbox command but I am still running into the same issue. Is 
> there a specific way to ensure that pdfbox-app jar is able to reference the 
> dependencies it needs when I am executing through the CLI?
> I have all jars present in lib directory and upon executing this command I am 
> getting an error indicating:
> Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
> Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox
> Command:
> {code:java}
> java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
> PDFToImage og-color-doc.pdf {code}
> Can you please help me understand how to pass the specific jars/libraries 
> that pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Kabir Soneja (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830636#comment-17830636
 ] 

Kabir Soneja commented on PDFBOX-5791:
--

Thanks, I will double check to see if there is any other issue with the jar.
Just want to confirm which additional jars are needed when using pdfbox-app jar 
for CLI execution.
Currently, I am adding these two jars:
 # jai-imageio-core
 # jai-imageio-jpeg2000

> PDFBox CLI unable to read JPEG Image
> 
>
> Key: PDFBOX-5791
> URL: https://issues.apache.org/jira/browse/PDFBOX-5791
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 2.0.27
>Reporter: Kabir Soneja
>Priority: Major
> Attachments: screenshot-1.png
>
>
> Hi,
> I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I 
> am using the CLI to convert PDF to Images. While doing so, for some documents 
> PDFBox app jar is unable to read JPEG2000 Image.
> Commands I executed:
> {code:java}
> java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
> I realized that PDFBox needs additional jars like jai-imageio-core and 
> jai-imageio-jpeg2000 and I tried including these jars in the class path while 
> executing the pdfbox command but I am still running into the same issue. Is 
> there a specific way to ensure that pdfbox-app jar is able to reference the 
> dependencies it needs when I am executing through the CLI?
> I have all jars present in lib directory and upon executing this command I am 
> getting an error indicating:
> Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
> Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox
> Command:
> {code:java}
> java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
> PDFToImage og-color-doc.pdf {code}
> Can you please help me understand how to pass the specific jars/libraries 
> that pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Tilman Hausherr (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830635#comment-17830635
 ] 

Tilman Hausherr commented on PDFBOX-5791:
-

To verify that your tar file is ok, open it with 7zip or a similar tool. Here 
is how it looks for me:
 !screenshot-1.png! 

> PDFBox CLI unable to read JPEG Image
> 
>
> Key: PDFBOX-5791
> URL: https://issues.apache.org/jira/browse/PDFBOX-5791
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 2.0.27
>Reporter: Kabir Soneja
>Priority: Major
> Attachments: screenshot-1.png
>
>
> Hi,
> I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I 
> am using the CLI to convert PDF to Images. While doing so, for some documents 
> PDFBox app jar is unable to read JPEG2000 Image.
> Commands I executed:
> {code:java}
> java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
> I realized that PDFBox needs additional jars like jai-imageio-core and 
> jai-imageio-jpeg2000 and I tried including these jars in the class path while 
> executing the pdfbox command but I am still running into the same issue. Is 
> there a specific way to ensure that pdfbox-app jar is able to reference the 
> dependencies it needs when I am executing through the CLI?
> I have all jars present in lib directory and upon executing this command I am 
> getting an error indicating:
> Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
> Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox
> Command:
> {code:java}
> java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
> PDFToImage og-color-doc.pdf {code}
> Can you please help me understand how to pass the specific jars/libraries 
> that pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated PDFBOX-5791:

Component/s: Utilities

> PDFBox CLI unable to read JPEG Image
> 
>
> Key: PDFBOX-5791
> URL: https://issues.apache.org/jira/browse/PDFBOX-5791
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 2.0.27
>Reporter: Kabir Soneja
>Priority: Major
>
> Hi,
> I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I 
> am using the CLI to convert PDF to Images. While doing so, for some documents 
> PDFBox app jar is unable to read JPEG2000 Image.
> Commands I executed:
> {code:java}
> java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
> I realized that PDFBox needs additional jars like jai-imageio-core and 
> jai-imageio-jpeg2000 and I tried including these jars in the class path while 
> executing the pdfbox command but I am still running into the same issue. Is 
> there a specific way to ensure that pdfbox-app jar is able to reference the 
> dependencies it needs when I am executing through the CLI?
> I have all jars present in lib directory and upon executing this command I am 
> getting an error indicating:
> Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
> Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox
> Command:
> {code:java}
> java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
> PDFToImage og-color-doc.pdf {code}
> Can you please help me understand how to pass the specific jars/libraries 
> that pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated PDFBOX-5791:

Affects Version/s: 2.0.27

> PDFBox CLI unable to read JPEG Image
> 
>
> Key: PDFBOX-5791
> URL: https://issues.apache.org/jira/browse/PDFBOX-5791
> Project: PDFBox
>  Issue Type: Bug
>Affects Versions: 2.0.27
>Reporter: Kabir Soneja
>Priority: Major
>
> Hi,
> I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I 
> am using the CLI to convert PDF to Images. While doing so, for some documents 
> PDFBox app jar is unable to read JPEG2000 Image.
> Commands I executed:
> {code:java}
> java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
> I realized that PDFBox needs additional jars like jai-imageio-core and 
> jai-imageio-jpeg2000 and I tried including these jars in the class path while 
> executing the pdfbox command but I am still running into the same issue. Is 
> there a specific way to ensure that pdfbox-app jar is able to reference the 
> dependencies it needs when I am executing through the CLI?
> I have all jars present in lib directory and upon executing this command I am 
> getting an error indicating:
> Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
> Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox
> Command:
> {code:java}
> java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
> PDFToImage og-color-doc.pdf {code}
> Can you please help me understand how to pass the specific jars/libraries 
> that pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Comment Edited] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Tilman Hausherr (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830599#comment-17830599
 ] 

Tilman Hausherr edited comment on PDFBOX-5791 at 3/25/24 5:08 PM:
--

Maybe your download is broken? The class name and the command line is correct. 
Please download pdfbox-app-2.0.31.jar here
https://pdfbox.apache.org/download.html

Also, depending on what OS you are using, replace ";" with ":".


was (Author: tilman):
Maybe your download is broken? The class name is correct. Please download 
pdfbox-app-2.0.31.jar here
https://pdfbox.apache.org/download.html

Also, depending on what OS you are using, replace ";" with ":".

> PDFBox CLI unable to read JPEG Image
> 
>
> Key: PDFBOX-5791
> URL: https://issues.apache.org/jira/browse/PDFBOX-5791
> Project: PDFBox
>  Issue Type: Bug
>Reporter: Kabir Soneja
>Priority: Major
>
> Hi,
> I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I 
> am using the CLI to convert PDF to Images. While doing so, for some documents 
> PDFBox app jar is unable to read JPEG2000 Image.
> Commands I executed:
> {code:java}
> java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
> I realized that PDFBox needs additional jars like jai-imageio-core and 
> jai-imageio-jpeg2000 and I tried including these jars in the class path while 
> executing the pdfbox command but I am still running into the same issue. Is 
> there a specific way to ensure that pdfbox-app jar is able to reference the 
> dependencies it needs when I am executing through the CLI?
> I have all jars present in lib directory and upon executing this command I am 
> getting an error indicating:
> Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
> Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox
> Command:
> {code:java}
> java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
> PDFToImage og-color-doc.pdf {code}
> Can you please help me understand how to pass the specific jars/libraries 
> that pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Maruan Sahyoun (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830600#comment-17830600
 ] 

Maruan Sahyoun commented on PDFBOX-5791:


Use the command name directly. E.g.

{code}
java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFToImage 
og-color-doc.pdf 
{code}

> PDFBox CLI unable to read JPEG Image
> 
>
> Key: PDFBOX-5791
> URL: https://issues.apache.org/jira/browse/PDFBOX-5791
> Project: PDFBox
>  Issue Type: Bug
>Reporter: Kabir Soneja
>Priority: Major
>
> Hi,
> I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I 
> am using the CLI to convert PDF to Images. While doing so, for some documents 
> PDFBox app jar is unable to read JPEG2000 Image.
> Commands I executed:
> {code:java}
> java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
> I realized that PDFBox needs additional jars like jai-imageio-core and 
> jai-imageio-jpeg2000 and I tried including these jars in the class path while 
> executing the pdfbox command but I am still running into the same issue. Is 
> there a specific way to ensure that pdfbox-app jar is able to reference the 
> dependencies it needs when I am executing through the CLI?
> I have all jars present in lib directory and upon executing this command I am 
> getting an error indicating:
> Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
> Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox
> Command:
> {code:java}
> java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
> PDFToImage og-color-doc.pdf {code}
> Can you please help me understand how to pass the specific jars/libraries 
> that pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Tilman Hausherr (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830599#comment-17830599
 ] 

Tilman Hausherr commented on PDFBOX-5791:
-

Maybe your download is broken? The class name is correct. Please download 
pdfbox-app-2.0.31.jar here
https://pdfbox.apache.org/download.html

Also, depending on what OS you are using, replace ";" with ":".

> PDFBox CLI unable to read JPEG Image
> 
>
> Key: PDFBOX-5791
> URL: https://issues.apache.org/jira/browse/PDFBOX-5791
> Project: PDFBox
>  Issue Type: Bug
>Reporter: Kabir Soneja
>Priority: Major
>
> Hi,
> I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I 
> am using the CLI to convert PDF to Images. While doing so, for some documents 
> PDFBox app jar is unable to read JPEG2000 Image.
> Commands I executed:
> {code:java}
> java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
> I realized that PDFBox needs additional jars like jai-imageio-core and 
> jai-imageio-jpeg2000 and I tried including these jars in the class path while 
> executing the pdfbox command but I am still running into the same issue. Is 
> there a specific way to ensure that pdfbox-app jar is able to reference the 
> dependencies it needs when I am executing through the CLI?
> I have all jars present in lib directory and upon executing this command I am 
> getting an error indicating:
> Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
> Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox
> Command:
> {code:java}
> java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
> PDFToImage og-color-doc.pdf {code}
> Can you please help me understand how to pass the specific jars/libraries 
> that pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Created] (PDFBOX-5791) PDFBox CLI unable to read JPEG Image

2024-03-25 Thread Kabir Soneja (Jira)
Kabir Soneja created PDFBOX-5791:


 Summary: PDFBox CLI unable to read JPEG Image
 Key: PDFBOX-5791
 URL: https://issues.apache.org/jira/browse/PDFBOX-5791
 Project: PDFBox
  Issue Type: Bug
Reporter: Kabir Soneja


Hi,

I am trying to use pdfbox-app jar to execute PDFBox commands through CLI. I am 
using the CLI to convert PDF to Images. While doing so, for some documents 
PDFBox app jar is unable to read JPEG2000 Image.

Commands I executed:
{code:java}
java -jar pdfbox-app-2.0.27.jar PDFToImage og-color-doc.pdf {code}
I realized that PDFBox needs additional jars like jai-imageio-core and 
jai-imageio-jpeg2000 and I tried including these jars in the class path while 
executing the pdfbox command but I am still running into the same issue. Is 
there a specific way to ensure that pdfbox-app jar is able to reference the 
dependencies it needs when I am executing through the CLI?

I have all jars present in lib directory and upon executing this command I am 
getting an error indicating:

Error: Could not find or load main class org.apache.pdfbox.tools.PDFBox
Caused by: java.lang.ClassNotFoundException: org.apache.pdfbox.tools.PDFBox

Command:
{code:java}
java -cp "pdfbox-app-2.0.27.jar;lib/*" org.apache.pdfbox.tools.PDFBox 
PDFToImage og-color-doc.pdf {code}
Can you please help me understand how to pass the specific jars/libraries that 
pdfbox-app needs while using the CLI?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



Jenkins build is back to normal : PDFBox » PDFBox-2.0.x #1187

2024-03-25 Thread Apache Jenkins Server
See 



-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



Jenkins build is back to normal : PDFBox » PDFBox-2.0.x » Apache PDFBox #1187

2024-03-25 Thread Apache Jenkins Server
See 



-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



Re: [PR] suppress warnings for octals, omit trailing blanks [pdfbox]

2024-03-25 Thread via GitHub


DvonHolten commented on PR #186:
URL: https://github.com/apache/pdfbox/pull/186#issuecomment-2017946640

   octals are fixed


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



Re: [PR] suppress warnings for octals, omit trailing blanks [pdfbox]

2024-03-25 Thread via GitHub


DvonHolten closed pull request #186: suppress warnings for octals, omit 
trailing blanks
URL: https://github.com/apache/pdfbox/pull/186


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



Build failed in Jenkins: PDFBox » PDFBox-2.0.x #1186

2024-03-25 Thread Apache Jenkins Server
See 


Changes:


--
[...truncated 177.45 KB...]
at java.base/java.nio.file.Files.newByteChannel(Files.java:422)
at 
java.base/java.nio.file.spi.FileSystemProvider.newInputStream(FileSystemProvider.java:420)
at java.base/java.nio.file.Files.newInputStream(Files.java:156)
at hudson.Util.getDigestOf(Util.java:684)
at 
hudson.maven.reporters.MavenArtifactArchiver.postBuild(MavenArtifactArchiver.java:99)
at 
hudson.maven.Maven3Builder$MavenExecutionListener.sessionEnded(Maven3Builder.java:464)
at 
hudson.maven.Maven3Builder$JenkinsEventSpy.onEvent(Maven3Builder.java:281)
at 
org.apache.maven.eventspy.internal.EventSpyDispatcher.onEvent(EventSpyDispatcher.java:86)
at 
org.apache.maven.eventspy.internal.EventSpyExecutionListener.sessionEnded(EventSpyExecutionListener.java:54)
at 
org.apache.maven.lifecycle.internal.DefaultExecutionEventCatapult.fire(DefaultExecutionEventCatapult.java:57)
at 
org.apache.maven.lifecycle.internal.DefaultExecutionEventCatapult.fire(DefaultExecutionEventCatapult.java:38)
at 
org.apache.maven.lifecycle.internal.LifecycleStarter.execute(LifecycleStarter.java:123)
at org.apache.maven.DefaultMaven.doExecute(DefaultMaven.java:261)
at org.apache.maven.DefaultMaven.doExecute(DefaultMaven.java:173)
at org.apache.maven.DefaultMaven.execute(DefaultMaven.java:101)
at 
org.jvnet.hudson.maven3.launcher.Maven35Launcher.main(Maven35Launcher.java:138)
at 
java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at 
java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.base/java.lang.reflect.Method.invoke(Method.java:566)
at 
org.codehaus.plexus.classworlds.launcher.Launcher.launchEnhanced(Launcher.java:283)
at 
org.codehaus.plexus.classworlds.launcher.Launcher.launch(Launcher.java:226)
at jenkins.maven3.agent.Maven35Main.launch(Maven35Main.java:179)
at 
java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at 
java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.base/java.lang.reflect.Method.invoke(Method.java:566)
at hudson.maven.Maven3Builder.call(Maven3Builder.java:140)
at hudson.maven.Maven3Builder.call(Maven3Builder.java:71)
at hudson.remoting.UserRequest.perform(UserRequest.java:211)
at hudson.remoting.UserRequest.perform(UserRequest.java:54)
at hudson.remoting.Request$2.run(Request.java:377)
at 
hudson.remoting.InterceptingExecutorService.lambda$wrap$0(InterceptingExecutorService.java:78)
at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
at 
java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
at 
java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
at java.base/java.lang.Thread.run(Thread.java:829)
java.nio.file.NoSuchFileException: 

at 
java.base/sun.nio.fs.UnixException.translateToIOException(UnixException.java:92)
at 
java.base/sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:111)
at 
java.base/sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:116)
at 
java.base/sun.nio.fs.UnixFileSystemProvider.newByteChannel(UnixFileSystemProvider.java:219)
at java.base/java.nio.file.Files.newByteChannel(Files.java:371)
at java.base/java.nio.file.Files.newByteChannel(Files.java:422)
at 
java.base/java.nio.file.spi.FileSystemProvider.newInputStream(FileSystemProvider.java:420)
at java.base/java.nio.file.Files.newInputStream(Files.java:156)
at hudson.Util.getDigestOf(Util.java:684)
at 
hudson.maven.reporters.MavenArtifactArchiver.postBuild(MavenArtifactArchiver.java:99)
at 
hudson.maven.Maven3Builder$MavenExecutionListener.sessionEnded(Maven3Builder.java:464)
at 
hudson.maven.Maven3Builder$JenkinsEventSpy.onEvent(Maven3Builder.java:281)
at 
org.apache.maven.eventspy.internal.EventSpyDispatcher.onEvent(EventSpyDispatcher.java:86)
at 
org.apache.maven.eventspy.internal.EventSpyExecutionListener.sessionEnded(EventSpyExecutionListener.java:54)
at 
org.apache.maven.lifecycle.internal.DefaultExecutionEventCatapult.fire(DefaultExecutionEventCatapult.java:57)
at 

Build failed in Jenkins: PDFBox » PDFBox-2.0.x » Apache PDFBox #1186

2024-03-25 Thread Apache Jenkins Server
See 


Changes:


--
[...truncated 56.76 KB...]
java.io.FileNotFoundException: target/test-output/crypto/PDFBOX-4453.pdf (No 
such file or directory)
at 
org.apache.pdfbox.encryption.TestSymmetricKeyEncryption.testPDFBox4453(TestSymmetricKeyEncryption.java:294)

[ERROR] testPDFBox5639(org.apache.pdfbox.encryption.TestSymmetricKeyEncryption) 
 Time elapsed: 0.001 s  <<< ERROR!
java.io.FileNotFoundException: target/pdfs/PDFBOX-5639.pdf (No such file or 
directory)
at 
org.apache.pdfbox.encryption.TestSymmetricKeyEncryption.testPDFBox5639(TestSymmetricKeyEncryption.java:316)

[ERROR] 
testPermissions(org.apache.pdfbox.encryption.TestSymmetricKeyEncryption)  Time 
elapsed: 0.001 s  <<< ERROR!
java.lang.NullPointerException
at 
org.apache.pdfbox.encryption.TestSymmetricKeyEncryption.getFileResourceAsByteArray(TestSymmetricKeyEncryption.java:478)
at 
org.apache.pdfbox.encryption.TestSymmetricKeyEncryption.testPermissions(TestSymmetricKeyEncryption.java:121)

[ERROR] testProtection(org.apache.pdfbox.encryption.TestSymmetricKeyEncryption) 
 Time elapsed: 0.001 s  <<< ERROR!
java.lang.NullPointerException
at 
org.apache.pdfbox.encryption.TestSymmetricKeyEncryption.getFileResourceAsByteArray(TestSymmetricKeyEncryption.java:478)
at 
org.apache.pdfbox.encryption.TestSymmetricKeyEncryption.testProtection(TestSymmetricKeyEncryption.java:199)

[INFO] Running org.apache.pdfbox.encryption.TestPublicKeyEncryption
[ERROR] Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0 s <<< 
FAILURE! - in org.apache.pdfbox.encryption.TestPublicKeyEncryption
[ERROR] 
initializationError(org.apache.pdfbox.encryption.TestPublicKeyEncryption)  Time 
elapsed: 0 s  <<< ERROR!
java.lang.NoClassDefFoundError: 
org/apache/pdfbox/pdmodel/encryption/PublicKeyRecipient
Caused by: java.lang.ClassNotFoundException: 
org.apache.pdfbox.pdmodel.encryption.PublicKeyRecipient

[INFO] Running org.apache.pdfbox.pdfparser.RandomAccessSourceTest
[INFO] Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0 s - in 
org.apache.pdfbox.pdfparser.RandomAccessSourceTest
[INFO] Running org.apache.pdfbox.pdfparser.EndstreamOutputStreamTest
[ERROR] Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.001 s 
<<< FAILURE! - in org.apache.pdfbox.pdfparser.EndstreamOutputStreamTest
[ERROR] 
testPDFBox2079EmbeddedFile(org.apache.pdfbox.pdfparser.EndstreamOutputStreamTest)
  Time elapsed: 0 s  <<< ERROR!
java.io.FileNotFoundException: 
src/test/resources/org/apache/pdfbox/pdfparser/embedded_zip.pdf (No such file 
or directory)
at 
org.apache.pdfbox.pdfparser.EndstreamOutputStreamTest.testPDFBox2079EmbeddedFile(EndstreamOutputStreamTest.java:119)

[INFO] Running org.apache.pdfbox.pdfparser.TestPDFParser
[ERROR] Tests run: 21, Failures: 0, Errors: 21, Skipped: 0, Time elapsed: 0.653 
s <<< FAILURE! - in org.apache.pdfbox.pdfparser.TestPDFParser
[ERROR] testParseGenko(org.apache.pdfbox.pdfparser.TestPDFParser)  Time 
elapsed: 0.03 s  <<< ERROR!
java.io.FileNotFoundException: target/pdfs/genko_oc_shiryo1.pdf (No such file 
or directory)
at 
org.apache.pdfbox.pdfparser.TestPDFParser.testParseGenko(TestPDFParser.java:310)

[ERROR] 
testPDFParserInputStreamScratchFile(org.apache.pdfbox.pdfparser.TestPDFParser)  
Time elapsed: 0.053 s  <<< ERROR!
java.io.FileNotFoundException: src/test/resources/input/yaddatest.pdf (No such 
file or directory)
at 
org.apache.pdfbox.pdfparser.TestPDFParser.testPDFParserInputStreamScratchFile(TestPDFParser.java:113)

[ERROR] testPDFParserFile(org.apache.pdfbox.pdfparser.TestPDFParser)  Time 
elapsed: 0.047 s  <<< ERROR!
java.io.FileNotFoundException: src/test/resources/input/yaddatest.pdf (No such 
file or directory)
at 
org.apache.pdfbox.pdfparser.TestPDFParser.testPDFParserFile(TestPDFParser.java:95)

[ERROR] testPDFBox3208(org.apache.pdfbox.pdfparser.TestPDFParser)  Time 
elapsed: 0.05 s  <<< ERROR!
java.io.FileNotFoundException: 
target/pdfs/PDFBOX-3208-L33MUTT2SVCWGCS6UIYL5TH3PNPXHIS6.pdf (No such file or 
directory)
at 
org.apache.pdfbox.pdfparser.TestPDFParser.testPDFBox3208(TestPDFParser.java:133)

[ERROR] testPDFBox3783(org.apache.pdfbox.pdfparser.TestPDFParser)  Time 
elapsed: 0.032 s  <<< ERROR!
java.io.FileNotFoundException: 
target/pdfs/PDFBOX-3783-72GLBIGUC6LB46ELZFBARRJTLN4RBSQM.pdf (No such file or 
directory)
at 
org.apache.pdfbox.pdfparser.TestPDFParser.testPDFBox3783(TestPDFParser.java:178)

[ERROR] testPDFBox3785(org.apache.pdfbox.pdfparser.TestPDFParser)  Time 
elapsed: 0.037 s  <<< ERROR!
java.io.FileNotFoundException: target/pdfs/PDFBOX-3785-202097.pdf (No such file 
or directory)
at 
org.apache.pdfbox.pdfparser.TestPDFParser.testPDFBox3785(TestPDFParser.java:190)

[ERROR] testPDFBox3940(org.apache.pdfbox.pdfparser.TestPDFParser)  Time 
elapsed: 

Build failed in Jenkins: PDFBox » PDFBox-2.0.x #1185

2024-03-25 Thread Apache Jenkins Server
See 


Changes:


--
Started by user Tilman Hausherr
Running as SYSTEM
Managed by ci-builds-he-fi@2300674
[EnvInject] - Loading node environment variables.
Building remotely on EC2 (ec2-Jenkins Ephemeral Node User-CLI) - Jenkins 
Ephemeral Node - ubuntu label (i-0988cf10251d119fc) (ubuntu ephemeral) in 
workspace 
Cleaning up 
Deleting 
Updating https://svn.apache.org/repos/asf/pdfbox/branches/2.0 at revision 
'2024-03-25T12:42:11.516 +' --quiet
At revision 1916533

No changes for https://svn.apache.org/repos/asf/pdfbox/branches/2.0 since the 
previous build
Parsing POMs
Modules changed, recalculating dependency graph
Established TCP socket on 41151
maven35-agent.jar already up to date
maven35-interceptor.jar already up to date
maven3-interceptor-commons.jar already up to date
[PDFBox-2.0.x] $ /home/jenkins/tools/java/latest11/bin/java -cp 
/home/jenkins/maven35-agent.jar:/home/jenkins/tools/maven/latest/boot/plexus-classworlds-2.7.0.jar:/home/jenkins/tools/maven/latest/conf/logging
 jenkins.maven3.agent.Maven35Main /home/jenkins/tools/maven/latest 
/tmp/remoting.jar /home/jenkins/maven35-interceptor.jar 
/home/jenkins/maven3-interceptor-commons.jar 41151
<===[JENKINS REMOTING CAPACITY]===>   channel started
Executing Maven:  -B -f 
 clean 
deploy -Ppedantic -Dmaven.source.skip=true
[INFO] Scanning for projects...
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info[INFO] 


[INFO] Reactor Build Order:
[INFO] 
[INFO] Apache PDFBox parent   [pom]
[INFO] Apache FontBox  [bundle]
[INFO] Apache XmpBox   [bundle]
[INFO] Apache PDFBox   [bundle]
[INFO] Apache Preflight[bundle]
[INFO] Apache Preflight application[bundle]
[INFO] Apache PDFBox Debugger [jar]
[INFO] Apache PDFBox tools[jar]
[INFO] Apache PDFBox application   [bundle]
[INFO] Apache PDFBox Debugger application  [bundle]
[INFO] Apache PDFBox examples [jar]
Managed by ci-builds-he-fi@2300674
[INFO] 
[INFO] --< org.apache.pdfbox:pdfbox-parent >---
[INFO] Building Apache PDFBox parent 2.0.32-SNAPSHOT [1/11]
[INFO]   from pom.xml
[INFO] [ pom ]-
[INFO] 
[INFO] --- clean:3.0.0:clean (default-clean) @ pdfbox-parent ---
[INFO] 
[INFO] --- enforcer:1.4.1:enforce (default) @ pdfbox-parent ---
[INFO] 
[INFO] --- enforcer:1.4.1:enforce (enforce-maven-version) @ pdfbox-parent ---
[INFO] 
[INFO] --- remote-resources:1.5:process (process-resource-bundles) @ 
pdfbox-parent ---
[INFO] 
[INFO] --- animal-sniffer:1.17:check (check-java-version) @ pdfbox-parent ---
[INFO] Checking unresolved references to org.codehaus.mojo.signature:java16:1.0
[INFO] 
[INFO] --- site:3.7:attach-descriptor (attach-descriptor) @ pdfbox-parent ---
[INFO] No site descriptor found: nothing to attach.
[INFO] 
[INFO] --- source:3.0.1:jar-no-fork (attach-sources) @ pdfbox-parent ---
[INFO] Skipping source per configuration.
[INFO] 
[INFO] --- apache-rat:0.16.1:check (default) @ pdfbox-parent ---
[INFO] Rat check: Summary over all files. Unapproved: 0, unknown: 0, generated: 
0, approved: 5 licenses.
[INFO] 
[INFO] --- dependency-check:9.0.10:check (default) @ pdfbox-parent ---
[INFO] Checking for updates
[INFO] Skipping the NVD API Update as it was completed within the last 240 
minutes
[INFO] Skipping Known Exploited Vulnerabilities update check since last check 
was within 24 hours.
[INFO] Check for updates complete (475 ms)
[INFO] 

Dependency-Check is an open source tool performing a best effort analysis of 
3rd party dependencies; false positives and false negatives may exist in the 
analysis performed by the tool. Use of the tool and the reporting provided 
constitutes acceptance for use in an AS IS 

Build failed in Jenkins: PDFBox » PDFBox-2.0.x » Apache PDFBox parent #1185

2024-03-25 Thread Apache Jenkins Server
See 


Changes:


--
Established TCP socket on 41151
maven35-agent.jar already up to date
maven35-interceptor.jar already up to date
maven3-interceptor-commons.jar already up to date
<===[JENKINS REMOTING CAPACITY]===>   channel started
Executing Maven:  -B -f 

 clean deploy -Ppedantic -Dmaven.source.skip=true
[INFO] Scanning for projects...
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info
[HUDSON] Collecting dependencies info[INFO] 


[INFO] Reactor Build Order:
[INFO] 
[INFO] Apache PDFBox parent   [pom]
[INFO] Apache FontBox  [bundle]
[INFO] Apache XmpBox   [bundle]
[INFO] Apache PDFBox   [bundle]
[INFO] Apache Preflight[bundle]
[INFO] Apache Preflight application[bundle]
[INFO] Apache PDFBox Debugger [jar]
[INFO] Apache PDFBox tools[jar]
[INFO] Apache PDFBox application   [bundle]
[INFO] Apache PDFBox Debugger application  [bundle]
[INFO] Apache PDFBox examples [jar]
Managed by ci-builds-he-fi@2300674
[INFO] 
[INFO] --< org.apache.pdfbox:pdfbox-parent >---
[INFO] Building Apache PDFBox parent 2.0.32-SNAPSHOT [1/11]
[INFO]   from pom.xml
[INFO] [ pom ]-
[INFO] 
[INFO] --- clean:3.0.0:clean (default-clean) @ pdfbox-parent ---
[INFO] 
[INFO] --- enforcer:1.4.1:enforce (default) @ pdfbox-parent ---
[INFO] 
[INFO] --- enforcer:1.4.1:enforce (enforce-maven-version) @ pdfbox-parent ---
[INFO] 
[INFO] --- remote-resources:1.5:process (process-resource-bundles) @ 
pdfbox-parent ---
[INFO] 
[INFO] --- animal-sniffer:1.17:check (check-java-version) @ pdfbox-parent ---
[INFO] Checking unresolved references to org.codehaus.mojo.signature:java16:1.0
[INFO] 
[INFO] --- site:3.7:attach-descriptor (attach-descriptor) @ pdfbox-parent ---
[INFO] No site descriptor found: nothing to attach.
[INFO] 
[INFO] --- source:3.0.1:jar-no-fork (attach-sources) @ pdfbox-parent ---
[INFO] Skipping source per configuration.
[INFO] 
[INFO] --- apache-rat:0.16.1:check (default) @ pdfbox-parent ---
[INFO] Rat check: Summary over all files. Unapproved: 0, unknown: 0, generated: 
0, approved: 5 licenses.
[INFO] 
[INFO] --- dependency-check:9.0.10:check (default) @ pdfbox-parent ---
[INFO] Checking for updates
[INFO] Skipping the NVD API Update as it was completed within the last 240 
minutes
[INFO] Skipping Known Exploited Vulnerabilities update check since last check 
was within 24 hours.
[INFO] Check for updates complete (475 ms)
[INFO] 

Dependency-Check is an open source tool performing a best effort analysis of 
3rd party dependencies; false positives and false negatives may exist in the 
analysis performed by the tool. Use of the tool and the reporting provided 
constitutes acceptance for use in an AS IS condition, and there are NO 
warranties, implied or otherwise, with regard to the analysis or its use. Any 
use of the tool and the reporting provided is at the user's risk. In no event 
shall the copyright holder or OWASP be held liable for any damages whatsoever 
arising out of or in connection with the use of this tool, the analysis 
performed, or the resulting report.


   About ODC: 
https://jeremylong.github.io/DependencyCheck/general/internals.html
   False Positives: 
https://jeremylong.github.io/DependencyCheck/general/suppression.html

 Sponsor: https://github.com/sponsors/jeremylong


[INFO] Analysis Started
[INFO] Finished File Name Analyzer (0 seconds)
[INFO] Finished Dependency Merging Analyzer (0 seconds)
[INFO] Finished Hint Analyzer (0 seconds)
[INFO] Finished Version Filter Analyzer (0 seconds)
[INFO] Created CPE Index (2 seconds)
[INFO] Finished CPE Analyzer (3 seconds)
[INFO] Finished False Positive Analyzer (0 seconds)
[INFO] Finished NVD CVE Analyzer (0 seconds)
[INFO] Finished Sonatype OSS Index Analyzer (0 seconds)
[INFO] Finished Vulnerability Suppression Analyzer (0 seconds)
[INFO] 

Build failed in Jenkins: PDFBox » PDFBox-2.0.x #1184

2024-03-25 Thread Apache Jenkins Server
See 


Changes:

[Tilman Hausherr] PDFBOX-5660: remove NVD cache


--
[...truncated 153.24 KB...]
[INFO] Downloaded 30,000/242,587 (12%)
[INFO] Downloaded 40,000/242,587 (16%)
[INFO] Downloaded 50,000/242,587 (21%)
[INFO] Downloaded 60,000/242,587 (25%)
[INFO] Downloaded 70,000/242,587 (29%)
[INFO] Downloaded 80,000/242,587 (33%)
[INFO] Downloaded 90,000/242,587 (37%)
[INFO] Downloaded 100,000/242,587 (41%)
[INFO] Downloaded 110,000/242,587 (45%)
[INFO] Downloaded 120,000/242,587 (49%)
[INFO] Downloaded 130,000/242,587 (54%)
[INFO] Downloaded 140,000/242,587 (58%)
[INFO] Downloaded 150,000/242,587 (62%)
[INFO] Downloaded 160,000/242,587 (66%)
[INFO] Downloaded 170,000/242,587 (70%)
[INFO] Downloaded 180,000/242,587 (74%)
[INFO] Downloaded 190,000/242,587 (78%)
[INFO] Downloaded 200,000/242,587 (82%)
[INFO] Downloaded 210,000/242,587 (87%)
[INFO] Downloaded 220,000/242,587 (91%)
[INFO] Downloaded 230,000/242,587 (95%)
[INFO] Downloaded 240,000/242,587 (99%)
[INFO] Downloaded 242,587/242,587 (100%)
[INFO] Completed processing batch 1/122 (1%) in 2,246ms
[INFO] Completed processing batch 2/122 (2%) in 2,142ms
[INFO] Completed processing batch 3/122 (2%) in 1,794ms
[INFO] Completed processing batch 4/122 (3%) in 1,641ms
[INFO] Completed processing batch 5/122 (4%) in 2,132ms
[INFO] Completed processing batch 6/122 (5%) in 1,754ms
[INFO] Completed processing batch 7/122 (6%) in 1,750ms
[INFO] Completed processing batch 8/122 (7%) in 1,585ms
[INFO] Completed processing batch 9/122 (7%) in 1,480ms
[INFO] Completed processing batch 10/122 (8%) in 1,700ms
[INFO] Completed processing batch 11/122 (9%) in 1,524ms
[INFO] Completed processing batch 12/122 (10%) in 1,373ms
[INFO] Completed processing batch 13/122 (11%) in 1,324ms
[INFO] Completed processing batch 14/122 (11%) in 1,308ms
[INFO] Completed processing batch 15/122 (12%) in 1,327ms
[INFO] Completed processing batch 16/122 (13%) in 1,638ms
[INFO] Completed processing batch 17/122 (14%) in 1,988ms
[INFO] Completed processing batch 18/122 (15%) in 1,867ms
[INFO] Completed processing batch 19/122 (16%) in 1,944ms
[INFO] Completed processing batch 20/122 (16%) in 2,071ms
[INFO] Completed processing batch 21/122 (17%) in 2,449ms
[INFO] Completed processing batch 22/122 (18%) in 2,692ms
[INFO] Completed processing batch 23/122 (19%) in 3,157ms
[INFO] Completed processing batch 24/122 (20%) in 3,204ms
[INFO] Completed processing batch 25/122 (20%) in 2,304ms
[INFO] Completed processing batch 26/122 (21%) in 3,432ms
[INFO] Completed processing batch 27/122 (22%) in 2,955ms
[INFO] Completed processing batch 28/122 (23%) in 2,893ms
[INFO] Completed processing batch 29/122 (24%) in 4,038ms
[INFO] Completed processing batch 30/122 (25%) in 2,307ms
[INFO] Completed processing batch 31/122 (25%) in 2,556ms
[INFO] Completed processing batch 32/122 (26%) in 2,055ms
[INFO] Completed processing batch 33/122 (27%) in 1,445ms
[INFO] Completed processing batch 34/122 (28%) in 1,841ms
[INFO] Completed processing batch 35/122 (29%) in 1,615ms
[INFO] Completed processing batch 36/122 (30%) in 1,309ms
[INFO] Completed processing batch 37/122 (30%) in 1,069ms
[INFO] Completed processing batch 38/122 (31%) in 1,733ms
[INFO] Completed processing batch 39/122 (32%) in 1,370ms
[INFO] Completed processing batch 40/122 (33%) in 1,419ms
[INFO] Completed processing batch 41/122 (34%) in 1,462ms
[INFO] Completed processing batch 42/122 (34%) in 953ms
[INFO] Completed processing batch 43/122 (35%) in 1,171ms
[INFO] Completed processing batch 44/122 (36%) in 300ms
[INFO] Completed processing batch 45/122 (37%) in 380ms
[INFO] Completed processing batch 46/122 (38%) in 960ms
[INFO] Completed processing batch 47/122 (39%) in 1,229ms
[INFO] Completed processing batch 48/122 (39%) in 1,125ms
[INFO] Completed processing batch 49/122 (40%) in 890ms
[INFO] Completed processing batch 50/122 (41%) in 934ms
[INFO] Completed processing batch 51/122 (42%) in 877ms
[INFO] Completed processing batch 52/122 (43%) in 607ms
[INFO] Completed processing batch 53/122 (43%) in 760ms
[INFO] Completed processing batch 54/122 (44%) in 1,050ms
[INFO] Completed processing batch 55/122 (45%) in 1,305ms
[INFO] Completed processing batch 56/122 (46%) in 968ms
[INFO] Completed processing batch 57/122 (47%) in 940ms
[INFO] Completed processing batch 58/122 (48%) in 1,118ms
[INFO] Completed processing batch 59/122 (48%) in 938ms
[INFO] Completed processing batch 60/122 (49%) in 677ms
[INFO] Completed processing batch 61/122 (50%) in 942ms
[INFO] Completed processing batch 62/122 (51%) in 1,043ms
[INFO] Completed processing batch 63/122 (52%) in 698ms
[INFO] Completed processing batch 64/122 (52%) in 917ms
[INFO] Completed processing batch 65/122 (53%) in 880ms
[INFO] Completed processing batch 66/122 (54%) in 1,035ms
[INFO] Completed processing batch 67/122 (55%) in 750ms
[INFO] Completed 

Build failed in Jenkins: PDFBox » PDFBox-2.0.x » Apache PDFBox parent #1184

2024-03-25 Thread Apache Jenkins Server
See 


Changes:


--
[...truncated 144.27 KB...]
[INFO] Downloaded from central: 
https://repo.maven.apache.org/maven2/org/apache/maven/maven-api-meta/4.0.0-alpha-7/maven-api-meta-4.0.0-alpha-7.jar
 (12 kB at 27 kB/s)
[INFO] Downloading from central: 
https://repo.maven.apache.org/maven2/org/apache/maven/shared/maven-artifact-transfer/0.13.1/maven-artifact-transfer-0.13.1.jar
[INFO] Downloaded from central: 
https://repo.maven.apache.org/maven2/org/eclipse/aether/aether-util/1.0.0.v20140518/aether-util-1.0.0.v20140518.jar
 (146 kB at 343 kB/s)
[INFO] Downloading from central: 
https://repo.maven.apache.org/maven2/org/apache/maven/maven-core/3.1.0/maven-core-3.1.0.jar
[INFO] Downloaded from central: 
https://repo.maven.apache.org/maven2/com/google/guava/guava/32.1.3-jre/guava-32.1.3-jre.jar
 (3.0 MB at 7.1 MB/s)
[INFO] Downloading from central: 
https://repo.maven.apache.org/maven2/org/apache/maven/maven-settings/3.1.0/maven-settings-3.1.0.jar
[INFO] Downloaded from central: 
https://repo.maven.apache.org/maven2/org/eclipse/aether/aether-api/1.0.0.v20140518/aether-api-1.0.0.v20140518.jar
 (136 kB at 315 kB/s)
[INFO] Downloading from central: 
https://repo.maven.apache.org/maven2/org/apache/maven/maven-settings-builder/3.1.0/maven-settings-builder-3.1.0.jar
[INFO] Downloaded from central: 
https://repo.maven.apache.org/maven2/org/apache/maven/maven-settings/3.1.0/maven-settings-3.1.0.jar
 (47 kB at 107 kB/s)
[INFO] Downloading from central: 
https://repo.maven.apache.org/maven2/org/apache/maven/maven-repository-metadata/3.1.0/maven-repository-metadata-3.1.0.jar
[INFO] Downloaded from central: 
https://repo.maven.apache.org/maven2/org/apache/maven/maven-settings-builder/3.1.0/maven-settings-builder-3.1.0.jar
 (41 kB at 95 kB/s)
[INFO] Downloading from central: 
https://repo.maven.apache.org/maven2/org/apache/maven/maven-plugin-api/3.1.0/maven-plugin-api-3.1.0.jar
[INFO] Downloaded from central: 
https://repo.maven.apache.org/maven2/org/apache/maven/maven-repository-metadata/3.1.0/maven-repository-metadata-3.1.0.jar
 (30 kB at 68 kB/s)
[INFO] Downloading from central: 
https://repo.maven.apache.org/maven2/org/apache/maven/maven-model-builder/3.1.0/maven-model-builder-3.1.0.jar
[INFO] Downloaded from central: 
https://repo.maven.apache.org/maven2/org/apache/maven/maven-plugin-api/3.1.0/maven-plugin-api-3.1.0.jar
 (50 kB at 113 kB/s)
[INFO] Downloading from central: 
https://repo.maven.apache.org/maven2/org/apache/maven/maven-aether-provider/3.1.0/maven-aether-provider-3.1.0.jar
[INFO] Downloaded from central: 
https://repo.maven.apache.org/maven2/org/apache/maven/shared/maven-artifact-transfer/0.13.1/maven-artifact-transfer-0.13.1.jar
 (159 kB at 358 kB/s)
[INFO] Downloading from central: 
https://repo.maven.apache.org/maven2/org/eclipse/aether/aether-spi/0.9.0.M2/aether-spi-0.9.0.M2.jar
[INFO] Downloaded from central: 
https://repo.maven.apache.org/maven2/org/eclipse/aether/aether-spi/0.9.0.M2/aether-spi-0.9.0.M2.jar
 (18 kB at 40 kB/s)
[INFO] Downloading from central: 
https://repo.maven.apache.org/maven2/org/eclipse/aether/aether-impl/0.9.0.M2/aether-impl-0.9.0.M2.jar
[INFO] Downloaded from central: 
https://repo.maven.apache.org/maven2/org/apache/maven/maven-model-builder/3.1.0/maven-model-builder-3.1.0.jar
 (159 kB at 353 kB/s)
[INFO] Downloading from central: 
https://repo.maven.apache.org/maven2/org/eclipse/sisu/org.eclipse.sisu.plexus/0.0.0.M2a/org.eclipse.sisu.plexus-0.0.0.M2a.jar
[INFO] Downloaded from central: 
https://repo.maven.apache.org/maven2/org/eclipse/aether/aether-impl/0.9.0.M2/aether-impl-0.9.0.M2.jar
 (145 kB at 318 kB/s)
[INFO] Downloading from central: 
https://repo.maven.apache.org/maven2/org/sonatype/sisu/sisu-guice/3.1.0/sisu-guice-3.1.0-no_aop.jar
[INFO] Downloaded from central: 
https://repo.maven.apache.org/maven2/org/apache/maven/maven-aether-provider/3.1.0/maven-aether-provider-3.1.0.jar
 (60 kB at 131 kB/s)
[INFO] Downloading from central: 
https://repo.maven.apache.org/maven2/aopalliance/aopalliance/1.0/aopalliance-1.0.jar
[INFO] Downloaded from central: 
https://repo.maven.apache.org/maven2/aopalliance/aopalliance/1.0/aopalliance-1.0.jar
 (4.5 kB at 9.6 kB/s)
[INFO] Downloading from central: 
https://repo.maven.apache.org/maven2/org/eclipse/sisu/org.eclipse.sisu.inject/0.0.0.M2a/org.eclipse.sisu.inject-0.0.0.M2a.jar
[INFO] Downloaded from central: 
https://repo.maven.apache.org/maven2/org/eclipse/sisu/org.eclipse.sisu.plexus/0.0.0.M2a/org.eclipse.sisu.plexus-0.0.0.M2a.jar
 (202 kB at 431 kB/s)
[INFO] Downloading from central: 
https://repo.maven.apache.org/maven2/asm/asm/3.3.1/asm-3.3.1.jar
[INFO] Downloaded from central: 
https://repo.maven.apache.org/maven2/org/sonatype/sisu/sisu-guice/3.1.0/sisu-guice-3.1.0-no_aop.jar
 (357 kB at 760 kB/s)
[INFO] Downloading from central: 

[jira] [Commented] (PDFBOX-5660) Improve code quality (5)

2024-03-25 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830449#comment-17830449
 ] 

ASF subversion and git services commented on PDFBOX-5660:
-

Commit 1916532 from Tilman Hausherr in branch 'pdfbox/branches/2.0'
[ https://svn.apache.org/r1916532 ]

PDFBOX-5660: remove NVD cache

> Improve code quality (5)
> 
>
> Key: PDFBOX-5660
> URL: https://issues.apache.org/jira/browse/PDFBOX-5660
> Project: PDFBox
>  Issue Type: Improvement
>Reporter: Tilman Hausherr
>Priority: Minor
> Attachments: AnnotationSample.Standard.pdf, 
> DRY_refactoring_Typ2CharStringParser.patch, 
> Removed_the_readFully_method_in_the_PfbParser_class_and_replaced__with_calling_readAllByte.patch,
>  
> Simplify_list_and_map_operations,_use_known_size_when_creating_StringBuilder.patch,
>  Simplify_string_conversion_in_PDFHighlighter.patch, 
> Update_string_handling_and_regex_in_several_classes.patch, 
> avoid_multiple_unboxing.patch, code_cleanup.patch, 
> do_not_create_temporary_File_instance.patch, 
> extract_common_code,_move_toUpperCase()_out_of_loop.patch, 
> fix_HTML_error_in_Javadoc.patch, fix_javadoc_problems.patch, 
> introduce_COSArray_of(float___)_to_make_the_code_more_concise_and_avoid_creating_and_copyi.patch,
>  introduce_StringUtil_class_for_reusable_functionality.patch, 
> introduce_constants_COSFLOAT_ZERO_and_COSFloat_ONE_to_avoid_creating_unnecessary_instances.patch,
>  make_inner_class_static.patch, refactor_isEndOfName.patch, 
> remove_code_duplication_in_Type2CharStringParser.patch, 
> remove_obsolete_class_NullOutputStream.patch, 
> remove_unnecessary_calls_to_toString()_String_valueOf().patch, 
> replace_System_getProperty()_calls.patch, screenshot-1.png, 
> simplify_hashCode()_and_equals(),_test_name_first_because_Map_equals()_is_expensive.patch,
>  simplify_stream_operations.patch, use_Map_ofEntries().patch, 
> use_Math_min()_to_make_code_more_readable.patch, use_Objects_equals().patch, 
> use_String_isEmpty()_Collection_isEmpty()_instead_of_checking_length_size.patch,
>  use_String_join().patch, use_switch_for_readability.patch, 
> use_try-with-resources_(since_Java_9_the_variable_declaration_in_the_try_is_not_necessary_.patch
>
>
> This is a longterm issue for the task to improve code quality, by using the 
> SonarQube report, hints in different IDEs, the FindBugs tool and other code 
> quality tools.
> This is a follow-up of PDFBOX-4892, which was getting too long.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5660) Improve code quality (5)

2024-03-25 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830432#comment-17830432
 ] 

ASF subversion and git services commented on PDFBOX-5660:
-

Commit 1916531 from Tilman Hausherr in branch 'pdfbox/branches/3.0'
[ https://svn.apache.org/r1916531 ]

PDFBOX-5660: update commons logging

> Improve code quality (5)
> 
>
> Key: PDFBOX-5660
> URL: https://issues.apache.org/jira/browse/PDFBOX-5660
> Project: PDFBox
>  Issue Type: Improvement
>Reporter: Tilman Hausherr
>Priority: Minor
> Attachments: AnnotationSample.Standard.pdf, 
> DRY_refactoring_Typ2CharStringParser.patch, 
> Removed_the_readFully_method_in_the_PfbParser_class_and_replaced__with_calling_readAllByte.patch,
>  
> Simplify_list_and_map_operations,_use_known_size_when_creating_StringBuilder.patch,
>  Simplify_string_conversion_in_PDFHighlighter.patch, 
> Update_string_handling_and_regex_in_several_classes.patch, 
> avoid_multiple_unboxing.patch, code_cleanup.patch, 
> do_not_create_temporary_File_instance.patch, 
> extract_common_code,_move_toUpperCase()_out_of_loop.patch, 
> fix_HTML_error_in_Javadoc.patch, fix_javadoc_problems.patch, 
> introduce_COSArray_of(float___)_to_make_the_code_more_concise_and_avoid_creating_and_copyi.patch,
>  introduce_StringUtil_class_for_reusable_functionality.patch, 
> introduce_constants_COSFLOAT_ZERO_and_COSFloat_ONE_to_avoid_creating_unnecessary_instances.patch,
>  make_inner_class_static.patch, refactor_isEndOfName.patch, 
> remove_code_duplication_in_Type2CharStringParser.patch, 
> remove_obsolete_class_NullOutputStream.patch, 
> remove_unnecessary_calls_to_toString()_String_valueOf().patch, 
> replace_System_getProperty()_calls.patch, screenshot-1.png, 
> simplify_hashCode()_and_equals(),_test_name_first_because_Map_equals()_is_expensive.patch,
>  simplify_stream_operations.patch, use_Map_ofEntries().patch, 
> use_Math_min()_to_make_code_more_readable.patch, use_Objects_equals().patch, 
> use_String_isEmpty()_Collection_isEmpty()_instead_of_checking_length_size.patch,
>  use_String_join().patch, use_switch_for_readability.patch, 
> use_try-with-resources_(since_Java_9_the_variable_declaration_in_the_try_is_not_necessary_.patch
>
>
> This is a longterm issue for the task to improve code quality, by using the 
> SonarQube report, hints in different IDEs, the FindBugs tool and other code 
> quality tools.
> This is a follow-up of PDFBOX-4892, which was getting too long.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



Jenkins build is back to normal : PDFBox » PDFBox-trunk #2055

2024-03-25 Thread Apache Jenkins Server
See 



-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Resolved] (PDFBOX-5789) Remove release subproject

2024-03-25 Thread Jira


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler resolved PDFBOX-5789.

Fix Version/s: 2.0.32
   4.0.0
   3.0.3 PDFBox
   Resolution: Fixed

> Remove release subproject
> -
>
> Key: PDFBOX-5789
> URL: https://issues.apache.org/jira/browse/PDFBOX-5789
> Project: PDFBox
>  Issue Type: Bug
>Affects Versions: 2.0.30, 3.0.2 PDFBox, 4.0.0
>Reporter: Andreas Lehmkühler
>Assignee: Andreas Lehmkühler
>Priority: Major
> Fix For: 2.0.32, 4.0.0, 3.0.3 PDFBox
>
>
> PDFBOX-5699 introduce the new subproject "release" in order to fix some issue 
> with the SCM-URL. 
> In the hindsight it turns out to be an issue. The release project doesn't 
> include any source code and therefore is excluded from the source zip. But as 
> it is still a part of the project itself, it leads to a broken build if 
> someone uses the zip to do so.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



Build failed in Jenkins: PDFBox » PDFBox-trunk #2054

2024-03-25 Thread Apache Jenkins Server
See 


Changes:


--
Started by an SCM change
Running as SYSTEM
Managed by ci-builds-he-fi@2300674
[EnvInject] - Loading node environment variables.
Building remotely on builds41 (ubuntu) in workspace 

Cleaning up 
ERROR: Failed to update http://svn.apache.org/repos/asf/pdfbox/trunk
org.tmatesoft.svn.core.SVNException: svn: E155032: The pristine text with 
checksum '$sha1$0d8a658d1ecd3fa00dd8f46d24a9d2fdef18f3e4' was found in the DB 
but not on disk
at 
org.tmatesoft.svn.core.internal.wc.SVNErrorManager.error(SVNErrorManager.java:70)
at 
org.tmatesoft.svn.core.internal.wc.SVNErrorManager.error(SVNErrorManager.java:57)
at 
org.tmatesoft.svn.core.internal.wc17.db.SvnWcDbPristines.checkPristine(SvnWcDbPristines.java:159)
at 
org.tmatesoft.svn.core.internal.wc17.db.SvnWcDbPristines.getPristinePath(SvnWcDbPristines.java:184)
at 
org.tmatesoft.svn.core.internal.wc17.db.SVNWCDb.getPristinePath(SVNWCDb.java:1724)
at 
org.tmatesoft.svn.core.internal.wc17.SVNWCContext.isTextModified(SVNWCContext.java:754)
at 
org.tmatesoft.svn.core.internal.wc17.SVNStatusEditor17.assembleStatus(SVNStatusEditor17.java:356)
at 
org.tmatesoft.svn.core.internal.wc17.SVNStatusEditor17.sendStatusStructure(SVNStatusEditor17.java:216)
at 
org.tmatesoft.svn.core.internal.wc17.SVNStatusEditor17.getDirStatus(SVNStatusEditor17.java:742)
at 
org.tmatesoft.svn.core.internal.wc17.SVNStatusEditor17.walkStatus(SVNStatusEditor17.java:665)
at 
org.tmatesoft.svn.core.internal.wc2.ng.SvnNgGetStatus.run(SvnNgGetStatus.java:132)
at 
org.tmatesoft.svn.core.internal.wc2.ng.SvnNgGetStatus.run(SvnNgGetStatus.java:27)
at 
org.tmatesoft.svn.core.internal.wc2.ng.SvnNgOperationRunner.run(SvnNgOperationRunner.java:20)
at 
org.tmatesoft.svn.core.internal.wc2.SvnOperationRunner.run(SvnOperationRunner.java:21)
at 
org.tmatesoft.svn.core.wc2.SvnOperationFactory.run(SvnOperationFactory.java:1239)
at org.tmatesoft.svn.core.wc2.SvnOperation.run(SvnOperation.java:294)
at 
org.tmatesoft.svn.core.wc.SVNStatusClient.doStatus(SVNStatusClient.java:363)
at 
hudson.scm.subversion.UpdateWithCleanUpdater$TaskImpl.preUpdate(UpdateWithCleanUpdater.java:70)
at 
hudson.scm.subversion.UpdateUpdater$TaskImpl.perform(UpdateUpdater.java:158)
at 
hudson.scm.subversion.WorkspaceUpdater$UpdateTask.delegateTo(WorkspaceUpdater.java:168)
at 
hudson.scm.SubversionSCM$CheckOutUpdateTask.perform(SubversionSCM.java:1086)
at 
hudson.scm.SubversionSCM$CheckOutUpdateTask.run(SubversionSCM.java:1067)
at hudson.scm.SubversionSCM$CheckOutTask.invoke(SubversionSCM.java:1037)
at hudson.scm.SubversionSCM$CheckOutTask.invoke(SubversionSCM.java:1020)
at hudson.FilePath$FileCallableWrapper.call(FilePath.java:3616)
at hudson.remoting.UserRequest.perform(UserRequest.java:211)
at hudson.remoting.UserRequest.perform(UserRequest.java:54)
at hudson.remoting.Request$2.run(Request.java:377)
at 
hudson.remoting.InterceptingExecutorService.lambda$wrap$0(InterceptingExecutorService.java:78)
at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
at 
java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
at 
java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
at java.base/java.lang.Thread.run(Thread.java:829)
ERROR: Subversion update failed
hudson.remoting.ProxyException: org.tmatesoft.svn.core.SVNException: svn: 
E155032: The pristine text with checksum 
'$sha1$0d8a658d1ecd3fa00dd8f46d24a9d2fdef18f3e4' was found in the DB but not on 
disk
at 
org.tmatesoft.svn.core.internal.wc.SVNErrorManager.error(SVNErrorManager.java:70)
at 
org.tmatesoft.svn.core.internal.wc.SVNErrorManager.error(SVNErrorManager.java:57)
at 
org.tmatesoft.svn.core.internal.wc17.db.SvnWcDbPristines.checkPristine(SvnWcDbPristines.java:159)
at 
org.tmatesoft.svn.core.internal.wc17.db.SvnWcDbPristines.getPristinePath(SvnWcDbPristines.java:184)
at 
org.tmatesoft.svn.core.internal.wc17.db.SVNWCDb.getPristinePath(SVNWCDb.java:1724)
at 
org.tmatesoft.svn.core.internal.wc17.SVNWCContext.isTextModified(SVNWCContext.java:754)
at 
org.tmatesoft.svn.core.internal.wc17.SVNStatusEditor17.assembleStatus(SVNStatusEditor17.java:356)
at 
org.tmatesoft.svn.core.internal.wc17.SVNStatusEditor17.sendStatusStructure(SVNStatusEditor17.java:216)
at 
org.tmatesoft.svn.core.internal.wc17.SVNStatusEditor17.getDirStatus(SVNStatusEditor17.java:742)
at 

[jira] [Commented] (PDFBOX-5790) Don't use a predefined CMap if a ToUnicode CMap is present

2024-03-25 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830360#comment-17830360
 ] 

ASF subversion and git services commented on PDFBOX-5790:
-

Commit 1916527 from le...@apache.org in branch 'pdfbox/branches/2.0'
[ https://svn.apache.org/r1916527 ]

PDFBOX-5790: don't use a predefined CMap if a ToUnicode CMap is present

> Don't use a predefined CMap if a ToUnicode CMap is present
> --
>
> Key: PDFBOX-5790
> URL: https://issues.apache.org/jira/browse/PDFBOX-5790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Text extraction
>Affects Versions: 2.0.31, 4.0.0, 3.0.3 PDFBox
>Reporter: Andreas Lehmkühler
>Assignee: Andreas Lehmkühler
>Priority: Major
> Attachments: p4_fix.pdf
>
>
> The user Luiz Marcelo Modesto reported an issue with the text extraction of 
> the attached pdf  [^p4_fix.pdf] 
> {quote}
> Hi everyone,
> I'm not sure if this is the same as FAQ "How come I am getting 
> gibberish(G38G43G36G51G5) when extracting text?"...
> I'm using PDFBox version 3.0.1 and OpenJDK Runtime Environment (build 
> 11.0.22+7-post-Ubuntu-0ubuntu222.04.1).
> I'm trying to understand how this PDF chunk (from p4_fix.pdf attached)
>   BT
>   /G1F7 6.0 Tf
>   94.871 773.806 Td
>   <004200430044> Tj
>   ET
> becomes "BCD" on PDFBox Debugger (the same on qpdfview, Adobe Reader, 
> Chrome, ...) and becomes "abc" on PDFBox text extraction tool. 
> Using the Poppler pdftotext (version 22.02.0) gives me "BCD" too.
> The renders that allow me to copy the text give me "BCD" text.
> It seems that PDFBox extraction tool follows the item "9.10.2 Mapping 
> character codes to Unicode values" (ISO 32000-2:2020) but all the others 
> choose a different way.
>  Could you help me to understand if there is a problem with the PDF file, 
> with the renders or with the extract text tool? 
> Thank you!
> {quote}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5790) Don't use a predefined CMap if a ToUnicode CMap is present

2024-03-25 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830359#comment-17830359
 ] 

ASF subversion and git services commented on PDFBOX-5790:
-

Commit 1916526 from le...@apache.org in branch 'pdfbox/branches/3.0'
[ https://svn.apache.org/r1916526 ]

PDFBOX-5790: don't use a predefined CMap if a ToUnicode CMap is present

> Don't use a predefined CMap if a ToUnicode CMap is present
> --
>
> Key: PDFBOX-5790
> URL: https://issues.apache.org/jira/browse/PDFBOX-5790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Text extraction
>Affects Versions: 2.0.31, 4.0.0, 3.0.3 PDFBox
>Reporter: Andreas Lehmkühler
>Assignee: Andreas Lehmkühler
>Priority: Major
> Attachments: p4_fix.pdf
>
>
> The user Luiz Marcelo Modesto reported an issue with the text extraction of 
> the attached pdf  [^p4_fix.pdf] 
> {quote}
> Hi everyone,
> I'm not sure if this is the same as FAQ "How come I am getting 
> gibberish(G38G43G36G51G5) when extracting text?"...
> I'm using PDFBox version 3.0.1 and OpenJDK Runtime Environment (build 
> 11.0.22+7-post-Ubuntu-0ubuntu222.04.1).
> I'm trying to understand how this PDF chunk (from p4_fix.pdf attached)
>   BT
>   /G1F7 6.0 Tf
>   94.871 773.806 Td
>   <004200430044> Tj
>   ET
> becomes "BCD" on PDFBox Debugger (the same on qpdfview, Adobe Reader, 
> Chrome, ...) and becomes "abc" on PDFBox text extraction tool. 
> Using the Poppler pdftotext (version 22.02.0) gives me "BCD" too.
> The renders that allow me to copy the text give me "BCD" text.
> It seems that PDFBox extraction tool follows the item "9.10.2 Mapping 
> character codes to Unicode values" (ISO 32000-2:2020) but all the others 
> choose a different way.
>  Could you help me to understand if there is a problem with the PDF file, 
> with the renders or with the extract text tool? 
> Thank you!
> {quote}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5790) Don't use a predefined CMap if a ToUnicode CMap is present

2024-03-25 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830358#comment-17830358
 ] 

ASF subversion and git services commented on PDFBOX-5790:
-

Commit 1916525 from le...@apache.org in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1916525 ]

PDFBOX-5790: don't use a predefined CMap if a ToUnicode CMap is present

> Don't use a predefined CMap if a ToUnicode CMap is present
> --
>
> Key: PDFBOX-5790
> URL: https://issues.apache.org/jira/browse/PDFBOX-5790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Text extraction
>Affects Versions: 2.0.31, 4.0.0, 3.0.3 PDFBox
>Reporter: Andreas Lehmkühler
>Assignee: Andreas Lehmkühler
>Priority: Major
> Attachments: p4_fix.pdf
>
>
> The user Luiz Marcelo Modesto reported an issue with the text extraction of 
> the attached pdf  [^p4_fix.pdf] 
> {quote}
> Hi everyone,
> I'm not sure if this is the same as FAQ "How come I am getting 
> gibberish(G38G43G36G51G5) when extracting text?"...
> I'm using PDFBox version 3.0.1 and OpenJDK Runtime Environment (build 
> 11.0.22+7-post-Ubuntu-0ubuntu222.04.1).
> I'm trying to understand how this PDF chunk (from p4_fix.pdf attached)
>   BT
>   /G1F7 6.0 Tf
>   94.871 773.806 Td
>   <004200430044> Tj
>   ET
> becomes "BCD" on PDFBox Debugger (the same on qpdfview, Adobe Reader, 
> Chrome, ...) and becomes "abc" on PDFBox text extraction tool. 
> Using the Poppler pdftotext (version 22.02.0) gives me "BCD" too.
> The renders that allow me to copy the text give me "BCD" text.
> It seems that PDFBox extraction tool follows the item "9.10.2 Mapping 
> character codes to Unicode values" (ISO 32000-2:2020) but all the others 
> choose a different way.
>  Could you help me to understand if there is a problem with the PDF file, 
> with the renders or with the extract text tool? 
> Thank you!
> {quote}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5790) Don't use a predefined CMap if a ToUnicode CMap is present

2024-03-25 Thread Jira


[ 
https://issues.apache.org/jira/browse/PDFBOX-5790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830357#comment-17830357
 ] 

Andreas Lehmkühler commented on PDFBOX-5790:


It turns out that the given ToUnicode CMap is incomplete and doesn't provide 
any valid mapping. The current implementation uses a possible predefined CMap 
is such cases. The given pdf uses Adobe-Japan1 which produces a wrong mapping.

Some tests reveals that it is a good idea to strictly follow the spec in such 
cases: use the provided ToUnicode CMap. If it doesn't produce any valid 
mapping, use identity mapping.

> Don't use a predefined CMap if a ToUnicode CMap is present
> --
>
> Key: PDFBOX-5790
> URL: https://issues.apache.org/jira/browse/PDFBOX-5790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Text extraction
>Affects Versions: 2.0.31, 4.0.0, 3.0.3 PDFBox
>Reporter: Andreas Lehmkühler
>Assignee: Andreas Lehmkühler
>Priority: Major
> Attachments: p4_fix.pdf
>
>
> The user Luiz Marcelo Modesto reported an issue with the text extraction of 
> the attached pdf  [^p4_fix.pdf] 
> {quote}
> Hi everyone,
> I'm not sure if this is the same as FAQ "How come I am getting 
> gibberish(G38G43G36G51G5) when extracting text?"...
> I'm using PDFBox version 3.0.1 and OpenJDK Runtime Environment (build 
> 11.0.22+7-post-Ubuntu-0ubuntu222.04.1).
> I'm trying to understand how this PDF chunk (from p4_fix.pdf attached)
>   BT
>   /G1F7 6.0 Tf
>   94.871 773.806 Td
>   <004200430044> Tj
>   ET
> becomes "BCD" on PDFBox Debugger (the same on qpdfview, Adobe Reader, 
> Chrome, ...) and becomes "abc" on PDFBox text extraction tool. 
> Using the Poppler pdftotext (version 22.02.0) gives me "BCD" too.
> The renders that allow me to copy the text give me "BCD" text.
> It seems that PDFBox extraction tool follows the item "9.10.2 Mapping 
> character codes to Unicode values" (ISO 32000-2:2020) but all the others 
> choose a different way.
>  Could you help me to understand if there is a problem with the PDF file, 
> with the renders or with the extract text tool? 
> Thank you!
> {quote}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-5790) Don't use a predefined CMap if a ToUnicode CMap is present

2024-03-25 Thread Jira


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-5790:
---
Description: 
The user Luiz Marcelo Modesto reported an issue with the text extraction of the 
attached pdf  [^p4_fix.pdf] 

{quote}
Hi everyone,

I'm not sure if this is the same as FAQ "How come I am getting 
gibberish(G38G43G36G51G5) when extracting text?"...

I'm using PDFBox version 3.0.1 and OpenJDK Runtime Environment (build 
11.0.22+7-post-Ubuntu-0ubuntu222.04.1).

I'm trying to understand how this PDF chunk (from p4_fix.pdf attached)

  BT
  /G1F7 6.0 Tf
  94.871 773.806 Td
  <004200430044> Tj
  ET

becomes "BCD" on PDFBox Debugger (the same on qpdfview, Adobe Reader, 
Chrome, ...) and becomes "abc" on PDFBox text extraction tool. 

Using the Poppler pdftotext (version 22.02.0) gives me "BCD" too.

The renders that allow me to copy the text give me "BCD" text.

It seems that PDFBox extraction tool follows the item "9.10.2 Mapping 
character codes to Unicode values" (ISO 32000-2:2020) but all the others choose 
a different way.

 Could you help me to understand if there is a problem with the PDF file, 
with the renders or with the extract text tool? 

Thank you!
{quote}



  was:
The user Luiz Marcelo Modesto reported an issue with the text extraction of the 
attached pdf

{quote}
Hi everyone,

I'm not sure if this is the same as FAQ "How come I am getting 
gibberish(G38G43G36G51G5) when extracting text?"...

I'm using PDFBox version 3.0.1 and OpenJDK Runtime Environment (build 
11.0.22+7-post-Ubuntu-0ubuntu222.04.1).

I'm trying to understand how this PDF chunk (from p4_fix.pdf attached)

  BT
  /G1F7 6.0 Tf
  94.871 773.806 Td
  <004200430044> Tj
  ET

becomes "BCD" on PDFBox Debugger (the same on qpdfview, Adobe Reader, 
Chrome, ...) and becomes "abc" on PDFBox text extraction tool. 

Using the Poppler pdftotext (version 22.02.0) gives me "BCD" too.

The renders that allow me to copy the text give me "BCD" text.

It seems that PDFBox extraction tool follows the item "9.10.2 Mapping 
character codes to Unicode values" (ISO 32000-2:2020) but all the others choose 
a different way.

 Could you help me to understand if there is a problem with the PDF file, 
with the renders or with the extract text tool? 

Thank you!
{quote}




> Don't use a predefined CMap if a ToUnicode CMap is present
> --
>
> Key: PDFBOX-5790
> URL: https://issues.apache.org/jira/browse/PDFBOX-5790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Text extraction
>Affects Versions: 2.0.31, 4.0.0, 3.0.3 PDFBox
>Reporter: Andreas Lehmkühler
>Assignee: Andreas Lehmkühler
>Priority: Major
> Attachments: p4_fix.pdf
>
>
> The user Luiz Marcelo Modesto reported an issue with the text extraction of 
> the attached pdf  [^p4_fix.pdf] 
> {quote}
> Hi everyone,
> I'm not sure if this is the same as FAQ "How come I am getting 
> gibberish(G38G43G36G51G5) when extracting text?"...
> I'm using PDFBox version 3.0.1 and OpenJDK Runtime Environment (build 
> 11.0.22+7-post-Ubuntu-0ubuntu222.04.1).
> I'm trying to understand how this PDF chunk (from p4_fix.pdf attached)
>   BT
>   /G1F7 6.0 Tf
>   94.871 773.806 Td
>   <004200430044> Tj
>   ET
> becomes "BCD" on PDFBox Debugger (the same on qpdfview, Adobe Reader, 
> Chrome, ...) and becomes "abc" on PDFBox text extraction tool. 
> Using the Poppler pdftotext (version 22.02.0) gives me "BCD" too.
> The renders that allow me to copy the text give me "BCD" text.
> It seems that PDFBox extraction tool follows the item "9.10.2 Mapping 
> character codes to Unicode values" (ISO 32000-2:2020) but all the others 
> choose a different way.
>  Could you help me to understand if there is a problem with the PDF file, 
> with the renders or with the extract text tool? 
> Thank you!
> {quote}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-5790) Don't use a predefined CMap if a ToUnicode CMap is present

2024-03-25 Thread Jira


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-5790:
---
Attachment: p4_fix.pdf

> Don't use a predefined CMap if a ToUnicode CMap is present
> --
>
> Key: PDFBOX-5790
> URL: https://issues.apache.org/jira/browse/PDFBOX-5790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Text extraction
>Affects Versions: 2.0.31, 4.0.0, 3.0.3 PDFBox
>Reporter: Andreas Lehmkühler
>Assignee: Andreas Lehmkühler
>Priority: Major
> Attachments: p4_fix.pdf
>
>
> The user Luiz Marcelo Modesto reported an issue with the text extraction of 
> the attached pdf
> {quote}
> Hi everyone,
> I'm not sure if this is the same as FAQ "How come I am getting 
> gibberish(G38G43G36G51G5) when extracting text?"...
> I'm using PDFBox version 3.0.1 and OpenJDK Runtime Environment (build 
> 11.0.22+7-post-Ubuntu-0ubuntu222.04.1).
> I'm trying to understand how this PDF chunk (from p4_fix.pdf attached)
>   BT
>   /G1F7 6.0 Tf
>   94.871 773.806 Td
>   <004200430044> Tj
>   ET
> becomes "BCD" on PDFBox Debugger (the same on qpdfview, Adobe Reader, 
> Chrome, ...) and becomes "abc" on PDFBox text extraction tool. 
> Using the Poppler pdftotext (version 22.02.0) gives me "BCD" too.
> The renders that allow me to copy the text give me "BCD" text.
> It seems that PDFBox extraction tool follows the item "9.10.2 Mapping 
> character codes to Unicode values" (ISO 32000-2:2020) but all the others 
> choose a different way.
>  Could you help me to understand if there is a problem with the PDF file, 
> with the renders or with the extract text tool? 
> Thank you!
> {quote}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Created] (PDFBOX-5790) Don't use a predefined CMap if a ToUnicode CMap is present

2024-03-25 Thread Jira
Andreas Lehmkühler created PDFBOX-5790:
--

 Summary: Don't use a predefined CMap if a ToUnicode CMap is present
 Key: PDFBOX-5790
 URL: https://issues.apache.org/jira/browse/PDFBOX-5790
 Project: PDFBox
  Issue Type: Bug
  Components: Text extraction
Affects Versions: 2.0.31, 4.0.0, 3.0.3 PDFBox
Reporter: Andreas Lehmkühler
Assignee: Andreas Lehmkühler
 Attachments: p4_fix.pdf

The user Luiz Marcelo Modesto reported an issue with the text extraction of the 
attached pdf

{quote}
Hi everyone,

I'm not sure if this is the same as FAQ "How come I am getting 
gibberish(G38G43G36G51G5) when extracting text?"...

I'm using PDFBox version 3.0.1 and OpenJDK Runtime Environment (build 
11.0.22+7-post-Ubuntu-0ubuntu222.04.1).

I'm trying to understand how this PDF chunk (from p4_fix.pdf attached)

  BT
  /G1F7 6.0 Tf
  94.871 773.806 Td
  <004200430044> Tj
  ET

becomes "BCD" on PDFBox Debugger (the same on qpdfview, Adobe Reader, 
Chrome, ...) and becomes "abc" on PDFBox text extraction tool. 

Using the Poppler pdftotext (version 22.02.0) gives me "BCD" too.

The renders that allow me to copy the text give me "BCD" text.

It seems that PDFBox extraction tool follows the item "9.10.2 Mapping 
character codes to Unicode values" (ISO 32000-2:2020) but all the others choose 
a different way.

 Could you help me to understand if there is a problem with the PDF file, 
with the renders or with the extract text tool? 

Thank you!
{quote}





--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org