[GitHub] [pdfbox] koppor commented on pull request #102: Add build using GitHub actions

2020-12-31 Thread GitBox


koppor commented on pull request #102:
URL: https://github.com/apache/pdfbox/pull/102#issuecomment-753023028


   OK, got it. I just wanted to make an offer to replace Travis CI. I see ASF 
having the same limitations as Eclipse Foundation, which I just forgot. Sorry 
for the noise.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[GitHub] [pdfbox] koppor closed pull request #102: Add build using GitHub actions

2020-12-31 Thread GitBox


koppor closed pull request #102:
URL: https://github.com/apache/pdfbox/pull/102


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Resolved] (PDFBOX-2602) Enhance command line tools

2020-12-31 Thread Maruan Sahyoun (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-2602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maruan Sahyoun resolved PDFBOX-2602.

Resolution: Fixed

Setting to resolved for now. Let's reopen if the users don't like the new names.

> Enhance command line tools
> --
>
> Key: PDFBOX-2602
> URL: https://issues.apache.org/jira/browse/PDFBOX-2602
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 1.8.8, 2.0.0
>Reporter: Maruan Sahyoun
>Assignee: Maruan Sahyoun
>Priority: Minor
> Fix For: 3.0.0 PDFBox
>
>
> The command line tools shall be enhanced to have the same behavior across all 
> tools.
> From the discussion on the dev mailing list
> - add an -h option to print the usage
> - print the usage to System.err and use an exit code of 1 if there was an 
> invalid command line parameter
> - print messages on exceptions to System.err
> - rethrow the exception so java can handle it if it will terminate afterwards 
> anyway
> - use an exit code of 1if rethrowing doesn't make sense
> Additional input:
> https://clig.dev/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Resolved] (PDFBOX-5061) Replace jaxb date DatatypeConverter with java.time

2020-12-31 Thread Maruan Sahyoun (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maruan Sahyoun resolved PDFBOX-5061.

Resolution: Fixed

Setting to resolved. 2.0 will keep the dependency.

> Replace jaxb date DatatypeConverter with java.time
> --
>
> Key: PDFBOX-5061
> URL: https://issues.apache.org/jira/browse/PDFBOX-5061
> Project: PDFBox
>  Issue Type: Task
>Reporter: Oliver Kopp
>Assignee: Maruan Sahyoun
>Priority: Minor
> Fix For: 3.0.0 PDFBox
>
> Attachments: screenshot-1.png
>
>
> {{javax.}}-dependencies have been superseeded by jakarta dependencies.
> To be able to use Apache PDFBox in Java projects using newer JDKs, it would 
> be feasable to use the new jakarta dependencies. I think, only 
> jakarta.xml.bind is affected. [https://eclipse-ee4j.github.io/jaxb-ri/]
> See also https://issues.apache.org/jira/browse/SHIRO-750



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5061) Replace jaxb date DatatypeConverter with java.time

2020-12-31 Thread Maruan Sahyoun (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17257014#comment-17257014
 ] 

Maruan Sahyoun commented on PDFBOX-5061:


There is no possibility in Java 1.6 wo using an external dependeny mainly 
because of SimpleDateFormat not liking the colon in time offsets (which has 
been addressed in Java 1.7).

> Replace jaxb date DatatypeConverter with java.time
> --
>
> Key: PDFBOX-5061
> URL: https://issues.apache.org/jira/browse/PDFBOX-5061
> Project: PDFBox
>  Issue Type: Task
>Reporter: Oliver Kopp
>Assignee: Maruan Sahyoun
>Priority: Minor
> Fix For: 3.0.0 PDFBox
>
> Attachments: screenshot-1.png
>
>
> {{javax.}}-dependencies have been superseeded by jakarta dependencies.
> To be able to use Apache PDFBox in Java projects using newer JDKs, it would 
> be feasable to use the new jakarta dependencies. I think, only 
> jakarta.xml.bind is affected. [https://eclipse-ee4j.github.io/jaxb-ri/]
> See also https://issues.apache.org/jira/browse/SHIRO-750



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-5030) Create Migration guide for 3.0.0

2020-12-31 Thread Jira


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-5030:
---
Fix Version/s: 3.0.0 PDFBox

> Create Migration guide for 3.0.0
> 
>
> Key: PDFBOX-5030
> URL: https://issues.apache.org/jira/browse/PDFBOX-5030
> Project: PDFBox
>  Issue Type: Task
>  Components: Documentation
>Reporter: Maruan Sahyoun
>Assignee: Maruan Sahyoun
>Priority: Major
> Fix For: 3.0.0 PDFBox
>
>
> As to start educating about the migration efforts needed to get to 3.0.0 the 
> should be a migration guide (evolving over time) to prepare for the release



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Resolved] (PDFBOX-4623) COSParser: Infinite recursion

2020-12-31 Thread Jira


 [ 
https://issues.apache.org/jira/browse/PDFBOX-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler resolved PDFBOX-4623.

Resolution: Fixed

> COSParser: Infinite recursion
> -
>
> Key: PDFBOX-4623
> URL: https://issues.apache.org/jira/browse/PDFBOX-4623
> Project: PDFBox
>  Issue Type: Bug
>  Components: Parsing
>Affects Versions: 2.0.16
> Environment: java version "12" 2019-03-19
> Java(TM) SE Runtime Environment (build 12+33)
> Java HotSpot(TM) 64-Bit Server VM (build 12+33, mixed mode, sharing) 
> MacOS Mojave
>Reporter: Alex Rebert
>Assignee: Andreas Lehmkühler
>Priority: Minor
> Fix For: 3.0.0 PDFBox
>
> Attachments: infinite-recursion.pdf, loop_in_page_tree.pdf, 
> poppler-43279-0.pdf, poppler-91414-1.zip-2.gz-53.pdf
>
>
> Parsing an invalid PDF can lead to an infinite recursion in COSParser, which 
> results in a StackOverflowError.
> *Steps to repro*
>  # Download malformed PDF (attached)
>  # {{Run: java -jar pdfbox-app-2.0.16.jar ExtractText infinite-recursion.pdf}}
> *Stacktrace*
> {noformat}
> Exception in thread "main" java.lang.StackOverflowError [1005/1916]
>  at java.base/sun.nio.cs.UTF_8.updatePositions(UTF_8.java:79)
>  at java.base/sun.nio.cs.UTF_8$Decoder.xflow(UTF_8.java:210)
>  at java.base/sun.nio.cs.UTF_8$Decoder.decodeArrayLoop(UTF_8.java:321)
>  at java.base/sun.nio.cs.UTF_8$Decoder.decodeLoop(UTF_8.java:414)
>  at java.base/java.nio.charset.CharsetDecoder.decode(CharsetDecoder.java:578)
>  at java.base/java.nio.charset.CharsetDecoder.decode(CharsetDecoder.java:801)
>  at org.apache.pdfbox.pdfparser.BaseParser.isValidUTF8(BaseParser.java:787)
>  at org.apache.pdfbox.pdfparser.BaseParser.parseCOSName(BaseParser.java:768)
>  at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:887)
>  at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:154)
>  at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:283)
>  at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:216)
>  at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:867)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:912)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:920)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:920)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:920)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  ...
> {noformat}
> The file was generated by fuzzing and is (probably) not a valid PDF file.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-4623) COSParser: Infinite recursion

2020-12-31 Thread Jira


[ 
https://issues.apache.org/jira/browse/PDFBOX-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256975#comment-17256975
 ] 

Andreas Lehmkühler commented on PDFBOX-4623:


I've found a fix for  [^infinite-recursion.pdf].  [^loop_in_page_tree.pdf] and  
[^poppler-43279-0.pdf]  need a different fix therefore I've created a new 
ticket for those issues.  [^poppler-91414-1.zip-2.gz-53.pdf] doesn't trigger an 
exception at all.

All I said is true for 3.0.0 and I'm afraid there won't be any fix for the 
2.0.x-branch.

> COSParser: Infinite recursion
> -
>
> Key: PDFBOX-4623
> URL: https://issues.apache.org/jira/browse/PDFBOX-4623
> Project: PDFBox
>  Issue Type: Bug
>  Components: Parsing
>Affects Versions: 2.0.16
> Environment: java version "12" 2019-03-19
> Java(TM) SE Runtime Environment (build 12+33)
> Java HotSpot(TM) 64-Bit Server VM (build 12+33, mixed mode, sharing) 
> MacOS Mojave
>Reporter: Alex Rebert
>Assignee: Andreas Lehmkühler
>Priority: Minor
> Fix For: 3.0.0 PDFBox
>
> Attachments: infinite-recursion.pdf, loop_in_page_tree.pdf, 
> poppler-43279-0.pdf, poppler-91414-1.zip-2.gz-53.pdf
>
>
> Parsing an invalid PDF can lead to an infinite recursion in COSParser, which 
> results in a StackOverflowError.
> *Steps to repro*
>  # Download malformed PDF (attached)
>  # {{Run: java -jar pdfbox-app-2.0.16.jar ExtractText infinite-recursion.pdf}}
> *Stacktrace*
> {noformat}
> Exception in thread "main" java.lang.StackOverflowError [1005/1916]
>  at java.base/sun.nio.cs.UTF_8.updatePositions(UTF_8.java:79)
>  at java.base/sun.nio.cs.UTF_8$Decoder.xflow(UTF_8.java:210)
>  at java.base/sun.nio.cs.UTF_8$Decoder.decodeArrayLoop(UTF_8.java:321)
>  at java.base/sun.nio.cs.UTF_8$Decoder.decodeLoop(UTF_8.java:414)
>  at java.base/java.nio.charset.CharsetDecoder.decode(CharsetDecoder.java:578)
>  at java.base/java.nio.charset.CharsetDecoder.decode(CharsetDecoder.java:801)
>  at org.apache.pdfbox.pdfparser.BaseParser.isValidUTF8(BaseParser.java:787)
>  at org.apache.pdfbox.pdfparser.BaseParser.parseCOSName(BaseParser.java:768)
>  at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:887)
>  at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:154)
>  at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:283)
>  at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:216)
>  at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:867)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:912)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:920)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:920)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:920)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  ...
> {noformat}
> The file was generated by fuzzing and is (probably) not a valid PDF file.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-4623) COSParser: Infinite recursion

2020-12-31 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256974#comment-17256974
 ] 

ASF subversion and git services commented on PDFBOX-4623:
-

Commit 1884987 from le...@apache.org in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1884987 ]

PDFBOX-4623: avoid endless recursion if something went wrong when dereferencing 
the referenced object

> COSParser: Infinite recursion
> -
>
> Key: PDFBOX-4623
> URL: https://issues.apache.org/jira/browse/PDFBOX-4623
> Project: PDFBox
>  Issue Type: Bug
>  Components: Parsing
>Affects Versions: 2.0.16
> Environment: java version "12" 2019-03-19
> Java(TM) SE Runtime Environment (build 12+33)
> Java HotSpot(TM) 64-Bit Server VM (build 12+33, mixed mode, sharing) 
> MacOS Mojave
>Reporter: Alex Rebert
>Assignee: Andreas Lehmkühler
>Priority: Minor
> Fix For: 3.0.0 PDFBox
>
> Attachments: infinite-recursion.pdf, loop_in_page_tree.pdf, 
> poppler-43279-0.pdf, poppler-91414-1.zip-2.gz-53.pdf
>
>
> Parsing an invalid PDF can lead to an infinite recursion in COSParser, which 
> results in a StackOverflowError.
> *Steps to repro*
>  # Download malformed PDF (attached)
>  # {{Run: java -jar pdfbox-app-2.0.16.jar ExtractText infinite-recursion.pdf}}
> *Stacktrace*
> {noformat}
> Exception in thread "main" java.lang.StackOverflowError [1005/1916]
>  at java.base/sun.nio.cs.UTF_8.updatePositions(UTF_8.java:79)
>  at java.base/sun.nio.cs.UTF_8$Decoder.xflow(UTF_8.java:210)
>  at java.base/sun.nio.cs.UTF_8$Decoder.decodeArrayLoop(UTF_8.java:321)
>  at java.base/sun.nio.cs.UTF_8$Decoder.decodeLoop(UTF_8.java:414)
>  at java.base/java.nio.charset.CharsetDecoder.decode(CharsetDecoder.java:578)
>  at java.base/java.nio.charset.CharsetDecoder.decode(CharsetDecoder.java:801)
>  at org.apache.pdfbox.pdfparser.BaseParser.isValidUTF8(BaseParser.java:787)
>  at org.apache.pdfbox.pdfparser.BaseParser.parseCOSName(BaseParser.java:768)
>  at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:887)
>  at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:154)
>  at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:283)
>  at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:216)
>  at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:867)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:912)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:920)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:920)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:920)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  ...
> {noformat}
> The file was generated by fuzzing and is (probably) not a valid PDF file.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[GitHub] [pdfbox] THausherr closed pull request #103: Print stack trace to see why build on github / travis fail here

2020-12-31 Thread GitBox


THausherr closed pull request #103:
URL: https://github.com/apache/pdfbox/pull/103


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-5064) COSParser: Infinite recursion

2020-12-31 Thread Jira


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-5064:
---
Attachment: loop_in_page_tree.pdf
poppler-43279-0.pdf

> COSParser: Infinite recursion
> -
>
> Key: PDFBOX-5064
> URL: https://issues.apache.org/jira/browse/PDFBOX-5064
> Project: PDFBox
>  Issue Type: Bug
>  Components: Parsing
>Affects Versions: 2.0.22, 3.0.0 PDFBox
>Reporter: Andreas Lehmkühler
>Priority: Major
> Attachments: loop_in_page_tree.pdf, poppler-43279-0.pdf
>
>
> This is a follow up to PDFBOX-4623 which fixes only one of the infinite 
> recursion with the attached pdfs



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Created] (PDFBOX-5064) COSParser: Infinite recursion

2020-12-31 Thread Jira
Andreas Lehmkühler created PDFBOX-5064:
--

 Summary: COSParser: Infinite recursion
 Key: PDFBOX-5064
 URL: https://issues.apache.org/jira/browse/PDFBOX-5064
 Project: PDFBox
  Issue Type: Bug
  Components: Parsing
Affects Versions: 2.0.22, 3.0.0 PDFBox
Reporter: Andreas Lehmkühler


This is a follow up to PDFBOX-4623 which fixes only one of the infinite 
recursion with the attached pdfs



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Resolved] (PDFBOX-5063) testCreateCheckBox fails on travis / github

2020-12-31 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr resolved PDFBOX-5063.
-
Resolution: Fixed

> testCreateCheckBox fails on travis / github
> ---
>
> Key: PDFBOX-5063
> URL: https://issues.apache.org/jira/browse/PDFBOX-5063
> Project: PDFBox
>  Issue Type: Bug
>  Components: AcroForm
>Affects Versions: 2.0.22
>Reporter: Tilman Hausherr
>Assignee: Tilman Hausherr
>Priority: Major
> Fix For: 2.0.23, 3.0.0 PDFBox
>
>
> testCreateCheckBox fails in the travis builds on github. The cause is that 
> the ZapfDingbats font doesn't exist there. But we have the AFM resource file, 
> so we can use that one.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5063) testCreateCheckBox fails on travis / github

2020-12-31 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256971#comment-17256971
 ] 

ASF subversion and git services commented on PDFBOX-5063:
-

Commit 1884986 from Tilman Hausherr in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1884986 ]

PDFBOX-5063: use AFM resource if the ZapfDingbats font isn't available; closes 
#103

> testCreateCheckBox fails on travis / github
> ---
>
> Key: PDFBOX-5063
> URL: https://issues.apache.org/jira/browse/PDFBOX-5063
> Project: PDFBox
>  Issue Type: Bug
>  Components: AcroForm
>Affects Versions: 2.0.22
>Reporter: Tilman Hausherr
>Assignee: Tilman Hausherr
>Priority: Major
> Fix For: 2.0.23, 3.0.0 PDFBox
>
>
> testCreateCheckBox fails in the travis builds on github. The cause is that 
> the ZapfDingbats font doesn't exist there. But we have the AFM resource file, 
> so we can use that one.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5063) testCreateCheckBox fails on travis / github

2020-12-31 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256970#comment-17256970
 ] 

ASF subversion and git services commented on PDFBOX-5063:
-

Commit 1884985 from Tilman Hausherr in branch 'pdfbox/branches/2.0'
[ https://svn.apache.org/r1884985 ]

PDFBOX-5063: use AFM resource if the ZapfDingbats font isn't available; closes 
#103

> testCreateCheckBox fails on travis / github
> ---
>
> Key: PDFBOX-5063
> URL: https://issues.apache.org/jira/browse/PDFBOX-5063
> Project: PDFBox
>  Issue Type: Bug
>  Components: AcroForm
>Affects Versions: 2.0.22
>Reporter: Tilman Hausherr
>Assignee: Tilman Hausherr
>Priority: Major
> Fix For: 2.0.23, 3.0.0 PDFBox
>
>
> testCreateCheckBox fails in the travis builds on github. The cause is that 
> the ZapfDingbats font doesn't exist there. But we have the AFM resource file, 
> so we can use that one.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Created] (PDFBOX-5063) testCreateCheckBox fails on travis / github

2020-12-31 Thread Tilman Hausherr (Jira)
Tilman Hausherr created PDFBOX-5063:
---

 Summary: testCreateCheckBox fails on travis / github
 Key: PDFBOX-5063
 URL: https://issues.apache.org/jira/browse/PDFBOX-5063
 Project: PDFBox
  Issue Type: Bug
  Components: AcroForm
Affects Versions: 2.0.22
Reporter: Tilman Hausherr
Assignee: Tilman Hausherr
 Fix For: 2.0.23, 3.0.0 PDFBox


testCreateCheckBox fails in the travis builds on github. The cause is that the 
ZapfDingbats font doesn't exist there. But we have the AFM resource file, so we 
can use that one.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-4623) COSParser: Infinite recursion

2020-12-31 Thread Jira


 [ 
https://issues.apache.org/jira/browse/PDFBOX-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-4623:
---
Fix Version/s: 3.0.0 PDFBox

> COSParser: Infinite recursion
> -
>
> Key: PDFBOX-4623
> URL: https://issues.apache.org/jira/browse/PDFBOX-4623
> Project: PDFBox
>  Issue Type: Bug
>  Components: Parsing
>Affects Versions: 2.0.16
> Environment: java version "12" 2019-03-19
> Java(TM) SE Runtime Environment (build 12+33)
> Java HotSpot(TM) 64-Bit Server VM (build 12+33, mixed mode, sharing) 
> MacOS Mojave
>Reporter: Alex Rebert
>Assignee: Andreas Lehmkühler
>Priority: Minor
> Fix For: 3.0.0 PDFBox
>
> Attachments: infinite-recursion.pdf, loop_in_page_tree.pdf, 
> poppler-43279-0.pdf, poppler-91414-1.zip-2.gz-53.pdf
>
>
> Parsing an invalid PDF can lead to an infinite recursion in COSParser, which 
> results in a StackOverflowError.
> *Steps to repro*
>  # Download malformed PDF (attached)
>  # {{Run: java -jar pdfbox-app-2.0.16.jar ExtractText infinite-recursion.pdf}}
> *Stacktrace*
> {noformat}
> Exception in thread "main" java.lang.StackOverflowError [1005/1916]
>  at java.base/sun.nio.cs.UTF_8.updatePositions(UTF_8.java:79)
>  at java.base/sun.nio.cs.UTF_8$Decoder.xflow(UTF_8.java:210)
>  at java.base/sun.nio.cs.UTF_8$Decoder.decodeArrayLoop(UTF_8.java:321)
>  at java.base/sun.nio.cs.UTF_8$Decoder.decodeLoop(UTF_8.java:414)
>  at java.base/java.nio.charset.CharsetDecoder.decode(CharsetDecoder.java:578)
>  at java.base/java.nio.charset.CharsetDecoder.decode(CharsetDecoder.java:801)
>  at org.apache.pdfbox.pdfparser.BaseParser.isValidUTF8(BaseParser.java:787)
>  at org.apache.pdfbox.pdfparser.BaseParser.parseCOSName(BaseParser.java:768)
>  at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:887)
>  at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:154)
>  at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:283)
>  at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:216)
>  at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:867)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:912)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:920)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:920)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:920)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  ...
> {noformat}
> The file was generated by fuzzing and is (probably) not a valid PDF file.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Assigned] (PDFBOX-4623) COSParser: Infinite recursion

2020-12-31 Thread Jira


 [ 
https://issues.apache.org/jira/browse/PDFBOX-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler reassigned PDFBOX-4623:
--

Assignee: Andreas Lehmkühler

> COSParser: Infinite recursion
> -
>
> Key: PDFBOX-4623
> URL: https://issues.apache.org/jira/browse/PDFBOX-4623
> Project: PDFBox
>  Issue Type: Bug
>  Components: Parsing
>Affects Versions: 2.0.16
> Environment: java version "12" 2019-03-19
> Java(TM) SE Runtime Environment (build 12+33)
> Java HotSpot(TM) 64-Bit Server VM (build 12+33, mixed mode, sharing) 
> MacOS Mojave
>Reporter: Alex Rebert
>Assignee: Andreas Lehmkühler
>Priority: Minor
> Attachments: infinite-recursion.pdf, loop_in_page_tree.pdf, 
> poppler-43279-0.pdf, poppler-91414-1.zip-2.gz-53.pdf
>
>
> Parsing an invalid PDF can lead to an infinite recursion in COSParser, which 
> results in a StackOverflowError.
> *Steps to repro*
>  # Download malformed PDF (attached)
>  # {{Run: java -jar pdfbox-app-2.0.16.jar ExtractText infinite-recursion.pdf}}
> *Stacktrace*
> {noformat}
> Exception in thread "main" java.lang.StackOverflowError [1005/1916]
>  at java.base/sun.nio.cs.UTF_8.updatePositions(UTF_8.java:79)
>  at java.base/sun.nio.cs.UTF_8$Decoder.xflow(UTF_8.java:210)
>  at java.base/sun.nio.cs.UTF_8$Decoder.decodeArrayLoop(UTF_8.java:321)
>  at java.base/sun.nio.cs.UTF_8$Decoder.decodeLoop(UTF_8.java:414)
>  at java.base/java.nio.charset.CharsetDecoder.decode(CharsetDecoder.java:578)
>  at java.base/java.nio.charset.CharsetDecoder.decode(CharsetDecoder.java:801)
>  at org.apache.pdfbox.pdfparser.BaseParser.isValidUTF8(BaseParser.java:787)
>  at org.apache.pdfbox.pdfparser.BaseParser.parseCOSName(BaseParser.java:768)
>  at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:887)
>  at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:154)
>  at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:283)
>  at 
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:216)
>  at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:867)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:912)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:920)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:920)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:920)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:881)
>  at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:801)
>  at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:1055)
>  at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:1114)
>  ...
> {noformat}
> The file was generated by fuzzing and is (probably) not a valid PDF file.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-4892) Improve code quality (4)

2020-12-31 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256967#comment-17256967
 ] 

ASF subversion and git services commented on PDFBOX-4892:
-

Commit 1884984 from le...@apache.org in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1884984 ]

PDFBOX-4892: overwrite temporary files from a failed test run that failed

> Improve code quality (4)
> 
>
> Key: PDFBOX-4892
> URL: https://issues.apache.org/jira/browse/PDFBOX-4892
> Project: PDFBox
>  Issue Type: Improvement
>Affects Versions: 2.0.20
>Reporter: Tilman Hausherr
>Priority: Minor
>
> This is a longterm issue for the task to improve code quality, by using the 
> [SonarQube report|https://sonarcloud.io/project/issues?id=pdfbox-reactor], 
> hints in different IDEs, the FindBugs tool and other code quality tools.
> This is a follow-up of PDFBOX-4071, which was getting too long.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[GitHub] [pdfbox] asfgit closed pull request #93: Updated comment when rendered image is too large

2020-12-31 Thread GitBox


asfgit closed pull request #93:
URL: https://github.com/apache/pdfbox/pull/93


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5061) Replace jaxb date DatatypeConverter with java.time

2020-12-31 Thread Jira


[ 
https://issues.apache.org/jira/browse/PDFBOX-5061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256960#comment-17256960
 ] 

Andreas Lehmkühler commented on PDFBOX-5061:


[~tilman] Thanks for the clarification. Please forget my proposal

> Replace jaxb date DatatypeConverter with java.time
> --
>
> Key: PDFBOX-5061
> URL: https://issues.apache.org/jira/browse/PDFBOX-5061
> Project: PDFBox
>  Issue Type: Task
>Reporter: Oliver Kopp
>Assignee: Maruan Sahyoun
>Priority: Minor
> Fix For: 3.0.0 PDFBox
>
> Attachments: screenshot-1.png
>
>
> {{javax.}}-dependencies have been superseeded by jakarta dependencies.
> To be able to use Apache PDFBox in Java projects using newer JDKs, it would 
> be feasable to use the new jakarta dependencies. I think, only 
> jakarta.xml.bind is affected. [https://eclipse-ee4j.github.io/jaxb-ri/]
> See also https://issues.apache.org/jira/browse/SHIRO-750



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-2602) Enhance command line tools

2020-12-31 Thread Tilman Hausherr (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-2602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256954#comment-17256954
 ] 

Tilman Hausherr commented on PDFBOX-2602:
-

OK, keep the new ones for now, unless many users complain.

> Enhance command line tools
> --
>
> Key: PDFBOX-2602
> URL: https://issues.apache.org/jira/browse/PDFBOX-2602
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 1.8.8, 2.0.0
>Reporter: Maruan Sahyoun
>Assignee: Maruan Sahyoun
>Priority: Minor
> Fix For: 3.0.0 PDFBox
>
>
> The command line tools shall be enhanced to have the same behavior across all 
> tools.
> From the discussion on the dev mailing list
> - add an -h option to print the usage
> - print the usage to System.err and use an exit code of 1 if there was an 
> invalid command line parameter
> - print messages on exceptions to System.err
> - rethrow the exception so java can handle it if it will terminate afterwards 
> anyway
> - use an exit code of 1if rethrowing doesn't make sense
> Additional input:
> https://clig.dev/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5061) Replace jaxb date DatatypeConverter with java.time

2020-12-31 Thread Tilman Hausherr (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256953#comment-17256953
 ] 

Tilman Hausherr commented on PDFBOX-5061:
-

DateTimeFormatter isn't available below jdk8

> Replace jaxb date DatatypeConverter with java.time
> --
>
> Key: PDFBOX-5061
> URL: https://issues.apache.org/jira/browse/PDFBOX-5061
> Project: PDFBox
>  Issue Type: Task
>Reporter: Oliver Kopp
>Assignee: Maruan Sahyoun
>Priority: Minor
> Fix For: 3.0.0 PDFBox
>
> Attachments: screenshot-1.png
>
>
> {{javax.}}-dependencies have been superseeded by jakarta dependencies.
> To be able to use Apache PDFBox in Java projects using newer JDKs, it would 
> be feasable to use the new jakarta dependencies. I think, only 
> jakarta.xml.bind is affected. [https://eclipse-ee4j.github.io/jaxb-ri/]
> See also https://issues.apache.org/jira/browse/SHIRO-750



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-3066) Text extraction garbled in this file, was OK in 1.8

2020-12-31 Thread Jira


 [ 
https://issues.apache.org/jira/browse/PDFBOX-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-3066:
---
Fix Version/s: (was: 3.0.0 PDFBox)

> Text extraction garbled in this file, was OK in 1.8
> ---
>
> Key: PDFBOX-3066
> URL: https://issues.apache.org/jira/browse/PDFBOX-3066
> Project: PDFBox
>  Issue Type: Bug
>  Components: Text extraction
>Affects Versions: 2.0.0
>Reporter: Joel Hirsh
>Priority: Major
> Attachments: 936918.pdf, PDFBOX-3066-reduced.pdf, garbled.pdf
>
>
> Attached file, PrintTextLocations shows text garbled, like *,%-))’)) 
> Acrobat copy/paste shows accurate text, and was also fine in 1.8.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-2138) Corrupted words when using PDFTextStripper

2020-12-31 Thread Jira


[ 
https://issues.apache.org/jira/browse/PDFBOX-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256952#comment-17256952
 ] 

Andreas Lehmkühler commented on PDFBOX-2138:


Looks like we have to implement something like Adobe (Save As and Copy) 
to reproduce different behaviours/extraction results

> Corrupted words when using PDFTextStripper
> --
>
> Key: PDFBOX-2138
> URL: https://issues.apache.org/jira/browse/PDFBOX-2138
> Project: PDFBox
>  Issue Type: Bug
>  Components: Text extraction
>Affects Versions: 1.8.5, 1.8.6, 2.0.0
> Environment: Windows 7 / 64 bit
>Reporter: Walter Kehl
>Priority: Major
> Fix For: 4.0.0
>
> Attachments: PDFBOX-2138-noClip.pdf, PDFBOX-2138-noClip.png, 
> PDFBOX-2138.pdf, PDFBOX-2138.txt, banking-banana-skins-2014.pdf, 
> banking-banana-skins-2014.txt
>
>
> >> I am using PDFTextStripper (embedded into another application) to get 
> >> the raw text of PDFs so far with good results but recently a PDF file 
> >> has appeared where the output of the PDFTextStripper was corrupted. I 
> >> got sentences like:
> >>
> >>
> >>
> >> "There is al o con ern that b nkers may be pushed to misprice risk 
> >> (No. 6) by the pres ures of c mpetition and an abunda ce of central b 
> >> nk-provided liquidity."
> > Additionally some portions of text appear 
> > twice in the output: first correctly and then corrupted. I have 
> > attached an output created with PDFBox's command line options.
> > If you compare lines 357- 365 with lines 421-429 you see that it is 
> > the same paragraph, first ok and then with characters missing. In the 
> > original source this paragraph is unique.
> > The same seems to happen for the other instances where text is corrupted.
> I also tried it directly on the command line with the same results: input and 
> output files are attached.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[GitHub] [pdfbox] THausherr commented on pull request #103: Print stack trace to see why build on github / travis fail here

2020-12-31 Thread GitBox


THausherr commented on pull request #103:
URL: https://github.com/apache/pdfbox/pull/103#issuecomment-752918965


   It's the missing ZapfDingbats font. I'll see whether it's possible to use 
AFM instead. If not, then I'll abort with a more useful error message.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5061) Replace jaxb date DatatypeConverter with java.time

2020-12-31 Thread Jira


[ 
https://issues.apache.org/jira/browse/PDFBOX-5061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256932#comment-17256932
 ] 

Andreas Lehmkühler commented on PDFBOX-5061:


How about committing it to the 2.0 branch as well? It won't remove all javax 
dependencies but will reduce them and it isn't a breaking change.

> Replace jaxb date DatatypeConverter with java.time
> --
>
> Key: PDFBOX-5061
> URL: https://issues.apache.org/jira/browse/PDFBOX-5061
> Project: PDFBox
>  Issue Type: Task
>Reporter: Oliver Kopp
>Assignee: Maruan Sahyoun
>Priority: Minor
> Fix For: 3.0.0 PDFBox
>
> Attachments: screenshot-1.png
>
>
> {{javax.}}-dependencies have been superseeded by jakarta dependencies.
> To be able to use Apache PDFBox in Java projects using newer JDKs, it would 
> be feasable to use the new jakarta dependencies. I think, only 
> jakarta.xml.bind is affected. [https://eclipse-ee4j.github.io/jaxb-ri/]
> See also https://issues.apache.org/jira/browse/SHIRO-750



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[GitHub] [pdfbox] THausherr opened a new pull request #103: Print stack trace to see why build on github / travis fail here

2020-12-31 Thread GitBox


THausherr opened a new pull request #103:
URL: https://github.com/apache/pdfbox/pull/103


   This is just to test on github. I'll create a JIRA issue if needed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-4892) Improve code quality (4)

2020-12-31 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256881#comment-17256881
 ] 

ASF subversion and git services commented on PDFBOX-4892:
-

Commit 1884981 from Tilman Hausherr in branch 'pdfbox/branches/2.0'
[ https://svn.apache.org/r1884981 ]

PDFBOX-4892: correct IOException message, as suggested by IntraCherche; closes 
#93

> Improve code quality (4)
> 
>
> Key: PDFBOX-4892
> URL: https://issues.apache.org/jira/browse/PDFBOX-4892
> Project: PDFBox
>  Issue Type: Improvement
>Affects Versions: 2.0.20
>Reporter: Tilman Hausherr
>Priority: Minor
>
> This is a longterm issue for the task to improve code quality, by using the 
> [SonarQube report|https://sonarcloud.io/project/issues?id=pdfbox-reactor], 
> hints in different IDEs, the FindBugs tool and other code quality tools.
> This is a follow-up of PDFBOX-4071, which was getting too long.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-4892) Improve code quality (4)

2020-12-31 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256882#comment-17256882
 ] 

ASF subversion and git services commented on PDFBOX-4892:
-

Commit 1884982 from Tilman Hausherr in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1884982 ]

PDFBOX-4892: correct IOException message, as suggested by IntraCherche; closes 
#93

> Improve code quality (4)
> 
>
> Key: PDFBOX-4892
> URL: https://issues.apache.org/jira/browse/PDFBOX-4892
> Project: PDFBox
>  Issue Type: Improvement
>Affects Versions: 2.0.20
>Reporter: Tilman Hausherr
>Priority: Minor
>
> This is a longterm issue for the task to improve code quality, by using the 
> [SonarQube report|https://sonarcloud.io/project/issues?id=pdfbox-reactor], 
> hints in different IDEs, the FindBugs tool and other code quality tools.
> This is a follow-up of PDFBOX-4071, which was getting too long.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[GitHub] [pdfbox] THausherr commented on pull request #93: Updated comment when rendered image is too large

2020-12-31 Thread GitBox


THausherr commented on pull request #93:
URL: https://github.com/apache/pdfbox/pull/93#issuecomment-752884006


   Of course 
   Good observation!



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org