Tilman Hausherr created PDFBOX-5822:
---------------------------------------
Summary: IllegalArgumentException: Parameter must be 1-based, but is 0 when using PDFTextStripperByArea
Key: PDFBOX-5822
URL: https://issues.apache.org/jira/browse/PDFBOX-5822
Project: PDFBox
Issue Type: Bug
Components: Text extraction
Reporter: Tilman Hausherr
As reported by Pascal Schumacher on the users mailing list:
https://lists.apache.org/thread/yb42j9s5vp8jsjog9msplbc05y1xqwv3
java.lang.IllegalArgumentException: Parameter must be 1-based, but is 0
    at org.apache.pdfbox.text.PDFTextStripper.setStartPage(PDFTextStripper.java:956)
    at org.apache.pdfbox.text.PDFTextStripperByArea.extractRegions(PDFTextStripperByArea.java:117)
This is caused by an earlier, seemingly "harmless" commit:
https://github.com/apache/pdfbox/commit/5c0abf94367c12c9ac0b464046784d456ce4caf5
It broke PDFTextStripperByArea, which has two calls that pass 0 as the parameter.
This wasn't discovered because we have no tests for PDFTextStripperByArea 😬
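
To illustrate the failure mode, here is a minimal, self-contained sketch. It is not the PDFBox source: the class OneBasedPageCheck and the method callWithZero are hypothetical stand-ins, and the check inside setStartPage is only inferred from the exception message above. It shows how a setter that insists on 1-based values rejects a caller that passes 0.

```java
// Simplified stand-in, NOT PDFBox code: models a setter that requires a
// 1-based page number and a caller that passes 0.
class OneBasedPageCheck {
    private int startPage = 1;

    // Assumed validation, inferred from the exception message in the report.
    void setStartPage(int startPageValue) {
        if (startPageValue < 1) {
            throw new IllegalArgumentException(
                "Parameter must be 1-based, but is " + startPageValue);
        }
        startPage = startPageValue;
    }

    int getStartPage() {
        return startPage;
    }

    // Hypothetical caller mirroring the failing pattern: the argument is 0.
    static String callWithZero() {
        OneBasedPageCheck stripper = new OneBasedPageCheck();
        try {
            stripper.setStartPage(0);
            return "no exception";
        } catch (IllegalArgumentException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) {
        // Prints: Parameter must be 1-based, but is 0
        System.out.println(callWithZero());
    }
}
```

A regression test for PDFTextStripperByArea along these lines (run an extraction and assert that no IllegalArgumentException is thrown) would probably have caught the problem when the commit went in.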