Re: detection of column breaks and page breaks in PDF document

2025-05-29 Thread Andreas Lehmkühler
Am 23.05.25 um 19:19 schrieb Tilman Hausherr: On 23.05.2025 17:01, Robert Rodini wrote: This question is informational.  I use PDFBox utilities to extract text from a large PDF file.  The pages of the PDF always contain a three-column format. PDF Box CLI utility is wonderful since it proces

Re: detection of column breaks and page breaks in PDF document

2025-05-23 Thread Tilman Hausherr
On 23.05.2025 17:01, Robert Rodini wrote: This question is informational. I use PDFBox utilities to extract text from a large PDF file. The pages of the PDF always contain a three-column format. PDF Box CLI utility is wonderful since it processes the columns from top to bottom and left to ri