Are there any known issues with the setSortByPosition method of PDFTextStripper that cause characters to be handled incorrectly when it is enabled? With PDFBox 1.5.0, I have a PDF document whose text is returned correctly with positional sorting disabled, but enabling sorting makes the text output incorrect. Curiously, the same document is sorted correctly by PDFBox 1.3.1. I also tested the trunk code with sorting enabled, and the result was even worse.
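
To help narrow down where the two outputs diverge, here is a rough sketch of a helper that reports the first line at which two extracted texts differ. It uses only the Java standard library and does not call PDFBox itself. The class name SortedOutputDiff and the method firstDifferingLine are hypothetical names made up for this illustration. The assumption is that the two strings come from running PDFTextStripper's getText on the same document, once with setSortByPosition(false) and once with setSortByPosition(true).

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper for illustration only; not part of PDFBox.
final class SortedOutputDiff {
    private SortedOutputDiff() {}

    /**
     * Returns the 1-based number of the first line at which the two texts
     * differ, or -1 if they are identical line for line.
     * Assumption: "unsorted" and "sorted" are the texts extracted with
     * positional sorting disabled and enabled, respectively.
     */
    static int firstDifferingLine(String unsorted, String sorted) {
        // Split on either Unix or Windows line endings, keeping trailing empty lines.
        List<String> a = Arrays.asList(unsorted.split("\\r?\\n", -1));
        List<String> b = Arrays.asList(sorted.split("\\r?\\n", -1));

        int shared = Math.min(a.size(), b.size());
        for (int i = 0; i < shared; i++) {
            if (!a.get(i).equals(b.get(i))) {
                return i + 1;
            }
        }
        // All shared lines match; a difference remains only if one text is longer.
        return a.size() == b.size() ? -1 : shared + 1;
    }
}
```

Running this on the 1.3.1, 1.5.0, and trunk outputs for the same document should show whether the incorrect characters first appear at the same place in each version.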
