[ https://issues.apache.org/jira/browse/PDFBOX-1138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Brad Reynolds updated PDFBOX-1138: ---------------------------------- Priority: Blocker Description: Text extraction fails for some PDFs under the following circumstances: - page is in landscape format - setSortByPosition is true was: [imported from SourceForge] http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1410876 Originally submitted by lenuweit on 2006-01-20 07:22. Text extraction fails for some PDFs (see attached one generated by PS printer/Ghostscript) under the following circumstances: - page is in landscape format - setSortByPosition is true Extraction works fine if page is in portrait format. [attachment on SourceForge] http://sourceforge.net/tracker/download.php?group_id=78314&atid=552832&aid=1410876&file_id=164169 testpdfbox.pdf (application/pdf), 5442 bytes sample PDF (1 page in landscape) Affects Version/s: 1.6.0 This appears to have been reported before a long time ago but it was attached as a duplicate to a bug that has been resolved. I'm seeing this problem with 1.6. > CLONE - Text extraction fails for pages in landscape format > ----------------------------------------------------------- > > Key: PDFBOX-1138 > URL: https://issues.apache.org/jira/browse/PDFBOX-1138 > Project: PDFBox > Issue Type: Bug > Components: Text extraction > Affects Versions: 1.6.0 > Reporter: Brad Reynolds > Priority: Blocker > > Text extraction fails for some PDFs under the > following circumstances: > - page is in landscape format > - setSortByPosition is true -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira