Of course, Java 1.6 is the current version of Java. How many people actually need to use 1.5?
On Wed, Feb 16, 2011 at 12:04 PM, Lars Torunski (JIRA) <j...@apache.org> wrote:

>
> [ https://issues.apache.org/jira/browse/PDFBOX-956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12995491#comment-12995491 ]
>
> Lars Torunski commented on PDFBOX-956:
> --------------------------------------
>
> Using NavigableMap in the patch will result in the dependency to Java 6.
>
> > Poor text extraction performance in PDFTextStripper.java
> > --------------------------------------------------------
> >
> >                 Key: PDFBOX-956
> >                 URL: https://issues.apache.org/jira/browse/PDFBOX-956
> >             Project: PDFBox
> >          Issue Type: Improvement
> >          Components: Text extraction
> >    Affects Versions: 1.4.0
> >            Reporter: Kevin Jackson
> >            Assignee: Andreas Lehmkühler
> >             Fix For: 1.5.0
> >
> >         Attachments: PDFBOX956-c4ce2fcd_69.txt, PDFTextStripper.java.patch, c4ce2fcd_69.pdf
> >
> >
> > The worst case performance of the suppressDuplicateOverlappingText logic in processTextPosition is O(n^2).
> > The patch is to use a TreeMap to achieve O(N log N) performance.
> > The example PDF took over 2 hours to extract the text before this patch and less than 10 minute after.
> > BTW: The extracted text is also quite different compared to Adobe Reader. Not sure which is correct but for this document it doesn't matter.
>
> --
> This message is automatically generated by JIRA.
> -
> For more information on JIRA, see: http://www.atlassian.com/software/jira
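
For what it's worth, if Java 5 compatibility does turn out to matter, a TreeMap range lookup does not strictly need NavigableMap: SortedMap.subMap(from, to) has been in the JDK since 1.2. Below is a rough sketch of the idea only. It is not the attached patch and not the real PDFTextStripper code; the Glyph class, the tolerance value, and the DuplicateSuppressionSketch name are all made up for illustration, and the overlap test is deliberately simplistic.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.SortedMap;
import java.util.TreeMap;

public class DuplicateSuppressionSketch {

    // Hypothetical stand-in for a positioned character; NOT PDFBox's TextPosition.
    static final class Glyph {
        final String text;
        final float x;
        final float y;

        Glyph(String text, float x, float y) {
            this.text = text;
            this.x = x;
            this.y = y;
        }
    }

    // Assumed tolerance in page units; the real logic derives it differently.
    private final float tolerance;

    // character -> (x coordinate -> glyphs already seen at that x)
    private final Map<String, TreeMap<Float, List<Glyph>>> seen =
            new HashMap<String, TreeMap<Float, List<Glyph>>>();

    DuplicateSuppressionSketch(float tolerance) {
        this.tolerance = tolerance;
    }

    // Returns true if the glyph is new, false if it overlaps an identical one.
    boolean accept(Glyph g) {
        TreeMap<Float, List<Glyph>> byX = seen.get(g.text);
        if (byX == null) {
            byX = new TreeMap<Float, List<Glyph>>();
            seen.put(g.text, byX);
        }

        // SortedMap.subMap is half-open: [x - tolerance, x + tolerance).
        // Only glyphs in this x window are scanned, instead of all of them.
        SortedMap<Float, List<Glyph>> near =
                byX.subMap(Float.valueOf(g.x - tolerance), Float.valueOf(g.x + tolerance));
        for (List<Glyph> bucket : near.values()) {
            for (Glyph other : bucket) {
                if (Math.abs(other.y - g.y) < tolerance) {
                    return false; // same character at nearly the same spot
                }
            }
        }

        Float key = Float.valueOf(g.x);
        List<Glyph> bucket = byX.get(key);
        if (bucket == null) {
            bucket = new ArrayList<Glyph>();
            byX.put(key, bucket);
        }
        bucket.add(g);
        return true;
    }
}
```

Each lookup costs O(log n) plus the number of glyphs in the x window, so the total stays near O(N log N) as long as the windows are small. The code avoids the diamond operator and NavigableMap on purpose so that it would also compile on Java 5.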