I agree, and Java 1.5 has indeed passed its end of life, but a lot of 
people are still using it.  For example, the company I work for hasn't 
upgraded to 1.6 yet.
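
As a rough sketch only: if the patch needs nothing more than a range lookup 
around a position, the SortedMap and SortedSet views that TreeMap and TreeSet 
already offer in Java 1.5 (subMap, subSet) might be enough, without 
NavigableMap. The class and method names below (OverlapIndex, addIfNew) and 
the x/y/tolerance layout are hypothetical illustrations, not the code in the 
attached patch or in PDFTextStripper.

```java
import java.util.SortedMap;
import java.util.SortedSet;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical helper, not part of PDFBox: remembers glyph positions and
// reports whether a new position overlaps one that was already recorded.
public class OverlapIndex {
    // x coordinate -> sorted y coordinates already seen at that x
    private final TreeMap<Float, TreeSet<Float>> byX =
            new TreeMap<Float, TreeSet<Float>>();

    // Returns true and records (x, y) if no earlier position lies within
    // tol of it in both x and y; returns false otherwise.
    // Assumes tol > 0. Both views are half-open: [value - tol, value + tol).
    public boolean addIfNew(float x, float y, float tol) {
        // Java 1.5 API: SortedMap.subMap, no NavigableMap required
        SortedMap<Float, TreeSet<Float>> nearX = byX.subMap(x - tol, x + tol);
        for (TreeSet<Float> ys : nearX.values()) {
            SortedSet<Float> nearY = ys.subSet(y - tol, y + tol);
            if (!nearY.isEmpty()) {
                return false; // an earlier glyph overlaps this position
            }
        }
        TreeSet<Float> ys = byX.get(x);
        if (ys == null) {
            ys = new TreeSet<Float>();
            byX.put(x, ys);
        }
        ys.add(y);
        return true;
    }
}
```

Each range view is located in O(log n); the overall cost still depends on how 
many entries fall inside the tolerance window, so this only illustrates the 
Java 1.5-compatible API, not the measured speedup reported in the issue.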

---- 
Thanks,
Adam





From: Ted Dunning <ted.dunn...@gmail.com>
To: dev@pdfbox.apache.org
Date: 02/16/2011 12:13
Subject: Re: [jira] Commented: (PDFBOX-956) Poor text extraction performance in PDFTextStripper.java



Of course, Java 1.6 is the current version of Java.

How many people actually need to use 1.5?

On Wed, Feb 16, 2011 at 12:04 PM, Lars Torunski (JIRA) <j...@apache.org> wrote:

>
>    [ https://issues.apache.org/jira/browse/PDFBOX-956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12995491#comment-12995491 ]
>
> Lars Torunski commented on PDFBOX-956:
> --------------------------------------
>
> Using NavigableMap in the patch will result in a dependency on Java 6.
>
> > Poor text extraction performance in PDFTextStripper.java
> > --------------------------------------------------------
> >
> >                 Key: PDFBOX-956
> >                 URL: https://issues.apache.org/jira/browse/PDFBOX-956
> >             Project: PDFBox
> >          Issue Type: Improvement
> >          Components: Text extraction
> >    Affects Versions: 1.4.0
> >            Reporter: Kevin Jackson
> >            Assignee: Andreas Lehmkühler
> >             Fix For: 1.5.0
> >
> >         Attachments: PDFBOX956-c4ce2fcd_69.txt, PDFTextStripper.java.patch, c4ce2fcd_69.pdf
> >
> >
> > The worst case performance of the suppressDuplicateOverlappingText logic in processTextPosition is O(n^2).
> > The patch is to use a TreeMap to achieve O(n log n) performance.
> > The example PDF took over 2 hours to extract the text before this patch and less than 10 minutes after.
> > BTW:  The extracted text is also quite different compared to Adobe Reader.  Not sure which is correct but for this document it doesn't matter.
>





