[jira] [Created] (PDFBOX-1447) wasted work in PDFMarkedContentExtractor.processTextPosition()

Adrian Nistor (JIRA) Thu, 15 Nov 2012 10:50:14 -0800

Adrian Nistor created PDFBOX-1447:
-------------------------------------

             Summary: wasted work in 
PDFMarkedContentExtractor.processTextPosition()
                 Key: PDFBOX-1447
                 URL: https://issues.apache.org/jira/browse/PDFBOX-1447
             Project: PDFBox
          Issue Type: Bug
    Affects Versions: 1.7.1
            Reporter: Adrian Nistor



The problem appears in version 1.7.1 and in revision 1409864.  I
attached a one-line patch that fixes it.

In method "PDFMarkedContentExtractor.processTextPosition", the loop
over "sameTextCharacters" should break immediately after
"suppressCharacter" is set to "true".  All the iterations after
"suppressCharacter" is set to "true" do not perform any useful work,
at best they just set "suppressCharacter" again to "true".

Method "processTextPosition" in class "PDFTextStripper" has a similar
loop, and this loop breaks immediately after "suppressCharacter" is
set to "true", just like in the proposed patch.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (PDFBOX-1447) wasted work in PDFMarkedContentExtractor.processTextPosition()

Reply via email to