Re: [iText-questions] Possible bug in PdfTextExtractor.GetTextFromPage [iTextSharp]

Kevin Day Sat, 04 Feb 2012 14:13:44 -0800

I believe that the bug in LocationTextExtractionStrategy.GetResultantText()
was fixed some time ago. Did you experience this problem with the latest
code in HEAD?


For reference, the line in question in SVN reads as follows
(startsWithSpace and endsWithSpace both cover the null and empty cases):

                    else if (dist > chunk.charSpaceWidth/2.0f && !startsWithSpace(chunk.text) && !endsWithSpace(lastChunk.text))
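
To make the condition above easier to follow, here is a minimal standalone
sketch of the same check. The Chunk class, the needsSpace method and the
helper bodies are hypothetical stand-ins written for illustration, not the
iText source; in particular, returning false for null or empty text is an
assumption about how those cases are "covered".

```java
// Illustrative sketch only: Chunk and needsSpace are hypothetical,
// not the classes or methods used inside iText.
final class SpaceHeuristicSketch {

    static final class Chunk {
        final String text;
        final float charSpaceWidth; // width of a space character in the chunk's font

        Chunk(String text, float charSpaceWidth) {
            this.text = text;
            this.charSpaceWidth = charSpaceWidth;
        }
    }

    // Assumption: null or empty text counts as "does not start with a space".
    static boolean startsWithSpace(String s) {
        return s != null && !s.isEmpty() && s.charAt(0) == ' ';
    }

    // Assumption: null or empty text counts as "does not end with a space".
    static boolean endsWithSpace(String s) {
        return s != null && !s.isEmpty() && s.charAt(s.length() - 1) == ' ';
    }

    // True when the gap between two chunks is wider than half a space
    // and neither side already supplies a space at the boundary.
    static boolean needsSpace(Chunk lastChunk, Chunk chunk, float dist) {
        return dist > chunk.charSpaceWidth / 2.0f
                && !startsWithSpace(chunk.text)
                && !endsWithSpace(lastChunk.text);
    }

    public static void main(String[] args) {
        Chunk hello = new Chunk("Hello", 4.0f);
        Chunk world = new Chunk("world", 4.0f);
        System.out.println(needsSpace(hello, world, 3.0f));               // wide gap
        System.out.println(needsSpace(hello, world, 1.0f));               // narrow gap
        System.out.println(needsSpace(hello, new Chunk(" x", 4.0f), 3.0f)); // space already present
        System.out.println(needsSpace(hello, new Chunk("", 4.0f), 3.0f));   // empty text does not throw
    }
}
```

The point of the null and empty guards is that the check cannot fail on a
chunk with no text, which is the kind of failure the original report
described.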



I will need to ask Paulo to look at this, as I don't know the full
implications of the uni2cid array. Maintaining an array as long as the full
Unicode set isn't practical. Increasing the array to 512 or so may address
the situation you are in right now, but this seems to me like something that
needs a more robust fix, and the whole Unicode/CID transformation is outside
of my expertise.
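
Purely as an illustration of what "more robust" could look like, and
assuming (from its name only) that uni2cid is a lookup from a Unicode code
point to a CID, a sparse map avoids picking any fixed array length. The
class and method names below are hypothetical and this is not how iText
implements it; whether it fits the real code is for Paulo to judge.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch: a sparse code point -> CID lookup instead of a
// fixed-length array. Not taken from the iText source.
final class SparseCidMapSketch {

    static final int NOT_MAPPED = -1;

    private final Map<Integer, Integer> uniToCid = new HashMap<>();

    // Record a mapping for any code point, however large.
    void put(int codePoint, int cid) {
        uniToCid.put(codePoint, cid);
    }

    // Return the CID for a code point, or NOT_MAPPED when none was recorded.
    int cidFor(int codePoint) {
        Integer cid = uniToCid.get(codePoint);
        return cid == null ? NOT_MAPPED : cid;
    }

    public static void main(String[] args) {
        SparseCidMapSketch map = new SparseCidMapSketch();
        map.put(0x20AC, 188); // arbitrary example values
        System.out.println(map.cidFor(0x20AC));
        System.out.println(map.cidFor('A'));
    }
}
```

Memory then grows with the number of characters actually mapped rather than
with the highest code point, at the cost of boxing and hashing on each
lookup.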

--
View this message in context: 
http://itext-general.2136553.n4.nabble.com/Possible-bug-in-PdfTextExtractor-GetTextFromPage-iTextSharp-tp4342445p4358006.html
Sent from the iText - General mailing list archive at Nabble.com.

