Re: [iText-questions] Text box of non-BMP character with zero width

2013-04-01 Thread wwkloo
I see. Thanks a lot. Regards, wwkloo -- View this message in context: http://itext-general.2136553.n4.nabble.com/Text-box-of-non-BMP-character-with-zero-width-tp4657896p4657971.html Sent from the iText - General mailing list archive at Nabble.com

[iText-questions] Problem is check underline/overstrike status of a character in PDF

2013-03-25 Thread wwkloo
Dear all, I am trying to check the underline and overstrike status of characters in a PDF. I heard that underline and overstrike are achieved by graphical objects in PDF. So I tried to detect the rectangle and line objects first. Based on their position and the bounding box position of characters

Re: [iText-questions] Text box of non-BMP character with zero width

2013-03-25 Thread wwkloo
Additional information: - I run the code in iTextSharp 5.4.0 for getting the bounding box. - If you want to get the character code of the first character properly, need to use the svn code. Thanks. Regards, wwkloo -- View this message in context: http://itext-general.2136553.n4.nabble.com

[iText-questions] Text box of non-BMP character with zero width

2013-03-25 Thread wwkloo
displayed properly in the PDF. Please help! Regards, wwkloo iTextExtract_W.pdf <http://itext-general.2136553.n4.nabble.com/file/n4657896/iTextExtract_W.pdf> ===BEGIN CODE SEGMENT=== public class TestExtractionStategy : iTextSharp.text.pdf.parser.ITextExtractionStrategy { p

Re: [iText-questions] [SPAM] Re: Differences btw text extraction from iText and Acrobat Reader?

2013-03-20 Thread wwkloo
Paulo Soares-4 wrote > Fixed in the SVN. > > Paulo Got the SVN code and compiled a new DLL. The output now is = 31 00 20 00 41 D8 47 DD 20 00 0A 00 32 00 20 00 DB 92 20 00 = Both characters are extracted correctly. Thanks a lot! Thank you all! Regards, wwkloo -- View this m

Re: [iText-questions] Differences btw text extraction from iText and Acrobat Reader?

2013-03-20 Thread wwkloo
Leonard Rosenthol-3 wrote > On 3/20/13 8:38 AM, "wwkloo" < > wklogoo@ > > wrote: > >>Additional information: >>When create the PDF with another program, the text can be extracted by >>iText >>and Acrobat Reader XI correctly. >>-

Re: [iText-questions] Differences btw text extraction from iText and Acrobat Reader?

2013-03-20 Thread wwkloo
mkl wrote > wwkloo, > wwkloo wrote >> I followed to try similar things in .Net. With UTF16, the 1st Asian >> character returned is 0xFFFD. The 2nd is correct. > Please also indicate the iTextSharp version you are using. > > If it is fairly new, 5.3.x or 5.4.x, that mi

Re: [iText-questions] Differences btw text extraction from iText and Acrobat Reader?

2013-03-20 Thread wwkloo
wwkloo wrote > > mkl wrote >> wwkloo, >> wwkloo wrote >>> I have a PDF with Asian font >>> iTextExtract_W.pdf >>> <http://itext-general.2136553.n4.nabble.com/file/n4657836/iTextExtract_W.pdf> >>> >>> >>> When I e

Re: [iText-questions] Differences btw text extraction from iText and Acrobat Reader?

2013-03-19 Thread wwkloo
mkl wrote > wwkloo, > wwkloo wrote >> I have a PDF with Asian font >> iTextExtract_W.pdf >> <http://itext-general.2136553.n4.nabble.com/file/n4657836/iTextExtract_W.pdf> >> >> >> When I extract the text from it through TextRenderInfo.Ge