I see. Thanks a lot.
Regards,
wwkloo
--
View this message in context:
http://itext-general.2136553.n4.nabble.com/Text-box-of-non-BMP-character-with-zero-width-tp4657896p4657971.html
Sent from the iText - General mailing list archive at Nabble.com
Dear all,
I am trying to check the underline and overstrike status of characters in a
PDF.
I heard that underline and overstrike are achieved by graphical objects in
PDF. So I tried to detect the rectangle and line objects first. Based on
their position and the bounding box position of characters
Additional information:
- I run the code in iTextSharp 5.4.0 for getting the bounding box.
- If you want to get the character code of the first character properly,
need to use the svn code.
Thanks.
Regards,
wwkloo
--
View this message in context:
http://itext-general.2136553.n4.nabble.com
displayed properly in the PDF.
Please help!
Regards,
wwkloo
iTextExtract_W.pdf
<http://itext-general.2136553.n4.nabble.com/file/n4657896/iTextExtract_W.pdf>
===BEGIN CODE SEGMENT===
public class TestExtractionStategy :
iTextSharp.text.pdf.parser.ITextExtractionStrategy
{
p
Paulo Soares-4 wrote
> Fixed in the SVN.
>
> Paulo
Got the SVN code and compiled a new DLL. The output now is
=
31 00 20 00 41 D8 47 DD 20 00 0A 00 32 00 20 00 DB 92 20 00
=
Both characters are extracted correctly. Thanks a lot! Thank you all!
Regards,
wwkloo
--
View this m
Leonard Rosenthol-3 wrote
> On 3/20/13 8:38 AM, "wwkloo" <
> wklogoo@
> > wrote:
>
>>Additional information:
>>When create the PDF with another program, the text can be extracted by
>>iText
>>and Acrobat Reader XI correctly.
>>-
mkl wrote
> wwkloo,
> wwkloo wrote
>> I followed to try similar things in .Net. With UTF16, the 1st Asian
>> character returned is 0xFFFD. The 2nd is correct.
> Please also indicate the iTextSharp version you are using.
>
> If it is fairly new, 5.3.x or 5.4.x, that mi
wwkloo wrote
>
> mkl wrote
>> wwkloo,
>> wwkloo wrote
>>> I have a PDF with Asian font
>>> iTextExtract_W.pdf
>>> <http://itext-general.2136553.n4.nabble.com/file/n4657836/iTextExtract_W.pdf>
>>>
>>>
>>> When I e
mkl wrote
> wwkloo,
> wwkloo wrote
>> I have a PDF with Asian font
>> iTextExtract_W.pdf
>> <http://itext-general.2136553.n4.nabble.com/file/n4657836/iTextExtract_W.pdf>
>>
>>
>> When I extract the text from it through TextRenderInfo.Ge