Re: [Podofo-users] Get JP characters from PDF
On Mon, 2015-06-22 at 15:24 +, Dineshkumar Ramalingam wrote: Hi, We are getting Ú0ü0¸0 while painting. We have tries with UTF-8, but we are getting like ã…ㅼㇸ 2. Encoding used: new PdfIdentityEncoding( 0, 0x, true ) as mentioned CreationTest.cpp Hi, it's weird. Does the code from the CreationTest.cpp work for you properly? There's used a different font as well. Bye, zyx -- http://www.litePDF.cz i...@litepdf.cz -- Monitor 25 network devices or servers for free with OpManager! OpManager is web-based network management software that monitors network devices and physical virtual servers, alerts via email sms for fault. Monitor 25 devices for free with no restriction. Download now http://ad.doubleclick.net/ddm/clk/292181274;119417398;o ___ Podofo-users mailing list Podofo-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/podofo-users
Re: [Podofo-users] Get JP characters from PDF
Hi, Code portion taken from CreationTest.cpp is not working for me, Code Used: void CreateUnicodeAnnotationText( PdfPage* pPage, PdfDocument* /*pDocument*/ ) { PdfString sJap(reinterpret_castconst pdf_utf8*(「PoDoFo」は今から日本語も話せます。)); PdfAnnotation* pAnnotation = pPage-CreateAnnotation( ePdfAnnotation_Text, PdfRect( 400.0, 200.0, 20.0, 20.0 ) ); PdfString sGerman(reinterpret_castconst pdf_utf8*(Unicode Umlauts: ÄÖÜß)); pAnnotation-SetTitle( sGerman ); pAnnotation-SetContents( sJap ); pAnnotation-SetOpen( true ); } void CreateUnicodeAnnotationFreeText( PdfPage* pPage, PdfDocument* pDocument ) { PdfString sJap(reinterpret_castconst pdf_utf8*(「PoDoFo」は今から日本語も話せます。)); PdfFont* pFont = pDocument-CreateFont( Arial Unicode MS, false, new PdfIdentityEncoding( 0, 0x, true ) ); PdfRect rect( 200.0, 200.0, 200.0, 200.0 ); /* PdfXObject xObj( rect, pDocument ); PdfPainter painter; painter.SetPage( xObj ); painter.SetFont( pFont ); painter.SetColor( 1.0, 0.0, 0.0 ); painter.Rectangle( 10.0, 10.0, 100.0, 100.0 ); painter.FillAndStroke(); painter.DrawText( 100.0, 100.0, sJap ); painter.FinishPage(); */ std::ostringstream oss; oss BT std::endl /pFont-GetIdentifier().GetName() pFont-GetFontSize() Tf std::endl; WriteStringToStream( sJap, oss, pFont ); oss Tj ET std::endl; PdfDictionary fonts; fonts.AddKey(pFont-GetIdentifier().GetName(), pFont-GetObject()-Reference()); PdfDictionary resources; resources.AddKey( PdfName(Fonts), fonts ); PdfAnnotation* pAnnotation = pPage-CreateAnnotation( ePdfAnnotation_FreeText, rect ); PdfString sGerman(reinterpret_castconst pdf_utf8*(Unicode Umlauts: ÄÖÜß)); pAnnotation-SetTitle( sGerman ); pAnnotation-SetContents( sJap ); //pAnnotation-SetAppearanceStream( xObj ); pAnnotation-GetObject()-GetDictionary().AddKey( PdfName(DA), PdfString(oss.str()) ); pAnnotation-GetObject()-GetDictionary().AddKey( PdfName(DR), resources ); } My Podofo version: 0.9.1 Here they used Annotation, but I want to use PdfPainter.DrawText method to draw. Regards, Dinesh -Original Message- From: zyx [mailto:z...@litepdf.cz] Sent: 22 June 2015 21:24 To: podofo-users@lists.sourceforge.net Subject: Re: [Podofo-users] Get JP characters from PDF On Mon, 2015-06-22 at 15:24 +, Dineshkumar Ramalingam wrote: Hi, We are getting Ú0ü0¸0 while painting. We have tries with UTF-8, but we are getting like ã…ㅼㇸ 2. Encoding used: new PdfIdentityEncoding( 0, 0x, true ) as mentioned CreationTest.cpp Hi, it's weird. Does the code from the CreationTest.cpp work for you properly? There's used a different font as well. Bye, zyx -- http://www.litePDF.cz i...@litepdf.cz -- Monitor 25 network devices or servers for free with OpManager! OpManager is web-based network management software that monitors network devices and physical virtual servers, alerts via email sms for fault. Monitor 25 devices for free with no restriction. Download now http://ad.doubleclick.net/ddm/clk/292181274;119417398;o ___ Podofo-users mailing list Podofo-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/podofo-users ::DISCLAIMER:: The contents of this e-mail and any attachment(s) are confidential and intended for the named recipient(s) only. E-mail transmission is not guaranteed to be secure or error-free as information could be intercepted, corrupted, lost, destroyed, arrive late or incomplete, or may contain viruses in transmission. The e mail and its contents (with or without referred errors) shall therefore not attach any liability on the originator or HCL or its affiliates. Views or opinions, if any, presented in this email are solely those of the author and may not necessarily reflect the views or opinions of HCL or its affiliates. Any form of reproduction, dissemination, copying, disclosure, modification, distribution and / or publication of this message without the prior written consent of authorized representative of HCL is strictly prohibited. If you have received this email in error please delete it and notify the sender immediately. Before opening any email and/or attachments, please check them for viruses and other defects. -- Monitor 25 network devices or servers for free with OpManager! OpManager is web-based
Re: [Podofo-users] Get JP characters from PDF
Hi, We can able to get the Unicode string by using 'GetStringW()' method. It returns 'std::wstring'. But if we try to convert 'std::wstring' to PdfString then it gives the wrong characters. We tries below line to convert 'std::wstring' to PdfString, PdfString pdfstring(reinterpret_castconst pdf_utf8*( std::wstring.c_str())); We want to draw this unicode characters in PDF file. Kindly help us to resolve this issue. Regards, Dinesh From: Dineshkumar Ramalingam Sent: 17 June 2015 18:31 To: 'podofo-users@lists.sourceforge.net' Subject: RE: Get JP characters from PDF I tried to get the string by GetStringUtf8, but it's not giving actual. From: Dineshkumar Ramalingam Sent: 17 June 2015 18:09 To: 'podofo-users@lists.sourceforge.net' Subject: Get JP characters from PDF Hi, Here I am trying to get Japanese characters from bookmark title. But it is not giving the expected one. How to encode the pdf string while parsing? Regards, Dinesh ::DISCLAIMER:: The contents of this e-mail and any attachment(s) are confidential and intended for the named recipient(s) only. E-mail transmission is not guaranteed to be secure or error-free as information could be intercepted, corrupted, lost, destroyed, arrive late or incomplete, or may contain viruses in transmission. The e mail and its contents (with or without referred errors) shall therefore not attach any liability on the originator or HCL or its affiliates. Views or opinions, if any, presented in this email are solely those of the author and may not necessarily reflect the views or opinions of HCL or its affiliates. Any form of reproduction, dissemination, copying, disclosure, modification, distribution and / or publication of this message without the prior written consent of authorized representative of HCL is strictly prohibited. If you have received this email in error please delete it and notify the sender immediately. Before opening any email and/or attachments, please check them for viruses and other defects. -- Monitor 25 network devices or servers for free with OpManager! OpManager is web-based network management software that monitors network devices and physical virtual servers, alerts via email sms for fault. Monitor 25 devices for free with no restriction. Download now http://ad.doubleclick.net/ddm/clk/292181274;119417398;o___ Podofo-users mailing list Podofo-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/podofo-users
Re: [Podofo-users] Get JP characters from PDF
Hi, Thanks for your support. We can able to get the Unicode string(pdf_utf16be) from std::wstring. But if we try to paint the string by using PdfPainter, it's not actually painting. const PdfEncoding* pEncoding = new PdfIdentityEncoding(); pdfPainter.SetFont(Doc-CreateFont(Arial,pEncoding,true)); const pdf_utf16be *check = wstring.c_str(); PdfString str(check); pdfPainter.DrawText(100, 100, str); Do we need to specify any specific encoding params for this? Regards, Dinesh -Original Message- From: zyx [mailto:z...@litepdf.cz] Sent: 22 June 2015 14:26 To: podofo-users@lists.sourceforge.net Subject: Re: [Podofo-users] Get JP characters from PDF On Mon, 2015-06-22 at 06:47 +, Dineshkumar Ramalingam wrote: Hi, We can able to get the Unicode string by using ‘GetStringW()’ method. It returns ‘std::wstring’. But if we try to convert ‘std::wstring’ to PdfString then it gives the wrong characters. We tries below line to convert ‘std::wstring’ to PdfString, PdfString pdfstring(reinterpret_castconst pdf_utf8*( std::wstring.c_str())); Hi, std::wstring is defined as typedef basic_stringwchar_t wstring; [1], thus wstring::c_str() returns wchar_t *, which is UTF-16, not UTF-8. You should convert UTF-16 into UTF-8, or use a different constructor of the PdfString (or simply PdfString::setFromWchar_t()). Bye, zyx [1] http://www.cplusplus.com/reference/string/wstring/ -- http://www.litePDF.cz i...@litepdf.cz -- Monitor 25 network devices or servers for free with OpManager! OpManager is web-based network management software that monitors network devices and physical virtual servers, alerts via email sms for fault. Monitor 25 devices for free with no restriction. Download now http://ad.doubleclick.net/ddm/clk/292181274;119417398;o ___ Podofo-users mailing list Podofo-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/podofo-users ::DISCLAIMER:: The contents of this e-mail and any attachment(s) are confidential and intended for the named recipient(s) only. E-mail transmission is not guaranteed to be secure or error-free as information could be intercepted, corrupted, lost, destroyed, arrive late or incomplete, or may contain viruses in transmission. The e mail and its contents (with or without referred errors) shall therefore not attach any liability on the originator or HCL or its affiliates. Views or opinions, if any, presented in this email are solely those of the author and may not necessarily reflect the views or opinions of HCL or its affiliates. Any form of reproduction, dissemination, copying, disclosure, modification, distribution and / or publication of this message without the prior written consent of authorized representative of HCL is strictly prohibited. If you have received this email in error please delete it and notify the sender immediately. Before opening any email and/or attachments, please check them for viruses and other defects. -- Monitor 25 network devices or servers for free with OpManager! OpManager is web-based network management software that monitors network devices and physical virtual servers, alerts via email sms for fault. Monitor 25 devices for free with no restriction. Download now http://ad.doubleclick.net/ddm/clk/292181274;119417398;o ___ Podofo-users mailing list Podofo-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/podofo-users
Re: [Podofo-users] Get JP characters from PDF
On Mon, 2015-06-22 at 06:47 +, Dineshkumar Ramalingam wrote: Hi, We can able to get the Unicode string by using ‘GetStringW()’ method. It returns ‘std::wstring’. But if we try to convert ‘std::wstring’ to PdfString then it gives the wrong characters. We tries below line to convert ‘std::wstring’ to PdfString, PdfString pdfstring(reinterpret_castconst pdf_utf8*( std::wstring.c_str())); Hi, std::wstring is defined as typedef basic_stringwchar_t wstring; [1], thus wstring::c_str() returns wchar_t *, which is UTF-16, not UTF-8. You should convert UTF-16 into UTF-8, or use a different constructor of the PdfString (or simply PdfString::setFromWchar_t()). Bye, zyx [1] http://www.cplusplus.com/reference/string/wstring/ -- http://www.litePDF.cz i...@litepdf.cz -- Monitor 25 network devices or servers for free with OpManager! OpManager is web-based network management software that monitors network devices and physical virtual servers, alerts via email sms for fault. Monitor 25 devices for free with no restriction. Download now http://ad.doubleclick.net/ddm/clk/292181274;119417398;o ___ Podofo-users mailing list Podofo-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/podofo-users
[Podofo-users] Support Required
Hi, Using Podofo we are trying to load the pdf document in to the memory and extract the pdf objects. The function GetObjects().GetObjectCount() returns the invalid object count and when we try to assign the objects to TCIVecObjects, the application crashes. Below is the sample snippet of the code which we are using, PdfMemDocument* doc = NULL; doc = new PdfMemDocument(); doc-Load(D:\\Test.pdf); doc-GetObjects().GetObjectCount(); TCIVecObjects it = doc-GetObjects().begin(); //crashes We also observed that when we have WINDDK installed the same code works and we are able to get the valid object count. Could you please help us on how to make this work. FYI, we have tried with visual studio 2013,2010 and 2008. Thank You. Best Regards, C. Hari ::DISCLAIMER:: The contents of this e-mail and any attachment(s) are confidential and intended for the named recipient(s) only. E-mail transmission is not guaranteed to be secure or error-free as information could be intercepted, corrupted, lost, destroyed, arrive late or incomplete, or may contain viruses in transmission. The e mail and its contents (with or without referred errors) shall therefore not attach any liability on the originator or HCL or its affiliates. Views or opinions, if any, presented in this email are solely those of the author and may not necessarily reflect the views or opinions of HCL or its affiliates. Any form of reproduction, dissemination, copying, disclosure, modification, distribution and / or publication of this message without the prior written consent of authorized representative of HCL is strictly prohibited. If you have received this email in error please delete it and notify the sender immediately. Before opening any email and/or attachments, please check them for viruses and other defects. -- Monitor 25 network devices or servers for free with OpManager! OpManager is web-based network management software that monitors network devices and physical virtual servers, alerts via email sms for fault. Monitor 25 devices for free with no restriction. Download now http://ad.doubleclick.net/ddm/clk/292181274;119417398;o___ Podofo-users mailing list Podofo-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/podofo-users
Re: [Podofo-users] Get JP characters from PDF
On Mon, 2015-06-22 at 16:13 +, Dineshkumar Ramalingam wrote: My Podofo version: 0.9.1 Hi, try to update to at least 0.9.3 (current stable release), but the best if you try the svn trunk, which is at revision 1673 right now. Here they used Annotation, but I want to use PdfPainter.DrawText method to draw. I use this code with the svn version: PdfString sJap(reinterpret_castconst pdf_utf8*(「PoDoFo」は今から日本語も話せます。)); PdfFont* pFont = writer.CreateFont( Arial Unicode MS, false, new PdfIdentityEncoding( 0, 0x, true ) ); painter.SetFont( pFont ); painter.SetColor( 0.0, 0.0, 0.5 ); painter.DrawText( 100.0, 300.0, sJap ); and it prints the text as expected (copypaste the text between painter.DrawMultiLineText() and painter.FinishPage() calls in the main() of the CreationTest.cpp, compile, call and observe the resulting pdf file's second page bottom. Check what your editor made with the UTF-8 strings in the code, maybe it garbled the encoding in the file. In other words, focus on the encoding of the string itself. I'm not able to guide you more. Bye, zyx -- http://www.litePDF.cz i...@litepdf.cz -- Monitor 25 network devices or servers for free with OpManager! OpManager is web-based network management software that monitors network devices and physical virtual servers, alerts via email sms for fault. Monitor 25 devices for free with no restriction. Download now http://ad.doubleclick.net/ddm/clk/292181274;119417398;o ___ Podofo-users mailing list Podofo-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/podofo-users