I was afraid of that. Thanks for your help.

Sergio
From: Leonard Rosenthol <lrose...@adobe.com>
To: Post all your questions about iText here <itext-questions@lists.sourceforge.net>
Date: 13-05-13 03:25 PM
Subject: Re: [iText-questions] Reading pdf files with hebrew - encoding support

Since Hebrew is NOT representable in ANSI, the second PDF in question has been created in such a way as to prevent text extraction. There is nothing you can do (short of OCR).

Leonard

On 5/13/13 1:40 PM, "ser...@mof.gov.il" <ser...@mof.gov.il> wrote:

>
>I am trying to read 2 PDF files with PdfReader. The PDF files contain
>Hebrew characters.
>The encoding in one file is Identity-H and in the other is ANSI.
>When the encoding is Identity-H, the Hebrew characters are displayed in the
>console and written to a txt file correctly.
>When the encoding is ANSI, the Hebrew characters are displayed as "?".
>The Java method used:
>
>    public static void loadPdfString() {
>        FileWriter fileWriter = null;
>        String INPUTFILE = "c:/Sergio/develop/conv_pdf/payslip.pdf"; // Specifying the file location.
>        // String INPUTFILE = "c:/Sergio/develop/conv_pdf/payslip2.PDF"; // Specifying the file location.
>        try {
>            File newTextFile = new File("C:/Sergio/develop/conv_pdf/payslip_.txt");
>            fileWriter = new FileWriter(newTextFile);
>
>            PdfReader reader = new PdfReader(INPUTFILE);
>            int n = reader.getNumberOfPages();
>
>            // Extracting the content from a particular page.
>            String str = PdfTextExtractor.getTextFromPage(reader, 1);
>
>            // Print to console
>            System.out.println(str);
>            System.out.println("------------------------------");
>            // Print to file
>            fileWriter.write(str);
>            fileWriter.close();
>        }
>        catch (Exception e) {
>            System.out.println(e);
>        }
>    }
>
>
>Can anyone help me?
>What is the best method to read Hebrew characters?
>Thank you.
>Sergio
>
>------------------------------------------------------------------------------
>Learn Graph Databases - Download FREE O'Reilly Book
>"Graph Databases" is the definitive new guide to graph databases and
>their applications. This 200-page book is written by three acclaimed
>leaders in the field. The early access version is available now.
>Download your free book today! http://p.sf.net/sfu/neotech_d2d_may
>_______________________________________________
>iText-questions mailing list
>iText-questions@lists.sourceforge.net
>https://lists.sourceforge.net/lists/listinfo/itext-questions
>
>iText(R) is a registered trademark of 1T3XT BVBA.
>Many questions posted to this list can (and will) be answered with a
>reference to the iText book: http://www.itextpdf.com/book/
>Please check the keywords list before you ask for examples:
>http://itextpdf.com/themes/keywords.php
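
For anyone finding this thread later: Leonard's point that Hebrew is not representable in ANSI can be illustrated in plain Java, without iText. The sketch below is only an analogy. It assumes that "ANSI" corresponds roughly to the windows-1252 charset, and the class name, method names, and sample string are made up for the illustration. It does not inspect a PDF or show what iText does internally; it only shows that a Latin single-byte encoding has no codes for Hebrew letters, so they come out as "?".

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class AnsiHebrewDemo {

    // Assumption: "ANSI" in the thread is approximated by windows-1252.
    static final Charset ANSI = Charset.forName("windows-1252");

    /** Returns true if every character of text has a code in the ANSI stand-in. */
    static boolean fitsInAnsi(String text) {
        return ANSI.newEncoder().canEncode(text);
    }

    /** Encodes to the ANSI stand-in and decodes back; a lossy round trip. */
    static String roundTripThroughAnsi(String text) {
        // String.getBytes(Charset) substitutes the charset's replacement
        // byte for every character it cannot map.
        byte[] bytes = text.getBytes(ANSI);
        return new String(bytes, ANSI);
    }

    /** Same round trip through UTF-8, which can represent Hebrew. */
    static String roundTripThroughUtf8(String text) {
        byte[] bytes = text.getBytes(StandardCharsets.UTF_8);
        return new String(bytes, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // Hypothetical sample text: the four Hebrew letters of "shalom".
        String hebrew = "\u05E9\u05DC\u05D5\u05DD";

        System.out.println("fits in ANSI:     " + fitsInAnsi(hebrew));
        System.out.println("via ANSI:         " + roundTripThroughAnsi(hebrew));
        System.out.println("via UTF-8 intact: " + roundTripThroughUtf8(hebrew).equals(hebrew));
    }
}
```

Once the original characters have been reduced to codes from a Latin encoding, nothing on the reading side can tell which Hebrew letter each code stood for, which is consistent with the advice above that only OCR remains.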