Christian,

Christian Eric Paran wrote
> When I tried to extract text, the reader extract only *blanks/destroyed text*
> on SOME pdfs.

As I already commented on your crosspost on Stack Overflow (http://stackoverflow.com/questions/14490829/itext-pdf-text-extraction-with-fonts-and-styles):

mkl wrote
> You already know the generic reasons (as addressed in Lee's answer) for
> the difficulties of PDF text extraction from answers to your former
> questions here. If the pdfs in question are especially cumbersome, you
> might want to supply them for inspection. It is very unlikely that font
> replacement will do any good in those cases which already return the text
> (albeit with funny spaces inserted). In the cases you only retrieve spaces
> font replacement might help in certain very very special situations, but
> not in general, though.

The replacement that might help, as mentioned in my last sentence there, refers to situations in which:

* the font as embedded in the PDF lacks its to-unicode mapping,
* you have the same font with that mapping, and
* the mapping has not been mixed up in the embedding process.

In that case, BTW, it obviously suffices to add your to-unicode mapping to the font in the PDF.

There is one more situation in which font replacement might help: if some font replacement has already taken place, i.e. if the font currently associated with the text in the PDF is not the font the PDF was originally typeset with but some replacement (e.g. done for license reasons), replacing it with the original font may improve text parsing quality.

Regards,
Michael
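
To illustrate the to-unicode case described above, here is a minimal sketch in plain Java. It deliberately does not use the iText API; the class name ToUnicodeSketch, the character codes, and the mapping tables are all made up for the example. It only shows why extraction yields unusable text when the mapping is missing, why supplying the mapping from a copy of the same font can help, and why that fails if the codes were mixed up during embedding.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration only: this does not use the iText API.
class ToUnicodeSketch {

    // Translate character codes from a page into text using a
    // code -> Unicode table (the role a font's to-unicode mapping plays).
    // Codes without an entry become U+FFFD, the Unicode replacement character.
    static String decode(int[] codes, Map<Integer, String> toUnicode) {
        StringBuilder text = new StringBuilder();
        for (int code : codes) {
            text.append(toUnicode.getOrDefault(code, "\uFFFD"));
        }
        return text.toString();
    }

    public static void main(String[] args) {
        int[] codes = {1, 2, 3, 3, 4}; // made-up codes as they might appear in a page

        // Case 1: the embedded font has no to-unicode mapping,
        // so every code falls back to the replacement character.
        Map<Integer, String> embedded = new HashMap<>();
        System.out.println(decode(codes, embedded));

        // Case 2: a copy of the same font still has the mapping;
        // adding it to the embedded font makes the text readable.
        Map<Integer, String> reference = new HashMap<>();
        reference.put(1, "H");
        reference.put(2, "e");
        reference.put(3, "l");
        reference.put(4, "o");
        embedded.putAll(reference);
        System.out.println(decode(codes, embedded)); // prints "Hello"

        // Case 3: the codes were reassigned during embedding, so the
        // reference mapping no longer matches and the output is wrong.
        Map<Integer, String> mixedUp = new HashMap<>();
        mixedUp.put(1, "e");
        mixedUp.put(2, "l");
        mixedUp.put(3, "o");
        mixedUp.put(4, "H");
        System.out.println(decode(codes, mixedUp)); // prints "elooH"
    }
}
```

In a real PDF the table is the font's ToUnicode CMap rather than a HashMap, but the three cases correspond to the three conditions listed above.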