Hi David,
Are you actually expecting this snippet of code to do OCR?
I've not inspected in detail the PDF you attached, but I think it only
consists of images (resulting from the scanning), and no text at all.
Could it be that you are making some wrong assumptions about what the text
extraction in iText does?
Cheers,
alexis
On 29 May 2013 13:07, David Lestón <dles...@provimad.es> wrote:
> Sorry, Here is the psf file.****
>
> ** **
>
> Thank you****
>
> ** **
>
> *De:* David Lestón [mailto:dles...@provimad.es]
> *Enviado el:* miércoles, 29 de mayo de 2013 12:57
> *Para:* 'itext-questions@lists.sourceforge.net'
> *Asunto:* ITextSharp doesn´t read this pdf****
>
> ** **
>
> Hi,****
>
> ** **
>
> I have one problem with this pdf I have attached. With other pdf files I
> execute this code and it works ok. It gives me the text of the page:****
>
> ** **
>
> PdfReader inputDocument = new PdfReader(fileName);****
>
> StringBuilder text = new StringBuilder();****
>
> for (int page = 1; page <= inputDocument.NumberOfPages;
> page++)****
>
> {****
>
> ****
>
> ITextExtractionStrategy strategy = new
> SimpleTextExtractionStrategy();****
>
> string currentText =
> PdfTextExtractor.GetTextFromPage(inputDocument,
> page, strategy);****
>
> ** **
>
> currentText = Encoding.UTF8.GetString(ASCIIEncoding
> .Convert(Encoding.Default, Encoding.UTF8, Encoding
> .Default.GetBytes(currentText)));****
>
> text.Append(currentText);****
>
> }****
>
> inputDocument.Close();****
>
> MessageBox.Show(text.ToString());****
>
> ** **
>
> But with the pdf attached it doesn´t work. The pdf attached is from a
> scannner and generated with ghostscript and other examples are generated
> from Word.****
>
> ** **
>
> Could anybody help me, please?****
>
> ** **
>
> Thank you. Regards****
>
>
> ------------------------------------------------------------------------------
> Introducing AppDynamics Lite, a free troubleshooting tool for Java/.NET
> Get 100% visibility into your production application - at no cost.
> Code-level diagnostics for performance bottlenecks with <2% overhead
> Download for free and get started troubleshooting in minutes.
> http://p.sf.net/sfu/appdyn_d2d_ap1
> _______________________________________________
> iText-questions mailing list
> iText-questions@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/itext-questions
>
> iText(R) is a registered trademark of 1T3XT BVBA.
> Many questions posted to this list can (and will) be answered with a
> reference to the iText book: http://www.itextpdf.com/book/
> Please check the keywords list before you ask for examples:
> http://itextpdf.com/themes/keywords.php
>
------------------------------------------------------------------------------
Introducing AppDynamics Lite, a free troubleshooting tool for Java/.NET
Get 100% visibility into your production application - at no cost.
Code-level diagnostics for performance bottlenecks with <2% overhead
Download for free and get started troubleshooting in minutes.
http://p.sf.net/sfu/appdyn_d2d_ap1
_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions
iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference
to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples:
http://itextpdf.com/themes/keywords.php