Hi.

 

I suposed that an Image could be red and extract text from it. And If I want to 
obtain text from the images, waht is the appropiated form or its impossible?

 

Thank you

 

De: Alexis Pigeon [mailto:pigeon.ale...@gmail.com] 
Enviado el: miércoles, 29 de mayo de 2013 13:33
Para: Post all your questions about iText here
Asunto: Re: [iText-questions] ITextSharp doesn´t read this pdf

 

Hi David,

 

Are you actually expecting this snippet of code to do OCR?

I've not inspected in detail the PDF you attached, but I think it only consists 
of images (resulting from the scanning), and no text at all.

Could it be that you are making some wrong assumptions about what the text 
extraction in iText does?

Cheers,
alexis

 

On 29 May 2013 13:07, David Lestón <dles...@provimad.es> wrote:

Sorry, Here is the psf file.

 

Thank you

 

De: David Lestón [mailto:dles...@provimad.es] 
Enviado el: miércoles, 29 de mayo de 2013 12:57
Para: 'itext-questions@lists.sourceforge.net'
Asunto: ITextSharp doesn´t read this pdf

 

Hi,

 

I have one problem with this pdf I have attached. With other pdf files I 
execute this code and it works ok. It gives me the text of the page:

 

PdfReader inputDocument = new PdfReader(fileName);

StringBuilder text = new StringBuilder();

                for (int page = 1; page <= inputDocument.NumberOfPages; page++)

                {

                    

                    ITextExtractionStrategy strategy = new 
SimpleTextExtractionStrategy();

                    string currentText = 
PdfTextExtractor.GetTextFromPage(inputDocument, page, strategy);

 

                    currentText = 
Encoding.UTF8.GetString(ASCIIEncoding.Convert(Encoding.Default, Encoding.UTF8, 
Encoding.Default.GetBytes(currentText)));

                    text.Append(currentText);

                }

                inputDocument.Close();

                MessageBox.Show(text.ToString());

 

But with the pdf attached it doesn´t work. The pdf attached is from a scannner 
and generated with ghostscript and other examples are generated from Word.

 

Could anybody help me, please?

 

Thank you. Regards


------------------------------------------------------------------------------
Introducing AppDynamics Lite, a free troubleshooting tool for Java/.NET
Get 100% visibility into your production application - at no cost.
Code-level diagnostics for performance bottlenecks with <2% overhead
Download for free and get started troubleshooting in minutes.
http://p.sf.net/sfu/appdyn_d2d_ap1
_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions

iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference 
to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples: 
http://itextpdf.com/themes/keywords.php

 

------------------------------------------------------------------------------
Introducing AppDynamics Lite, a free troubleshooting tool for Java/.NET
Get 100% visibility into your production application - at no cost.
Code-level diagnostics for performance bottlenecks with <2% overhead
Download for free and get started troubleshooting in minutes.
http://p.sf.net/sfu/appdyn_d2d_ap1
_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions

iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference 
to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples: 
http://itextpdf.com/themes/keywords.php

Reply via email to