Eric,

ericvaleyev wrote:
> I've changed the function ExtractTextString and added PRTokeniser like in
> the book 18.2 page 575. 
> I even tried PDFbox to read the pages generated by Itext. still empty
> string.
> 
> So I think you re right about the way the PDF is generated using import
> pages. 
> However, I still havent found a solution.

Have you considered using the classes from the com.itextpdf.text.pdf.parser
package? They still are in heavy development but already seem to be quite
usefull. Especially considering the quite simple source documents you want
to parse, the parser classes should do the job.

Unless of course you start and store the data you search the content for in
some dedicated metadata or private object...

Regards,   Michael.
-- 
View this message in context: 
http://old.nabble.com/Problem-reading-the-content-from-the-PDF-created-by-ITextSharp-tp27892267p27911376.html
Sent from the iText - General mailing list archive at Nabble.com.


------------------------------------------------------------------------------
Download Intel® Parallel Studio Eval
Try the new software tools for yourself. Speed compiling, find bugs
proactively, and fine-tune applications for parallel performance.
See why Intel Parallel Studio got high marks during beta.
http://p.sf.net/sfu/intel-sw-dev
_______________________________________________
iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions

Buy the iText book: http://www.1t3xt.com/docs/book.php
Check the site with examples before you ask questions: 
http://www.1t3xt.info/examples/
You can also search the keywords list: http://1t3xt.info/tutorials/keywords/

Reply via email to