At 06:28 PM 12/8/2004, Christian Oetterli wrote:
I had to solve this Problem myself recently. I have found a piece of c-code (http://www.codeproject.com/cpp/ExtractPDFText.asp, thanks to the author Shar136) and adapted it to java.

As I wrote in the comments on that article - this solution will fail (miserably in many cases) on a LOT of PDF documents out there...


This is NOT the right way to do text extraction from PDF. If you want to do it - use a library that supports it properly like pdfBox, JPEDAL or Multivalent.


Leonard

---------------------------------------------------------------------------
Leonard Rosenthol                            <mailto:[EMAIL PROTECTED]>
Chief Technical Officer                      <http://www.pdfsages.com>
PDF Sages, Inc.                              215-938-7080 (voice)
                                             215-938-0880 (fax)



-------------------------------------------------------
SF email is sponsored by - The IT Product Guide
Read honest & candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now. http://productguide.itmanagersjournal.com/
_______________________________________________
iText-questions mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/itext-questions

Reply via email to