John, I know it's possible to extract text with other libraries, but I need a specific way to extract text which is easiest to achieve with iText. I also know that mentions of other libraries are taken to heart and not well received on this list (emotions do not come across the wire, so I might be mistaken). I also think that over the years iText has grown into a more complete library than just a tool for "creating and updating PDF files". In my post I was simply answering a question that caught my attention, and I took the opportunity to share one of my own problems. I think many of us would really like to see some improvement in this area.
Daniel

On 2011.11.15. 16:00, John Renfrew wrote:
> All along iText has kept saying it is about creating and updating PDF files,
> not parsing or rendering.
>
> The output from PDF Box is attached for the first page of your PDF to show
> that it is possible using another library, but as often said here, pay for
> some development, or join in and write (or use something else..)
>
> john renfrew
> Right Way Up
>
> -----Original Message-----
> From: Dániel Kékesi [mailto:dan...@kekesi.com]
> Sent: 15 November 2011 14:15
> To: itext-questions@lists.sourceforge.net
> Subject: Re: [iText-questions] Save PDF as plain text
>
> Hi Kevin,
>
> Not to hijack this thread, but what I'd like to see is support
> for more encoding types. For example, the attached document produces no
> output using any extraction strategy (I tried with 5.1.2).
>
> Thanks.
> Daniel
>
> On 2011.11.15. 14:26, Kevin Day wrote:
>>
>> WMJ wrote:
>>>
>>> The parser is not very powerful or convenient yet, but it does point
>>> you to the most detailed part of the PDF text.
>>>
>>
>> WMJ - what would you see as enhancements that would make it more
>> powerful or convenient? Understanding the details of this will help us
>> improve.
>> Thanks.
>>
>> --
>> View this message in context:
>> http://itext-general.2136553.n4.nabble.com/Save-PDF-as-plain-text-tp4041246p4042937.html
>> Sent from the iText - General mailing list archive at Nabble.com.
>>
>> ------------------------------------------------------------------------------
>> RSA(R) Conference 2012
>> Save $700 by Nov 18
>> Register now
>> http://p.sf.net/sfu/rsa-sfdev2dev1
>> _______________________________________________
>> iText-questions mailing list
>> iText-questions@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/itext-questions
>>
>> iText(R) is a registered trademark of 1T3XT BVBA.
>> Many questions posted to this list can (and will) be answered with a
>> reference to the iText book: http://www.itextpdf.com/book/
>> Please check the keywords list before you ask for examples:
>> http://itextpdf.com/themes/keywords.php