On Tue, 20 Sep 2005 02:34 pm, Jeremias Maerki wrote: <snip/> > > But the underlying PDF library looks quite interesting (PDFBox). Has > anyone had any experiences with it? If yes, we should add it to our > PDF post-processors list on the website, if just because it has a > better license than iText.
I am using it in a project which provides on-line searchable PDF files (Government Acts and Regulations) indexed using Lucene and in that context it works fine, that is as a backend activity PDFBox extracts the text components from the PDF for Lucene to index and as an on-line activity once matching documents are found the PDF is searched again using PDFBox to find the search terms in the PDF file so an Acrobat highlight XML file can be constructed. <snip/> > > Jeremias Maerki Manuel
