2) http://www.pdfbox.org/ is a library for extracting text from PDFs (full-text searching with Lucene of PDF documents is made possible through this). The author has switched licenses from LGPL to BSD per my request.
Just fyi, maybe someone finds some reason to use them over here or in some other Apache projects. I made the enquiries because of the clarified ASF policies w.r.t. use of LGPL libraries from within Apache code.
Cheers,
</Steven> -- Steven Noels http://outerthought.org/ Outerthought - Open Source, Java & XML Competence Support Center Read my weblog at http://blogs.cocoondev.org/stevenn/ stevenn at outerthought.org stevenn at apache.org