Check out: http://www.foolabs.com/xpdf (yes, this is a real website)
and http://www.opengroup.org/inforsrv/PDF/xpdf These tools even decrypt! Way too cool. I am working on integrating these into my company's web page, which has already implemented the Lucene search engine My approach will be: in the IndexFiles class, when a file has a PDF extension, it will run this converter, then index the text file but with the PDF file name. _______________________________________________ Lucene-dev mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/lucene-dev
