On Wednesday, September 17, 2003, at 08:42 AM, Ben Litchfield wrote:
What, no PDF files!!
Haha!
http://www.pdfbox.org
And I've used pdfbox before - its cool.
And I'm cool with adding PDF and Word indexing to the demo personally, but I didn't want to increase the "weight" of the demo application. If folks feel strongly about it then I'll incorporate it.
A word of warning: PDFBox is fantastic, I agree - but some PDFs are not so... In my application I experienced numerous hangs when PDFBox would start parsing some PDFs (I can send the files to Ben if required), and then got stuck in an infinite wait somewhere... So I came up with a workaround: I run the parser in a separate thread, while waiting in the main thread, and then after a certain timeout I kill the processing thread and return.
-- Best regards, Andrzej Bialecki
------------------------------------------------- Software Architect, System Integration Specialist CEN/ISSS EC Workshop, ECIMF project chair EU FP6 E-Commerce Expert/Evaluator ------------------------------------------------- FreeBSD developer (http://www.freebsd.org)
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
