Hello St�phane, thank you for the links. But since PDF indexing is a feature of Jahia, I hope that the Jahia team is still working on resolving issue in their software (pdf support). It's not about performance, but failures we talk. If pdfbox is just crap, it would be a good idea to at least provide adapters for other engines that support productive environments. For me it's like having HSQL DB as a free and instarun database for Jahia (which is good) and ALSO have additional support from scratch for MySQL and Oracle (which is absolutely required).
best regards Daniel Zimmermann On Tue, 07 Dec 2004 15:34:54 +0100, St�phane Croisier <[EMAIL PROTECTED]> wrote: > At 09:44 07/12/2004, you wrote: > >thanks for the patch. We'll test this as soon as possible since not > >beeing able to index > >PDF isn't a option really. PDF indexing in lucene was a key feature > >for our customer ;) > > Jahia packages by default the PDFbox open source library... > > This is clearly not the most performing PDF library but it is however free > (and open source). You may perfectly try to help the PDFBox team fixes > their open bugs or also integrate another commercial java PDF library (it > should not be comlicated. For example, see some perf comparison here: > http://snowtide.com/home/PDFTextStream/Performance but the price per CPU is > obviously not the same any more...). > > I read somewhere that some developers are now trying to try to optimize the > perf + PDF encryption support of PDFBox but I do not know exactly the > status of their work. If you are interested please try to check directly on > their web site/mailing lists: http://www.pdfbox.org/ > > Cheers > St�phane > >
