Hi, I'm looking at some JVM crash issues that have been reported against our digital asset management product that uses PDFBox to extract text, metadata and thumbnail images from PDF documents. In all cases the problem seems to be some underlying JVM or class library issue that PDFBox triggers by calling standard Java APIs.
Having the entire JVM crash for such operations is clearly not acceptable. Since we can't fix the underlying problems and don't want to disable all PDF processing, I'm trying to come up with ways in which PDFBox could work around these issues. See the issue tracker for all the details. Once I'm done with these issues, it would be nice to have a new PDFBox 0.8.1 release containing the workarounds and any other new fixes that we come up with in trunk. BR, Jukka Zitting