Hi,

This is my first post to the PDFBox dev team, so first things first: thank
you for the effort you have put into this quality product.  I have been
using PDFBox for a while now, and it has held up to a lot of different
documents that have needed parsing and thumbnailing.

On to the issue that I have come across.

When calling PDPage.convertToImage(imageType, resolution), I have found a
case where the application hangs: no error messages, no exceptions, no CPU
spike, no I/O.  The problem appears to be some strange corruption of the
awtFont by the time it reaches the method PDSimpleFont.drawString.  Some
pretty deep digging turned up that if a font with too few glyphs is passed
into PDSimpleFont.writeFont, it will enter Graphics2D.drawString ... ne'er
to return.

If I count the glyphs in the font and, whenever the count falls below a
certain threshold, simply re-instantiate the Font, the hang goes away.
Typical glyph counts range from 70 to 1000+, so I've gone with an arbitrary
threshold of 40.  I can't measure any difference in performance (except
that my app will actually return from attempting this doc!).
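
For illustration, the glyph-count check described above might look roughly
like the sketch below.  This is not the actual patch: the class name
GlyphGuard, its method names, and the way the Font is rebuilt (from its
name, style and size) are all assumptions made for the example.  Only the
threshold of 40 comes from the description above; Font.getNumGlyphs() is
the standard java.awt call for reading a font's glyph count.

```java
import java.awt.Font;

// Hypothetical helper for illustration only; not part of PDFBox.
class GlyphGuard {
    // Arbitrary threshold taken from the description above.
    static final int MIN_GLYPHS = 40;

    // True when a glyph count is below the threshold.
    static boolean tooFewGlyphs(int glyphCount) {
        return glyphCount < MIN_GLYPHS;
    }

    // Returns the font unchanged, or a fresh instance if it reports too few glyphs.
    static Font checked(Font awtFont) {
        if (tooFewGlyphs(awtFont.getNumGlyphs())) {
            // Assumption: "re-instantiate" is read here as rebuilding the
            // font from its logical name, style and size.
            return new Font(awtFont.getName(), awtFont.getStyle(), awtFont.getSize());
        }
        return awtFont;
    }
}
```

Under these assumptions, the result of GlyphGuard.checked(awtFont) would be
the font handed on to Graphics2D.drawString in place of the original.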

I've got a sample PDF that reproduces the problem, and a patch that seems
to solve it sensibly.

Who is maintaining this part of the project, so that I know where to send
them?

My configuration:

O/S    : Ubuntu 8.04 (headless)
PDFBox : 1.5.0

The offending string:

Font   : Tahoma
String : 0x20 (space)

Thanks,

-Craig
