Text extraction slow and /tmp fills up with AWT font files
----------------------------------------------------------
Key: PDFBOX-959
URL: https://issues.apache.org/jira/browse/PDFBOX-959
Project: PDFBox
Issue Type: Bug
Components: Text extraction
Affects Versions: 1.4.0
Reporter: Kevin Jackson
Priority: Critical
Fix For: 1.5.0
During text extraction there is no need to create AWT fonts.
However, the current Type1C font code always creates the AWT font during initialization.
This has several bad side effects:
1. Time is wasted creating the AWT font.
2. The font files are copied into /tmp, which fills up after a few thousand text
extractions.
3. The AWT font is created in a synchronized region, so that step is effectively single-threaded.
The patch is quite simple: delay creation of the AWT font until it is required.
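A minimal sketch of the delayed creation described above, not the attached patch itself. The class LazyFont, its fields, and the loader supplier are hypothetical names used for illustration and are not part of PDFBox. The type parameter T stands in for java.awt.Font so the sketch runs without any AWT dependency.

```java
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Supplier;

// Hypothetical wrapper, not the actual PDFBox Type1C font class.
// T stands for the expensive rendering font (java.awt.Font in the case above).
class LazyFont<T> {
    private final String name;
    private final Supplier<T> awtLoader; // assumed factory for the expensive font
    private volatile T awtFont;          // stays null until first requested

    LazyFont(String name, Supplier<T> awtLoader) {
        this.name = name;
        this.awtLoader = awtLoader;
        // No AWT font is created here, so initialization stays cheap.
    }

    // Used by text extraction; never touches the AWT font.
    String getName() {
        return name;
    }

    // Used by rendering only; creates the font on the first call and caches it.
    T getAwtFont() {
        T font = awtFont;
        if (font == null) {
            synchronized (this) {
                font = awtFont;
                if (font == null) {
                    font = awtLoader.get();
                    awtFont = font;
                }
            }
        }
        return font;
    }
}

class LazyFontDemo {
    public static void main(String[] args) {
        AtomicInteger loads = new AtomicInteger();
        LazyFont<String> font = new LazyFont<>("ExampleFont", () -> {
            loads.incrementAndGet();
            return "stand-in for java.awt.Font";
        });

        font.getName(); // text extraction path
        System.out.println("loads after text extraction: " + loads.get());

        font.getAwtFont(); // rendering path, first call
        font.getAwtFont(); // rendering path, cached
        System.out.println("loads after rendering twice: " + loads.get());
    }
}
```

With this shape, a text-extraction-only run never invokes the loader, so no font file would be written to /tmp, and the synchronized block is only entered by callers that actually need the AWT font before it has been cached.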