[
https://issues.apache.org/jira/browse/TIKA-2862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16837432#comment-16837432
]
Sergey Beryozkin commented on TIKA-2862:
----------------------------------------
The call path from PDType1Font to RAFDataStream:
{noformat}
17:31:09,714 ERROR [org.apa.pdf.pdm.fon.FileSystemFontProvider] Could not load
font file: /usr/share/fonts/liberation/LiberationSans-Regular.ttf:
java.lang.NullPointerException
at
org.apache.fontbox.ttf.RAFDataStream.readSignedShort(RAFDataStream.java:77)
at
org.apache.fontbox.ttf.TTFDataStream.read32Fixed(TTFDataStream.java:50)
at org.apache.fontbox.ttf.TTFParser.parse(TTFParser.java:132)
at org.apache.fontbox.ttf.TTFParser.parse(TTFParser.java:87)
at
org.apache.pdfbox.pdmodel.font.FileSystemFontProvider.readTrueTypeFont(FileSystemFontProvider.java:731)
at
org.apache.pdfbox.pdmodel.font.FileSystemFontProvider.getTrueTypeFont(FileSystemFontProvider.java:696)
at
org.apache.pdfbox.pdmodel.font.FileSystemFontProvider.access$200(FileSystemFontProvider.java:55)
at
org.apache.pdfbox.pdmodel.font.FileSystemFontProvider$FSFontInfo.getFont(FileSystemFontProvider.java:132)
at
org.apache.pdfbox.pdmodel.font.FontMapperImpl.findFont(FontMapperImpl.java:436)
at
org.apache.pdfbox.pdmodel.font.FontMapperImpl.findFontBoxFont(FontMapperImpl.java:382)
at
org.apache.pdfbox.pdmodel.font.FontMapperImpl.getFontBoxFont(FontMapperImpl.java:359)
at
org.apache.pdfbox.pdmodel.font.PDType1Font.<init>(PDType1Font.java:146)
at
org.apache.pdfbox.pdmodel.font.PDType1Font.<clinit>(PDType1Font.java:91)
{noformat}
> Make PDF Parser Graal native mode ready
> ----------------------------------------
>
> Key: TIKA-2862
> URL: https://issues.apache.org/jira/browse/TIKA-2862
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Affects Versions: 1.20
> Reporter: Sergey Beryozkin
> Assignee: Sergey Beryozkin
> Priority: Major
>
> PDF Parser is not Graal native mode ready yet, the following is reported when
> it is processed as part of Quarkus native mode build:
> Error: Detected a FileDescriptor in the image heap. ....You can manually
> delay class initialization to image run time by using the option
> --delay-class-initialization-to-runtime=<class-name>. ...
> Detailed message:
> Trace: object org.apache.fontbox.ttf.BufferedRandomAccessFile
> object org.apache.fontbox.ttf.RAFDataStream
> object org.apache.fontbox.ttf.TrueTypeFont
> object org.apache.pdfbox.pdmodel.font.PDType1Font
> method
> org.apache.pdfbox.pdmodel.interactive.form.PDAcroForm.verifyOrCreateDefaults()
> Call path from entry point to
> org.apache.pdfbox.pdmodel.interactive.form.PDAcroForm.verifyOrCreateDefaults():
>
> at
> org.apache.pdfbox.pdmodel.interactive.form.PDAcroForm.verifyOrCreateDefaults(PDAcroForm.java:106)
> at
> org.apache.pdfbox.pdmodel.interactive.form.PDAcroForm.<init>(PDAcroForm.java:93)
> at
> org.apache.pdfbox.pdmodel.PDDocumentCatalog.getAcroForm(PDDocumentCatalog.java:108)
> at org.apache.tika.parser.pdf.PDFParser.handleXFAOnly(PDFParser.java:534)
> at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:164)
>
> See also
> [https://medium.com/graalvm/understanding-class-initialization-in-graalvm-native-image-generation-d765b7e4d6ed]
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)