[ https://issues.apache.org/jira/browse/TIKA-2862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sergey Beryozkin updated TIKA-2862:
-----------------------------------
Description:
PDF Parser is not Graal native mode ready yet. The following error is reported
when it is processed as part of a Quarkus native mode build:
{noformat}
Error: Detected a FileDescriptor in the image heap. ....You can manually delay
class initialization to image run time by using the option
--delay-class-initialization-to-runtime=<class-name>. ...
Detailed message:
Trace: object org.apache.fontbox.ttf.BufferedRandomAccessFile
object org.apache.fontbox.ttf.RAFDataStream
object org.apache.fontbox.ttf.TrueTypeFont
object org.apache.pdfbox.pdmodel.font.PDType1Font
method org.apache.pdfbox.pdmodel.interactive.form.PDAcroForm.verifyOrCreateDefaults()
Call path from entry point to org.apache.pdfbox.pdmodel.interactive.form.PDAcroForm.verifyOrCreateDefaults():
at org.apache.pdfbox.pdmodel.interactive.form.PDAcroForm.verifyOrCreateDefaults(PDAcroForm.java:106)
at org.apache.pdfbox.pdmodel.interactive.form.PDAcroForm.<init>(PDAcroForm.java:93)
at org.apache.pdfbox.pdmodel.PDDocumentCatalog.getAcroForm(PDDocumentCatalog.java:108)
at org.apache.tika.parser.pdf.PDFParser.handleXFAOnly(PDFParser.java:534)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:164)
{noformat}
See also
[https://medium.com/graalvm/understanding-class-initialization-in-graalvm-native-image-generation-d765b7e4d6ed]
> Make PDF Parser Graal native mode ready
> ----------------------------------------
>
> Key: TIKA-2862
> URL: https://issues.apache.org/jira/browse/TIKA-2862
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Affects Versions: 1.20
> Reporter: Sergey Beryozkin
> Assignee: Sergey Beryozkin
> Priority: Major
>
> PDF Parser is not Graal native mode ready yet. The following error is reported
> when it is processed as part of a Quarkus native mode build:
> {noformat}
> Error: Detected a FileDescriptor in the image heap. ....You can manually
> delay class initialization to image run time by using the option
> --delay-class-initialization-to-runtime=<class-name>. ...
> Detailed message:
> Trace: object org.apache.fontbox.ttf.BufferedRandomAccessFile
> object org.apache.fontbox.ttf.RAFDataStream
> object org.apache.fontbox.ttf.TrueTypeFont
> object org.apache.pdfbox.pdmodel.font.PDType1Font
> method org.apache.pdfbox.pdmodel.interactive.form.PDAcroForm.verifyOrCreateDefaults()
> Call path from entry point to org.apache.pdfbox.pdmodel.interactive.form.PDAcroForm.verifyOrCreateDefaults():
> at org.apache.pdfbox.pdmodel.interactive.form.PDAcroForm.verifyOrCreateDefaults(PDAcroForm.java:106)
> at org.apache.pdfbox.pdmodel.interactive.form.PDAcroForm.<init>(PDAcroForm.java:93)
> at org.apache.pdfbox.pdmodel.PDDocumentCatalog.getAcroForm(PDDocumentCatalog.java:108)
> at org.apache.tika.parser.pdf.PDFParser.handleXFAOnly(PDFParser.java:534)
> at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:164)
> {noformat}
> See also
> [https://medium.com/graalvm/understanding-class-initialization-in-graalvm-native-image-generation-d765b7e4d6ed]
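
The trace above shows an object that wraps an open file (BufferedRandomAccessFile) being reachable while the image is built, which is what puts a FileDescriptor in the image heap. As a rough illustration of the general pattern only, here is a minimal Java sketch that keeps just the file path as state and opens the file on first use. The LazyFile class and its demo() helper are hypothetical names made up for this example; they are not PDFBox, FontBox, or Tika code, and this is not necessarily how TIKA-2862 would be resolved.

```java
import java.io.File;
import java.io.IOException;
import java.io.RandomAccessFile;
import java.io.UncheckedIOException;

/** Hypothetical illustration only; not PDFBox, FontBox, or Tika code. */
public final class LazyFile {
    private final String path;        // plain data: creating it opens nothing
    private RandomAccessFile handle;  // opened on first use, not at construction
    private int openCount;            // how many times the file was actually opened

    public LazyFile(String path) {
        this.path = path;
    }

    /** Opens the file on the first call and reuses the handle afterwards. */
    public synchronized RandomAccessFile get() {
        if (handle == null) {
            try {
                handle = new RandomAccessFile(path, "r");
                openCount++;
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }
        return handle;
    }

    public synchronized int openCount() {
        return openCount;
    }

    /** Closes the handle if it was opened; a later get() would reopen it. */
    public synchronized void close() {
        if (handle != null) {
            try {
                handle.close();
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
            handle = null;
        }
    }

    /** Self-check: reports the open count before and after two get() calls. */
    public static String demo() {
        try {
            File tmp = File.createTempFile("font", ".bin");
            LazyFile lazy = new LazyFile(tmp.getPath());
            int before = lazy.openCount();
            lazy.get();
            lazy.get();
            int after = lazy.openCount();
            lazy.close();
            tmp.delete();
            return "before=" + before + " after=" + after;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(demo()); // prints "before=0 after=1"
    }
}
```

A static field of this type holds only a String until get() is called, so initializing the owning class does not by itself create a file handle. The alternative named in the error message is to leave the code unchanged and delay initialization of the affected class to image run time.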
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)