[
https://issues.apache.org/jira/browse/PDFBOX-4296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16776202#comment-16776202
]
Tilman Hausherr commented on PDFBOX-4296:
-----------------------------------------
It could also be that PDFBox has improved, we did some optimization in
PDICCBased. Because of that, please test with a snapshot
https://repository.apache.org/content/groups/snapshots/org/apache/pdfbox/pdfbox-app/2.0.14-SNAPSHOT/
> Question: Performance
> ---------------------
>
> Key: PDFBOX-4296
> URL: https://issues.apache.org/jira/browse/PDFBOX-4296
> Project: PDFBox
> Issue Type: Improvement
> Components: Rendering
> Affects Versions: 2.0.11
> Reporter: Daniel Persson
> Priority: Trivial
> Labels: optimization, performance
>
> Hi Team.
> We use a tool we built using PDFBox to extract text for about 10k pages per
> day. Then we have another tool to extract images using Poppler.
> We want to use PDFBox for both tasks but sadly we see a performance hit using
> PDFBox in the order of 3 times.
> Do you have any backlog / technical dept / ideas on how to improve
> performance?
> We have tried -Dorg.apache.pdfbox.rendering.UsePureJavaCMYKConversion=true
> and that made image generation much slower.
> We have set System.setProperty("sun.java2d.cmm",
> "sun.java2d.cmm.kcms.KcmsServiceProvider") in code.
> We use image libraries from twelvemonkeys, pdfbox and the standard jai
> project.
> I've read in the code that we do double writes for images using transparency
> which might be a culprit.
> I have been allowed to put some time into the project if we have some solid
> leads or a roadmap to reach better performance.
> Hope it's okay to track this issue here instead of a question on the mailing
> list.
> Best regards
> Daniel
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]