[ https://issues.apache.org/jira/browse/PDFBOX-1668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13734894#comment-13734894 ]
Tilman Hausherr commented on PDFBOX-1668:
-----------------------------------------
I prefer "swallow it and add a warning message" because:
1) People won't understand why an exception is thrown ("but it displays
in Acrobat Reader"), and they'll open lots of new tickets here.
2) The PDFs processed by PDFBox are usually not created by ourselves. They are
created by software over which we have no influence; sometimes the files are
many years old. I have used PDFBox mostly for migration projects, where the
customers want to get TIFF files instead.
Of course the best solution would be a configurable error strategy, where
people can choose between "abort immediately" and "try to recover gracefully".
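To make the idea of a configurable error strategy concrete, here is a minimal
sketch in plain Java. It does not use PDFBox at all: ErrorStrategySketch,
ErrorStrategy, MalformedObjectException, ParseResult and parseAll are all
hypothetical names invented for illustration, and parsing integers from
strings only stands in for parsing objects from a damaged PDF. The exception
is unchecked purely to keep the sketch short.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch; none of these types exist in PDFBox.
class ErrorStrategySketch {

    /** The two behaviours named in the comment above. */
    enum ErrorStrategy { ABORT_IMMEDIATELY, RECOVER_GRACEFULLY }

    /** Stand-in for a parse failure (assumption: unchecked for brevity). */
    static class MalformedObjectException extends RuntimeException {
        MalformedObjectException(String message) {
            super(message);
        }
    }

    /** What the caller gets back: parsed values plus any swallowed problems. */
    static class ParseResult {
        final List<Integer> values = new ArrayList<>();
        final List<String> warnings = new ArrayList<>();
    }

    /**
     * Parses each token as an integer. On a malformed token the strategy
     * decides: throw at once, or record a warning and skip the token.
     */
    static ParseResult parseAll(List<String> tokens, ErrorStrategy strategy) {
        ParseResult result = new ParseResult();
        for (int i = 0; i < tokens.size(); i++) {
            String token = tokens.get(i);
            try {
                result.values.add(Integer.parseInt(token));
            } catch (NumberFormatException e) {
                String message = "Malformed token at index " + i + ": '" + token + "'";
                if (strategy == ErrorStrategy.ABORT_IMMEDIATELY) {
                    throw new MalformedObjectException(message);
                }
                // "Swallow it and add a warning message".
                result.warnings.add(message);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList("1", "2", "oops", "4");

        ParseResult lenient = parseAll(tokens, ErrorStrategy.RECOVER_GRACEFULLY);
        System.out.println("values=" + lenient.values);
        for (String warning : lenient.warnings) {
            System.out.println("WARNING: " + warning);
        }

        try {
            parseAll(tokens, ErrorStrategy.ABORT_IMMEDIATELY);
        } catch (MalformedObjectException e) {
            System.out.println("aborted: " + e.getMessage());
        }
    }
}
```

With this shape, a migration job could run with RECOVER_GRACEFULLY and review
the collected warnings afterwards, while a validation tool could pick
ABORT_IMMEDIATELY and fail on the first problem.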
> Loading a Russian PDF never finishes
> -------------------------------------
>
> Key: PDFBOX-1668
> URL: https://issues.apache.org/jira/browse/PDFBOX-1668
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.0.0
> Reporter: Sergio Fernández
> Priority: Minor
>
> Try to run this line:
> PDDocument.load(new URL("http://www.who.int/entity/foodsafety/publications/general/en/global_strategy_ru.pdf"));
> The loading never finishes... taking a lot of CPU.
> The document size (574K) should not be the problem. I guess something in that
> document triggers the issue in PDFBox, and I'd like to know whether this could
> be a more general problem.
> Thanks!