[ https://issues.apache.org/jira/browse/PDFBOX-4778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Andreas Lehmkühler updated PDFBOX-4778:
---------------------------------------
Description:
[~tallison] found a bunch of problematic [pdfs|https://bugs.ghostscript.com/show_bug.cgi?id=695040]
and processing one of them is very slow or never finishes.
The attached pdf has a broken content stream, so a lot of matrix operations
are skipped due to missing values, which leads to illegal values when multiplying
matrices. Some of the calculated text positions are broken and in the end text
extraction with sorting is very slow or never finishes.
was:
[~tallison] found a bunch of problematic [pdfs|
https://bugs.ghostscript.com/show_bug.cgi?id=695040] and one of them runs very
slow/forever.
The attached pdf has a broken content stream so that a lot of matrix operations
are skipped due to missing values which lead to illegal values when multiplying
matrices. Some of the calculated test positions are broken and in the end text
extraction with sorting is very slow or runs infinite.
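
As a rough illustration of the failure mode in the description, the sketch below multiplies two 3x3 matrices and then checks the result for illegal entries. It assumes that "illegal values" means NaN or infinite floats, which the issue does not spell out. The class MatrixSanity and all of its methods are hypothetical names made up for this example; this is not PDFBox code and not the fix that was committed.

```java
import java.util.Arrays;

/** Hypothetical helper, not part of PDFBox: 3x3 matrices stored row-major in a float[9]. */
final class MatrixSanity {

    private MatrixSanity() {
    }

    /** Returns a new 3x3 identity matrix. */
    static float[] identity() {
        return new float[] {1, 0, 0, 0, 1, 0, 0, 0, 1};
    }

    /** Plain 3x3 multiplication a * b, with no validation of the result. */
    static float[] multiply(float[] a, float[] b) {
        float[] c = new float[9];
        for (int row = 0; row < 3; row++) {
            for (int col = 0; col < 3; col++) {
                float sum = 0f;
                for (int k = 0; k < 3; k++) {
                    sum += a[row * 3 + k] * b[k * 3 + col];
                }
                c[row * 3 + col] = sum;
            }
        }
        return c;
    }

    /** True if no entry is NaN or infinite (the assumed meaning of "legal"). */
    static boolean isFinite(float[] m) {
        for (float v : m) {
            if (Float.isNaN(v) || Float.isInfinite(v)) {
                return false;
            }
        }
        return true;
    }

    /** Multiplies, but rejects a result that contains NaN or infinity. */
    static float[] multiplyChecked(float[] a, float[] b) {
        float[] c = multiply(a, b);
        if (!isFinite(c)) {
            throw new IllegalArgumentException("illegal matrix value in " + Arrays.toString(c));
        }
        return c;
    }

    public static void main(String[] args) {
        // Scale factors large enough to overflow float when multiplied together.
        float[] huge = {Float.MAX_VALUE, 0, 0, 0, Float.MAX_VALUE, 0, 0, 0, 1};
        float[] product = multiply(huge, huge);
        System.out.println("finite after multiply: " + isFinite(product));
        try {
            multiplyChecked(huge, huge);
        } catch (IllegalArgumentException e) {
            System.out.println("rejected: " + e.getMessage());
        }
    }
}
```

Once a single entry has overflowed, later multiplications spread the damage (infinity times zero is NaN), and NaN coordinates cannot be ordered consistently. Checking right after the multiplication, as multiplyChecked does here, is one possible place to stop the bad value before it reaches position-based sorting.
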
> Avoid illegal matrix values
> ---------------------------
>
> Key: PDFBOX-4778
> URL: https://issues.apache.org/jira/browse/PDFBOX-4778
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.0.18, 3.0.0 PDFBox
> Reporter: Andreas Lehmkühler
> Assignee: Andreas Lehmkühler
> Priority: Major
> Fix For: 2.0.20, 3.0.0 PDFBox
>
> Attachments: hang-090214-015108-51.pdf
>