[
https://issues.apache.org/jira/browse/PDFBOX-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Daniel Scheibe updated PDFBOX-2350:
-----------------------------------
Description:
When rendering the first page of my pdf document the Type1Parser
(org.apache.fontbox.type1.Type1Parser) hangs in a loop in
{{parseBinary(byte[] bytes) throws IOException}}
and "kills" our rendering pipeline. Please find the loop that hangs below:
// find /Private dict
while (!lexer.peekToken().getText().equals("Private"))
{
lexer.nextToken();
}
There is no token named "Private" ever in the list of returned tokens (they're
empty all the time).
Furthermore going deeper into the source code it seems the class reading the
tokens (Type1Lexer) does never finally advance the buffer position and always
returns an empty name token in the readToken(Token prevToken) method.
Looking at the decrypted buffer i cannot get something useful out of it based
on my current understanding.
Unfortunately i cannot provide the pdf in question as it contains confidental
data.
Acrobat Reader XI Version 11.0.08 renders the document just fine.
In addition it seems the pdf was encrypted (40-Bit RC4) with an empty password
and says it's pdf version 1.5.
Does this provide enough information or can i do anything else to help nailing
this one down?
I guess this might be a pdf document structure/feature that is not yet
supported completely but i at least pdfbox should throw an exception instead of
failing "silently"...
was:
When rendering the first page of my pdf document the Type1Parser
(org.apache.fontbox.type1.Type1Parser) hangs in a loop in parseBinary(byte[]
bytes) throws IOException and "kills" our rendering pipeline. Please find the
loop that hangs below:
// find /Private dict
while (!lexer.peekToken().getText().equals("Private"))
{
lexer.nextToken();
}
There is no token named "Private" ever in the list of returned tokens (they're
empty all the time).
Furthermore going deeper into the source code it seems the class reading the
tokens (Type1Lexer) does never finally advance the buffer position and always
returns an empty name token in the readToken(Token prevToken) method.
Looking at the decrypted buffer i cannot get something useful out of it based
on my current understanding.
Unfortunately i cannot provide the pdf in question as it contains confidental
data.
Acrobat Reader XI Version 11.0.08 renders the document just fine.
In addition it seems the pdf was encrypted (40-Bit RC4) with an empty password
and says it's pdf version 1.5.
Does this provide enough information or can i do anything else to help nailing
this one down?
I guess this might be a pdf document structure/feature that is not yet
supported completely but i at least pdfbox should throw an exception instead of
failing "silently"...
> Type1 Parser hangs indefinitely
> -------------------------------
>
> Key: PDFBOX-2350
> URL: https://issues.apache.org/jira/browse/PDFBOX-2350
> Project: PDFBox
> Issue Type: Bug
> Components: FontBox
> Affects Versions: 2.0.0
> Environment: Windows 7, JDK 1.7.0_51-b13
> Reporter: Daniel Scheibe
>
> When rendering the first page of my pdf document the Type1Parser
> (org.apache.fontbox.type1.Type1Parser) hangs in a loop in
> {{parseBinary(byte[] bytes) throws IOException}}
> and "kills" our rendering pipeline. Please find the loop that hangs below:
> // find /Private dict
> while (!lexer.peekToken().getText().equals("Private"))
> {
> lexer.nextToken();
> }
> There is no token named "Private" ever in the list of returned tokens
> (they're empty all the time).
> Furthermore going deeper into the source code it seems the class reading the
> tokens (Type1Lexer) does never finally advance the buffer position and always
> returns an empty name token in the readToken(Token prevToken) method.
> Looking at the decrypted buffer i cannot get something useful out of it based
> on my current understanding.
> Unfortunately i cannot provide the pdf in question as it contains confidental
> data.
> Acrobat Reader XI Version 11.0.08 renders the document just fine.
> In addition it seems the pdf was encrypted (40-Bit RC4) with an empty
> password and says it's pdf version 1.5.
> Does this provide enough information or can i do anything else to help
> nailing this one down?
> I guess this might be a pdf document structure/feature that is not yet
> supported completely but i at least pdfbox should throw an exception instead
> of failing "silently"...
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)