[ 
https://issues.apache.org/jira/browse/PDFBOX-1918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13901218#comment-13901218
 ] 

Tilman Hausherr commented on PDFBOX-1918:
-----------------------------------------

Your file is broken. Open the PDF in a text editor such as Notepad++. At the 
bottom, there is

startxref
83636
%%EOF

Now go to byte position 83636. That should be where "xref" starts, but it is 
not. The xref table actually starts at position 87885.
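
The same check can be scripted instead of done by hand. What follows is only a 
rough sketch in Python, not anything from PDFBox: the function name 
check_startxref is made up for illustration, and it assumes a classic 
cross-reference table that begins with the keyword "xref" (files that use 
cross-reference streams have an object at that offset instead, so this check 
would not apply to them).

```python
import re


def check_startxref(data: bytes):
    """Return (declared_offset, points_at_xref, actual_xref_offsets)."""
    # The last "startxref" keyword is followed by the declared byte offset.
    idx = data.rfind(b"startxref")
    if idx == -1:
        raise ValueError("no startxref keyword found")
    m = re.match(rb"startxref\s+(\d+)", data[idx:])
    if m is None:
        raise ValueError("startxref is not followed by a number")
    declared = int(m.group(1))

    # A classic cross-reference table begins with the keyword "xref".
    points_at_xref = data[declared:declared + 4] == b"xref"

    # Offsets of every "xref" keyword that is not part of "startxref".
    actual = [hit.start() for hit in re.finditer(rb"(?<!start)xref", data)]
    return declared, points_at_xref, actual
```

Reading the file in binary mode and passing its bytes to check_startxref gives 
the declared offset, whether "xref" is really there, and where "xref" keywords 
actually occur. For a file damaged like this one, the second value would be 
False and the real table position would show up in the third.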

If you click on your own link with firefox, you will notice that the built-in 
viewer can't display the file either.

Having said that, in version 2.0 the parser should be more "lenient", but it 
isn't, because of the exception that happens. This may or may not be a bug; 
let's wait for a comment from someone else.

> PDF convert error
> -----------------
>
>                 Key: PDFBOX-1918
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1918
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 1.8.4
>            Reporter: Jr. John
>
> The current version, 1.8.4, has the same problem.
> D:\Software\pdfbox>java -jar pdfbox-app-1.8.4.jar ConvertColorspace 
> rpt1390780234888753.pdf test.pdf
> Feb 07, 2014 4:59:11 PM org.apache.pdfbox.pdfparser.BaseParser parseCOSStream
> WARNING: Specified stream length 15353 is wrong. Fall back to reading stream 
> until 'endstream'.
> Feb 07, 2014 4:59:11 PM org.apache.pdfbox.pdfparser.BaseParser parseCOSStream
> WARNING: Specified stream length 12156 is wrong. Fall back to reading stream 
> until 'endstream'.
> Feb 07, 2014 4:59:11 PM org.apache.pdfbox.pdfparser.XrefTrailerResolver setStartxref
> WARNING: Did not found XRef object at specified startxref position 83636
> ConvertColorspace failed with the following exception:
> java.io.IOException: Missing closing bracket for hex string. Reached EOS.
> at org.apache.pdfbox.pdfparser.BaseParser.parseCOSHexString(BaseParser.java:1023)
> at org.apache.pdfbox.pdfparser.BaseParser.parseCOSString(BaseParser.java:816)
> at org.apache.pdfbox.pdfparser.PDFStreamParser.parseNextToken(PDFStreamParser.java:259)
> at org.apache.pdfbox.pdfparser.PDFStreamParser.parse(PDFStreamParser.java:133)
> at org.apache.pdfbox.ConvertColorspace.replaceColors(ConvertColorspace.java:88)
> at org.apache.pdfbox.ConvertColorspace.main(ConvertColorspace.java:385)
> at org.apache.pdfbox.PDFBox.main(PDFBox.java:46)
> The PDF that shows the issue is here:
> https://www.dropbox.com/s/prx5s2ivt8wtztw/rpt1390780234888753.pdf



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
