[ https://issues.apache.org/jira/browse/PDFBOX-1174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13160766#comment-13160766 ]
Timo Boehme commented on PDFBOX-1174:
-------------------------------------
For illustration only, here is a snippet from a PDF that contains garbage
(remnants of old objects, which makes it hard to find the next valid object
start; I've deactivated the search for 'stream' in PDFParser#skipToNextObj
because it stopped at every 'endstream'):
</rdf:Description>
</rdf:RDF>
</x:xmpmeta>
<?xpacket end="r"?>
endstream
endobj
<</Filter/FlateDecode/First 13/Length 108/N 2/Type/ObjStm>>stream
...K....qz'..w...
3P.
p..K....0Q".ՋN;..A&.....
...#).9m.ƿL,/.>.....@..|..B..
ո..
..V...5.=.J.......g{.Oq.
endstream
endobj
<0C60053F04D4C65448AD9638BA1EB781591C15E992BA7C448D75> >stream
{.#H
.....p...c.T.....b.F}g˫.Z.3)1.n..&.:...A.>[email protected]ʻ[email protected].)..!]..
..
.T.^o....u^
....\....^..
endstream
endobj
/Filter/FlateDecode/First 13/Length 108/N 2/Type/ObjStm>>stream
5..3..~.M^j
.;sX9..i(...y.......m.!SL.R.'.W.:H!;G..c...(O.W^./[email protected].Γ{..i.8.XK..ư............eK.Q....a.
endstream
endobj
Filter/FlateDecode/First 13/Length 108/N 2/Type/ObjStm>>stream
A...ck...ޘ....;.Ÿa.. GwN...w.NCӛ..+.n~......Y., X.Q.@-.
.E..S**.....*6#..d]...#.ȷ..&+;0...С.m...*....
endstream
endobj
lter/FlateDecode/First 13/Length 108/N 2/Type/ObjStm>>stream
..iN......xj?{0<&m....W.#...Z].d!....!...b..>.o.N..Gq28.K.W.k......y f...9
..u{....i..xV ..I....(=6M..W.
endstream
endobj
The first endstream/endobj pair is the regular end of an object; everything
after it is garbage. The PDF was produced by verypdf.com.
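
To illustrate why a plain search for 'stream' stops at every 'endstream', here
is a minimal sketch of a scan that skips a match when it is only the tail of
'endstream'. The class StreamKeywordScan and its methods are hypothetical
names made up for this example; this is not the code in
PDFParser#skipToNextObj, and it does nothing about the truncated dictionaries
shown in the snippet.

```java
import java.nio.charset.StandardCharsets;

/** Hypothetical helper for illustration only; not part of PDFBox. */
final class StreamKeywordScan {

    private static final byte[] STREAM = "stream".getBytes(StandardCharsets.ISO_8859_1);
    private static final byte[] END = "end".getBytes(StandardCharsets.ISO_8859_1);

    private StreamKeywordScan() {
    }

    /**
     * Returns the offset of the next 'stream' keyword at or after 'from'
     * that is not the tail of 'endstream', or -1 if there is none.
     */
    static int indexOfStreamKeyword(byte[] data, int from) {
        for (int i = Math.max(from, 0); i + STREAM.length <= data.length; i++) {
            if (!matchesAt(data, i, STREAM)) {
                continue;
            }
            // A naive search would return here even for 'endstream'.
            boolean tailOfEndstream = matchesAt(data, i - END.length, END);
            if (!tailOfEndstream) {
                return i;
            }
        }
        return -1;
    }

    /** True if 'word' occurs in 'data' starting exactly at 'pos'. */
    private static boolean matchesAt(byte[] data, int pos, byte[] word) {
        if (pos < 0 || pos + word.length > data.length) {
            return false;
        }
        for (int k = 0; k < word.length; k++) {
            if (data[pos + k] != word[k]) {
                return false;
            }
        }
        return true;
    }
}
```

Applied to the bytes above, such a scan would pass over each 'endstream' and
stop only at the 'stream' that follows a '>>'. Whether that position is the
start of a usable object is a separate question, since the garbage dictionaries
in front of it are incomplete.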
> i have problem in BaseParser.readInt
> -------------------------------------
>
> Key: PDFBOX-1174
> URL: https://issues.apache.org/jira/browse/PDFBOX-1174
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing, PDModel
> Affects Versions: 1.6.0
> Reporter: ahmad makram
>
> I can't load a PDF with PDDocument.load().
> It gives me this exception:
> java.io.IOException: Error: Expected an integer type, actual='Fatal'
> at org.apache.pdfbox.pdfparser.BaseParser.readInt(BaseParser.java:1384)
> at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:517)
> at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:184)
> at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1069)
> at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1036)
> at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1007)