[ 
https://issues.apache.org/jira/browse/PDFBOX-466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797110#action_12797110
 ] 

anybudy commented on PDFBOX-466:
--------------------------------

I am using the .NET version of 0.8.0-incubator (it was hard to collect the 
required files). Our PDF supplier changed the PDF format, and I need to extract 
text from PDF files created by Crystal Reports (PDF version 1.6, Acrobat 7.x). 
I am getting the exception below. Could this be the same problem? Could you 
please help me with this? I have no Java knowledge.

org.apache.pdfbox.exceptions.WrappedIOException was unhandled
StackTrace:
       at org.apache.pdfbox.pdfparser.PDFParser.parse()
       at org.apache.pdfbox.pdmodel.PDDocument.load(InputStream input, RandomAccess scratchFile)
       at org.apache.pdfbox.pdmodel.PDDocument.load(InputStream input)
       at org.apache.pdfbox.pdmodel.PDDocument.load(String filename)
       at BetMatik.pdfManipulationClass.readPDF(String fileName, String date) in ....
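
The trace above shows only the outer WrappedIOException, so it does not say whether the underlying error is the "expected='obj'" one from the issue description. A minimal sketch of how the chain of causes could be printed follows. It assumes the wrapper keeps the original error as its cause, which I have not checked for this release. The class name CauseChain and both helper methods are hypothetical, and a plain IOException stands in for the real PDFBox exception so the snippet runs without PDFBox on the classpath. The .NET build may expose the same information in a different way.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of PDFBox.
public class CauseChain {

    /** Returns one "ClassName: message" line per exception, outermost first. */
    static List<String> describe(Throwable t) {
        List<String> lines = new ArrayList<>();
        for (Throwable cur = t; cur != null; cur = cur.getCause()) {
            lines.add(cur.getClass().getName() + ": " + cur.getMessage());
        }
        return lines;
    }

    /** Follows getCause() until it reaches the innermost exception. */
    static Throwable rootCause(Throwable t) {
        Throwable cur = t;
        while (cur.getCause() != null) {
            cur = cur.getCause();
        }
        return cur;
    }

    public static void main(String[] args) {
        // Stand-ins for the real failure: an inner parse error wrapped by an
        // outer exception, as assumed in the paragraph above.
        IOException inner = new IOException("expected='obj' actual='000'");
        IOException outer = new IOException("wrapper (stand-in)", inner);

        for (String line : describe(outer)) {
            System.out.println(line);
        }
        System.out.println("root cause: " + rootCause(outer).getMessage());
    }
}
```

In real use, the exception caught around the PDDocument.load call would be passed to describe in place of the stand-in. If the innermost message matches the one quoted in the issue, it is likely the same problem.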


Thank you very much.



> error parsing files generated by crystal reports
> ------------------------------------------------
>
>                 Key: PDFBOX-466
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-466
>             Project: PDFBox
>          Issue Type: Bug
>          Components: FontBox
>            Reporter: Sean Bridges
>             Fix For: 0.8.0-incubator
>
>         Attachments: patch, patch2, patch2_again
>
>
> This is with the latest from svn, Revision: 773978
> From a sample of 13304 pdf documents generated in a very wide variety of 
> ways, I got 200 exceptions with the stack trace,
> Caused by: java.io.IOException: expected='obj' actual='000' org.apache.pdfbox.io.pushbackinputstr...@1049d3
>       at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:471)
>       at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:169)
>       at message_analyzer.extractor.PDFExtractor.getContent(PDFExtractor.java:32)
>       ... 2 more
> I can't give an example file, but the pdfs are all generated by crystal 
> reports.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
