[ https://issues.apache.org/jira/browse/PDFBOX-466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797110#action_12797110 ]
anybudy commented on PDFBOX-466:
--------------------------------
I am using the .NET version of 0.8.0-incubator (it was hard to collect the
required files). My PDF supplier changed the PDF format, and I need to extract
text from PDF files created by Crystal Reports (PDF version 1.6, Acrobat 7.x).
I am getting an exception. I think it is the same problem. Could you please
help me with this? I have no Java knowledge.
org.apache.pdfbox.exceptions.WrappedIOException was unhandled
StackTrace:
at org.apache.pdfbox.pdfparser.PDFParser.parse()
at org.apache.pdfbox.pdmodel.PDDocument.load(InputStream input, RandomAccess scratchFile)
at org.apache.pdfbox.pdmodel.PDDocument.load(InputStream input)
at org.apache.pdfbox.pdmodel.PDDocument.load(String filename)
at BetMatik.pdfManipulationClass.readPDF(String fileName, String date)
in ....
Thank you very much.
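
Note on confirming whether this is the same failure: the trace above stops at
WrappedIOException, while the original report below identifies the problem by
its "Caused by" message. One way to see that inner message is to walk the
exception's cause chain. The following is only a minimal sketch in plain Java,
using nothing but the standard Throwable.getCause(); the class name CauseChain
and the method describe are hypothetical, and the exceptions built in main are
stand-ins rather than anything thrown by PDFBox.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

public class CauseChain {
    // Hypothetical helper: returns "ClassName: message" for the given
    // exception and for every nested cause, outermost first.
    public static List<String> describe(Throwable t) {
        List<String> lines = new ArrayList<>();
        int depth = 0;
        // The depth limit guards against a cause chain that loops back on itself.
        while (t != null && depth < 32) {
            lines.add(t.getClass().getName() + ": " + t.getMessage());
            t = t.getCause();
            depth++;
        }
        return lines;
    }

    public static void main(String[] args) {
        // Stand-in exceptions (assumption): in real use, pass the exception
        // caught around the PDDocument.load(...) call instead.
        IOException root = new IOException("expected='obj' actual='000'");
        IOException wrapper = new IOException("wrapped", root);
        for (String line : describe(wrapper)) {
            System.out.println(line);
        }
    }
}
```

If the innermost line reads like expected='obj' actual='...', the failure
matches the one described in this issue; otherwise it is probably a different
problem.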
> error parsing files generated by crystal reports
> ------------------------------------------------
>
> Key: PDFBOX-466
> URL: https://issues.apache.org/jira/browse/PDFBOX-466
> Project: PDFBox
> Issue Type: Bug
> Components: FontBox
> Reporter: Sean Bridges
> Fix For: 0.8.0-incubator
>
> Attachments: patch, patch2, patch2_again
>
>
> This is with the latest from svn, Revision: 773978
> From a sample of 13304 pdf documents generated in a very wide variety of
> ways, I got 200 exceptions with the stack trace,
> Caused by: java.io.IOException: expected='obj' actual='000'
> org.apache.pdfbox.io.pushbackinputstr...@1049d3
> at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:471)
> at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:169)
> at message_analyzer.extractor.PDFExtractor.getContent(PDFExtractor.java:32)
> ... 2 more
> I can't give an example file, but the PDFs are all generated by Crystal
> Reports.