[
https://issues.apache.org/jira/browse/PDFBOX-2293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr resolved PDFBOX-2293.
-------------------------------------
Resolution: Fixed
I'm setting this one to resolved (because I made a slight change in the logs)
after not getting an answer. However, the problem is probably fixed too thanks
to the changes Andreas made in PDFBOX-2250. Please reopen if you still get an
error, but then do also set yourself as a watcher on PDFBOX-2441.
> NonSequential parser gives an error
> -----------------------------------
>
> Key: PDFBOX-2293
> URL: https://issues.apache.org/jira/browse/PDFBOX-2293
> Project: PDFBox
> Issue Type: Bug
> Affects Versions: 2.0.0
> Environment: Linux, JDK 1.6
> Reporter: v gangolli
> Fix For: 2.0.0
>
>
> I get the following error when using the sequential parse with Pdfbox 1.8.5.
> {code}
> expected='endstream' actual=''
> org.apache.pdfbox.io.PushBackInputStream@eb43bd5: java.io.IOException: at
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSStream(BaseParser.java:628)
> [pdfbox-1.8.5.jar:]
> at
> org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:605)
> [pdfbox-1.8.5.jar:]
> at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:194)
> [pdfbox-1.8.5.jar:]
> at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1220)
> [pdfbox-1.8.5.jar:]
> at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1187)
> [pdfbox-1.8.5.jar:]
> at
> org.apache.pdfbox.util.PDFMergerUtility.mergeDocuments(PDFMergerUtility.java:236)
> [pdfbox-1.8.5.jar:]
> at
> org.apache.pdfbox.util.PDFMergerUtility.mergeDocuments(PDFMergerUtility.java:185)
> [pdfbox-1.8.5.jar:]
> {code}
> After looking at some of the fixed issues reported for similar problem(s), I
> have tried using PDFBox 2.0.0 built from the latest repository code and the
> nonsequential parser for the pdf processing. However, the file created as
> randomAccessFile seems to get damaged (cannot be opened in Acrobat Reader
> after the run) when I use PDFbox 2.0.0 for my processing.
> I am unable to attach a sample file because of privacy concerns for the
> content. I also get an error and am not able to generate the merged output.
> The code snippet is as follows-
> {code}
> for (String fName : fileList) {
> pd = null;
> File pdFile = new File(fName);
> fNameStr = fName.substring(0, fName.lastIndexOf('.'))
> + "_new.pdf";
> InputStream is = new FileInputStream(pdFile);
> RandomAccessFile raf = new RandomAccessFile(pdFileNew, "rws");
> pd = PDDocument.loadNonSeq(is, raf );
> pd.getDocumentCatalog();
> pd.save(fNameStr);
> pd.close();
> if (is != null) {
> is.close();
> }
> if(raf != null) {
> raf.close();
> }
> ut.addSource(fNameStr);
> }
> FileOutputStream fos = new FileOutputStream(outFileName);
> ut.setDestinationStream(fos);
> ut.setIgnoreAcroFormErrors(true);
> ut.mergeDocuments();
> fos.close();
> {code}
> Thank You.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)