Merging multiple PDF into HTTP output stream

Jörn Haferstroh Fri, 06 Dec 2013 16:03:07 -0800

Hi,

First, let me give some credit to the developers of pdfbox for this very usable tool. Please continue your work, guys!

I have a web application storing lots of PDF documents in a database. For easier bulk download and printing, I am using pdfbox to merge multiple PDF documents into one large PDF document for download. The destination stream of the merge is the HTTP output stream, so the merged PDF data goes directly to the requesting web client.

Today I learned from a "too many open files" error that pdfbox creates a temporary file for each source input stream and keeps it open until the end of the merge process (I tried to merge 1025 PDF sources into one PDF on a Linux box). Is this behaviour necessary, perhaps because of the PDF format? In any case, I was able to work around it by increasing the open file limit of the user.
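
To see how close a merge gets to the limit before it fails, the JVM can report its own descriptor usage. The following is only a minimal sketch: the class name FdUsage and the method openAndMax are made up for illustration, and it assumes a HotSpot/OpenJDK runtime on a Unix-like system, where the platform MXBean implements com.sun.management.UnixOperatingSystemMXBean. On other runtimes it simply reports -1.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.OperatingSystemMXBean;

/** Hypothetical helper (not part of pdfbox): reports file descriptor usage of this JVM. */
class FdUsage {

    /**
     * Returns {open, max} descriptor counts for the current process,
     * or {-1, -1} if this JVM does not expose them.
     */
    static long[] openAndMax() {
        OperatingSystemMXBean os = ManagementFactory.getOperatingSystemMXBean();
        // Assumption: only Unix-like HotSpot/OpenJDK runtimes provide this interface.
        if (os instanceof com.sun.management.UnixOperatingSystemMXBean) {
            com.sun.management.UnixOperatingSystemMXBean unix =
                    (com.sun.management.UnixOperatingSystemMXBean) os;
            return new long[] {
                unix.getOpenFileDescriptorCount(),
                unix.getMaxFileDescriptorCount()
            };
        }
        return new long[] { -1, -1 };
    }

    public static void main(String[] args) {
        long[] usage = openAndMax();
        System.out.println("open descriptors: " + usage[0] + ", limit: " + usage[1]);
    }
}
```

Logging these two numbers before and after a merge of a few dozen sources should show whether the open count really grows by about one per source.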

When does pdfbox write the first bytes into the merge output stream? Does it happen during the merge process or after the last source has been merged? In other words, does the requesting web client have to wait for the download to start until all sources have been merged, or not?
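
One way to answer this empirically, without reading the pdfbox sources, would be to wrap the destination stream in a small probe that records when the first byte arrives. The sketch below uses only the JDK; the class FirstByteProbe and its methods are hypothetical names and not part of pdfbox, and it makes no claim about how pdfbox actually behaves.

```java
import java.io.ByteArrayOutputStream;
import java.io.FilterOutputStream;
import java.io.IOException;
import java.io.OutputStream;

/** Hypothetical helper: records when the first byte reaches the wrapped stream. */
class FirstByteProbe extends FilterOutputStream {
    private final long createdAtNanos = System.nanoTime();
    private long firstWriteAtNanos = -1; // -1 means nothing has been written yet
    private long bytesWritten = 0;

    FirstByteProbe(OutputStream target) {
        super(target);
    }

    @Override
    public void write(int b) throws IOException {
        markFirstWrite();
        out.write(b);
        bytesWritten++;
    }

    @Override
    public void write(byte[] buf, int off, int len) throws IOException {
        if (len > 0) {
            markFirstWrite();
        }
        // Forward the whole chunk instead of FilterOutputStream's byte-by-byte default.
        out.write(buf, off, len);
        bytesWritten += len;
    }

    private void markFirstWrite() {
        if (firstWriteAtNanos < 0) {
            firstWriteAtNanos = System.nanoTime();
        }
    }

    boolean hasWritten() {
        return firstWriteAtNanos >= 0;
    }

    long getBytesWritten() {
        return bytesWritten;
    }

    /** Milliseconds between creating the probe and the first write, or -1 if none yet. */
    long millisUntilFirstWrite() {
        return hasWritten() ? (firstWriteAtNanos - createdAtNanos) / 1_000_000 : -1;
    }

    public static void main(String[] args) throws IOException {
        // Stand-in for the HTTP output stream, so the demo runs without a servlet container.
        FirstByteProbe probe = new FirstByteProbe(new ByteArrayOutputStream());
        System.out.println("written before any output: " + probe.hasWritten());
        probe.write(new byte[] {'%', 'P', 'D', 'F'});
        System.out.println("written after output: " + probe.hasWritten()
                + ", bytes: " + probe.getBytesWritten());
    }
}
```

In the web application, the probe would wrap the HTTP output stream before it is handed to the merger as destination stream. Comparing millisUntilFirstWrite() with the total duration of the merge then shows the behaviour: if the first write arrives only near the end, the client has to wait for the whole merge before the download starts.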


Thanks for information
Joern
