Re: [iText-questions] PdfReader question

2006-10-19 Thread Paulo Soares
- > From: [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED] On > Behalf Of [EMAIL PROTECTED] > Sent: Wednesday, October 18, 2006 4:00 PM > To: Post all your questions about iText here > Subject: Re: [iText-questions] PdfReader question > > David, > I agree, I've reve

Re: [iText-questions] PdfReader question

2006-10-18 Thread Paulo Soares
To: Post all your questions about iText here > Subject: Re: [iText-questions] PdfReader question > > David, > I agree, I've reverted to converting the Xerox MetaCode to > PostScript and run > distillr to generate the PDF. Files are much smaller and > more manageable so far

Re: [iText-questions] PdfReader question

2006-10-18 Thread billsanders
David, I agree, I've reverted to converting the Xerox MetaCode to PostScript and run distillr to generate the PDF. Files are much smaller and more manageable so far. I just started to run some large test on this new workflow. The performance of PdfReader using the RandomAccessFileOrArray is sti

Re: [iText-questions] PdfReader question

2006-10-18 Thread Webber, David (NIH/OD) [C]
Bill, This may require a different strategy - I'm not sure exactly what processing you are doing. It may be content within the PDF that is causing the problems - such as embedded images with strange scaling sizes. One thing we are doing is to pre-process each PDF to flatten the incoming document

Re: [iText-questions] PdfReader question

2006-10-17 Thread Mick Collins
I think you're correct - our files rarely exceed 40 MB M >From: [EMAIL PROTECTED] >Reply-To: Post all your questions about iText here > >To: Post all your questions about iText here > >Subject: Re: [iText-questions] PdfReader question >Date: Tue, 17 Oct 2006 13:27:

Re: [iText-questions] PdfReader question

2006-10-17 Thread Mick Collins
here > >Subject: Re: [iText-questions] PdfReader question >Date: Tue, 17 Oct 2006 13:07:15 -0500 > >The PDF's are ranging from 27,000 pages and up. >I've played with various forms of jvm settings-Xms512m -Xmx1024m with no >results >with the out of memory using P

Re: [iText-questions] PdfReader question

2006-10-17 Thread billsanders
Let me clarify one point regarding this post. It's not the number of pages, but I believe it's the size of the PDF. The average size on disk for these PDFs can be 700MB or larger for 27,000 pages. This is due to the method in which the PDF's are generated (Xerox MetaCode to PDF conversion). So

Re: [iText-questions] PdfReader question

2006-10-17 Thread billsanders
Paulo, This is all local storage. May be my implementation is causing some of the problems. First of all, I'm using iText via Jython, which seems to work well for PdfReader(String file). When I switch to PdfReader(RandomAccessFileOrArray(strPdfFile), None) the performance drops off significantly

Re: [iText-questions] PdfReader question

2006-10-17 Thread billsanders
The PDF's are ranging from 27,000 pages and up. I've played with various forms of jvm settings-Xms512m -Xmx1024m with no results with the out of memory using PdfReader(String file). It was recommended to use the PdfReader(RandomAccessFileOrArray raf), which causes reading pages and writing them to

Re: [iText-questions] PdfReader question

2006-10-17 Thread Paulo Soares
If you're using a file in a network disk it will take forever. If it's a local disk it will be slower but not that slower if using PdfCopy. Paulo > -Original Message- > From: [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED] On > Behalf Of Bill Sanders > Sent: Tuesday, October 17, 2006 2:5

Re: [iText-questions] PdfReader question

2006-10-17 Thread Mick Collins
What size pdfs are you dealing with? what are you doing -m reading and copying pdf? I'm processing thousands of pdfs in very short periods of time- minutes. I copy pages from one pdf to another, add blank pages, take blank pages out, stamp images using pdfStamper, count number of pages and concate