Are you actually seeing a limit?  My experience is that PDFs with huge
numbers of pages wind up with very big in-memory xref tables, and that can
consume quite a bit of memory - but if you have enough memory, then there
isn't any practical limit that I'm aware of.

Some things you can do to reduce memory usage:

Open the PDF in partial mode
Use the latest code in SVN (I recently added a bunch of optimization code
for working with super large files)

If you do run into memory issues, you might find that you can work around
them by processing some number of pages at a time, then re-opening the PDF
(in partial mode!), and processing the next chunk of pages.  This will
effectively purge out unneeded xref entries.  You'd probably be looking at
chunks of 10-50K pages or so, but do some testing to see where the garbage
collector starts to get bogged down and go from there.


It's not possible to search a PDF without first extracting it's contents. 
If you don't want to do a linear search, then you should index the text you
extracted.  iText isn't going to help you here - something like the Lucene
project is what you are looking for.

Cheers.





--
View this message in context: 
http://itext-general.2136553.n4.nabble.com/Search-Text-and-Capacity-of-iText-to-read-tp4657270p4657271.html
Sent from the iText - General mailing list archive at Nabble.com.

------------------------------------------------------------------------------
Master SQL Server Development, Administration, T-SQL, SSAS, SSIS, SSRS
and more. Get SQL Server skills now (including 2012) with LearnDevNow -
200+ hours of step-by-step video tutorials by Microsoft MVPs and experts.
SALE $99.99 this month only - learn more at:
http://p.sf.net/sfu/learnmore_122512
_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions

iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference 
to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples: 
http://itextpdf.com/themes/keywords.php

Reply via email to