On 20/12/2012 13:07, [email protected] wrote:
Hello,

I have a question about PDF text extraction and layers. I have PDF files that contain multiple layers, and only one of these layers contains the text I am interested in. Is there a way to extract text from this particular layer only? I am familiar with programming, so I just wish to know whether the API supports this. So far, I have only been able to find examples that add new layers to PDF files.

When you download iText from SourceForge, there's a zip with three jars.
The code you need is in the xtra jar: com.itextpdf.text.pdf.ocg.OCGRemover
With this class, you can remove layers from a PDF, keeping only the one you're interested in. Or, by using OCGParser correctly, you could filter out the content of one specific layer.
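
A rough sketch of the first approach (remove every layer except the one you want, then run the normal text extraction on what is left) might look like the code below. It assumes iText 5 with the xtra jar on the classpath, and it assumes you already know the names of the layers in your file. The class name KeepOneLayer, the helper methods, the file name "layered.pdf" and the layer names "Background", "Annotations" and "Text" are all made up for illustration. The stripped document is written to memory and reopened before extraction, which is a cautious choice rather than something the API is known to require.

```java
import com.itextpdf.text.DocumentException;
import com.itextpdf.text.pdf.PdfReader;
import com.itextpdf.text.pdf.PdfStamper;
import com.itextpdf.text.pdf.ocg.OCGRemover;
import com.itextpdf.text.pdf.parser.PdfTextExtractor;

import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collection;
import java.util.List;

public class KeepOneLayer {

    // Hypothetical helper: every known layer name except the one to keep.
    static List<String> layersToRemove(Collection<String> allLayers, String keep) {
        List<String> result = new ArrayList<String>();
        for (String name : allLayers) {
            if (!name.equals(keep)) {
                result.add(name);
            }
        }
        return result;
    }

    // Hypothetical helper: strip the unwanted layers, then extract the remaining text.
    static String extractTextFromLayer(String src, Collection<String> allLayers, String keep)
            throws IOException, DocumentException {
        PdfReader reader = new PdfReader(src);
        List<String> unwanted = layersToRemove(allLayers, keep);

        // OCGRemover (xtra jar) removes the content that belongs to the named layers.
        new OCGRemover().removeLayers(reader, unwanted.toArray(new String[0]));

        // Write the stripped document to memory so it can be reopened cleanly.
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        PdfStamper stamper = new PdfStamper(reader, out);
        stamper.close();
        reader.close();

        // Extract text page by page from the stripped copy.
        PdfReader stripped = new PdfReader(out.toByteArray());
        StringBuilder text = new StringBuilder();
        for (int page = 1; page <= stripped.getNumberOfPages(); page++) {
            text.append(PdfTextExtractor.getTextFromPage(stripped, page)).append('\n');
        }
        stripped.close();
        return text.toString();
    }

    public static void main(String[] args) throws Exception {
        // File name and layer names are assumptions; replace them with your own.
        List<String> layers = Arrays.asList("Background", "Annotations", "Text");
        System.out.println(extractTextFromLayer("layered.pdf", layers, "Text"));
    }
}
```

The original file on disk is not modified, since the stripped copy only exists in memory. If your layer names are not unique or the layers are nested, you will probably need to adapt the list of names passed to removeLayers.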