On 20/12/2012 13:07, [email protected] wrote:
> Hello,
> I have a question about PDF text extraction and layers. I have
> PDF files that have multiple layers. Only one of these layers
> contains text that I am interested in. Is there a way to extract
> text from this particular layer only? I am familiar with programming,
> so I just wish to know whether the API has support for this. So far,
> I have been able to find only examples that add new layers to PDF files.
When you download iText from SourceForge, there's a zip with three jars.
The code you need is in the xtra jar: com.itextpdf.text.pdf.ocg.OCGRemover
With this class, you can remove layers from a PDF, keeping only the one
you're interested in.
Or, by using OCGParser (in the same package) correctly, you could filter
out the content of one specific layer.
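
Since OCGRemover works by removing layers, keeping a single layer means
passing it the names of all the others. Below is a minimal sketch of that
selection step in plain Java. The class name LayerSelection, the method
layersToRemove and the layer names are made up for illustration; the iText
calls are shown only in comments and have not been tested here, so check
them against the version you downloaded.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collection;
import java.util.List;

class LayerSelection {

    /**
     * Returns every name in allLayers except layerToKeep, preserving order.
     * Throws if layerToKeep is not among the layers, so a typo in the name
     * cannot silently strip every layer from the document.
     */
    static List<String> layersToRemove(Collection<String> allLayers, String layerToKeep) {
        if (!allLayers.contains(layerToKeep)) {
            throw new IllegalArgumentException("No such layer: " + layerToKeep);
        }
        List<String> result = new ArrayList<String>();
        for (String name : allLayers) {
            if (!name.equals(layerToKeep)) {
                result.add(name);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Hypothetical layer names; the real ones come from the PDF itself.
        List<String> all = Arrays.asList("Background", "Text", "Annotations");
        List<String> remove = layersToRemove(all, "Text");
        System.out.println(remove); // prints [Background, Annotations]

        // With iText 5 and the xtra jar on the classpath, the list could then
        // be handed to OCGRemover, roughly like this (assumption, untested):
        //   PdfReader reader = new PdfReader("input.pdf");
        //   new OCGRemover().removeLayers(reader, remove.toArray(new String[0]));
        //   ... save the result with PdfStamper, then extract text from it
    }
}
```

The helper fails loudly on an unknown layer name on purpose: otherwise a
misspelled name would lead to every layer being removed.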
_______________________________________________
iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions
iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference
to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples:
http://itextpdf.com/themes/keywords.php