Hello,
I am using iText to parse text from PDF files. I found that some text are
returned but are not visible.
For example, I got
* “financial derivative” , which does exist on the page
* “FINANCIAL DERIVATIVE” which is visible, but still returned by iText.
In addition, they are not selectable by Adobe acrobat or Foxit.
Does iText has a way to differentiate visible/invisible text? Or does the PDF
format has any specification relating to this?
Feng
------------------------------------------------------------------------------
_______________________________________________
iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions
iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference
to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples:
http://itextpdf.com/themes/keywords.php