I ended up finding that it was better to use pdfbox for this piece of
functionality...though it wasn't for words (pdfs aren't structured that
way)...so I ended up getting
positions of individual characters and compared to a known piece of
plain text for a page to figure out what word/where it was (combined
with relative position).
pdfbox got me the char and char position though...though thats java :(
On 1/8/13 9:53 PM, Debasis Mandal wrote:
Hello,
I am working on extracting text from pdf and want to get exact
position of all words (in the form of co-ordinates) from pdf by using
itextsharp dll. I am using .Net Framework. But I am facing some
problem - when i am extracting words from pdf, I can not get the right
words. It's split multiple part of a word. For example, If
word="PAGE", first time its render word="PAG" then next render
word="E". Also facing same problem for finding co-ordinate of a word.
Can you help me on how to extract word with position(co-ordinates)
from pdf in .Net Framework.
Thanks,
Debasis Mandal
------------------------------------------------------------------------------
Master Java SE, Java EE, Eclipse, Spring, Hibernate, JavaScript, jQuery
and much more. Keep your Java skills current with LearnJavaNow -
200+ hours of step-by-step video tutorials by Java experts.
SALE $49.99 this month only -- learn more at:
http://p.sf.net/sfu/learnmore_122612
_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions
iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference
to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples:
http://itextpdf.com/themes/keywords.php
------------------------------------------------------------------------------
Master Java SE, Java EE, Eclipse, Spring, Hibernate, JavaScript, jQuery
and much more. Keep your Java skills current with LearnJavaNow -
200+ hours of step-by-step video tutorials by Java experts.
SALE $49.99 this month only -- learn more at:
http://p.sf.net/sfu/learnmore_122612
_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions
iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference
to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples:
http://itextpdf.com/themes/keywords.php