Kris,

Have a look at the examples [1], and especially at the ExtractText.java example [2].
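
For the tagging step itself, here is a rough sketch of my own; it is not taken from the examples. It assumes you already have runs of text paired with the name of the font each run was drawn with. I believe those names can be collected from the TextPosition objects a PDFTextStripper subclass receives, but that part is not shown. FontTagger, Run, styleOf and tag are hypothetical names, and guessing the style from the font name is only a heuristic that will miss fonts with uninformative names.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.Objects;

// Hypothetical helper, not part of PDFBox: wraps runs of text in
// <bold>/<italics> tags based on the font name reported for each run.
class FontTagger {

    // One run of text plus the name of the font it was drawn with.
    static final class Run {
        final String text;
        final String fontName;

        Run(String text, String fontName) {
            this.text = text;
            this.fontName = fontName;
        }
    }

    // Heuristic (an assumption): many fonts encode the style in their name,
    // e.g. "Helvetica-Bold" or "Times-Italic". A bold-italic font is
    // reported as "bold" only, to keep the sketch short.
    static String styleOf(String fontName) {
        String n = fontName == null ? "" : fontName.toLowerCase(Locale.ROOT);
        if (n.contains("bold")) return "bold";
        if (n.contains("italic") || n.contains("oblique")) return "italics";
        return null; // regular text, no tag
    }

    // Concatenates the runs, opening or closing a tag whenever the style changes.
    static String tag(List<Run> runs) {
        StringBuilder out = new StringBuilder();
        String open = null; // tag currently open, or null
        for (Run r : runs) {
            String style = styleOf(r.fontName);
            if (!Objects.equals(style, open)) {
                if (open != null) out.append("</").append(open).append('>');
                if (style != null) out.append('<').append(style).append('>');
                open = style;
            }
            out.append(r.text);
        }
        if (open != null) out.append("</").append(open).append('>');
        return out.toString();
    }

    public static void main(String[] args) {
        // Made-up input; real runs would come from the PDF.
        List<Run> runs = Arrays.asList(
            new Run("This is a sentence in bold", "Helvetica-Bold"),
            new Run(".", "Helvetica"),
            new Run("This is a sentence in italics", "Helvetica-Oblique"),
            new Run(".", "Helvetica"));
        System.out.println(tag(runs));
    }
}
```

Consecutive runs with the same style share one pair of tags, so per-character font information does not produce a tag around every character.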

Cheers,
Erik

[1] http://svn.apache.org/viewvc/pdfbox/trunk/pdfbox/src/main/java/org/apache/pdfbox/examples/

[2] http://svn.apache.org/viewvc/pdfbox/trunk/pdfbox/src/main/java/org/apache/pdfbox/ExtractText.java?view=log

On 21.07.10 16:20, Kris Jack wrote:
Hi everyone,

I discovered PDFBox today and have started to play around with
it.  I have figured out how to extract text from a PDF and was wondering if it
is possible to extract text and font information from a PDF, producing
output equivalent to:

<bold>This is a sentence in bold</bold>.<italics>This is a sentence in
italics</italics>.  ?

Many thanks in advance,
Kris
