Kris,
have a look at the examples[1] and especially at the ExtractText.java
example[2].
Cheers,
Erik
[1]
http://svn.apache.org/viewvc/pdfbox/trunk/pdfbox/src/main/java/org/apache/pdfbox/examples/
[2]
http://svn.apache.org/viewvc/pdfbox/trunk/pdfbox/src/main/java/org/apache/pdfbox/ExtractText.java?view=log
On 21.07.10 16:20, Kris Jack wrote:
Hi everyone,
I have just discovered the pdfbox today and have started to play around with
it. I can figure out how to extract text from a pdf and was wondering if it
is possible to extract text and font information from a pdf, producing
output equivalent to:
<bold>This is a sentence in bold</bold>.<italics>This is a sentence in
italics</italics>. ?
Many thanks in advance,
Kris