Hi,

On Tue, Aug 4, 2009 at 9:30 AM, Michael
Wechner<michael.wech...@wyona.com> wrote:
> String XHTMLBean.getHead().getMeta(XHTMLBean.DESCRIPTION)
> String XHTMLBean.getHead().getTitle()

These you can get from the Metadata object.

> String[] XHTMLBean.getBody().getParagraphs();

This is a bit troublesome as not all parsers produce paragraphs of
content. For example the Excel parser produces XHTML tables.

You can either get just the plain character stream using tools like
BodyContentHandler, or the full XHTML output as SAX events (which you
can serialize to a byte stream if you want). I'm not sure if there's
any reasonable intermediate content abstraction.

BR,

Jukka Zitting

Reply via email to