2011/9/26 Ram Kane <[email protected]>

> I've tried that. The problem is that it works at the document level.
>
> I need to be able to extract content for a given page.
>

Does it make sense to extract content by paragraph?
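
To show what paragraph-level extraction could look like, here is a rough
sketch. It does not use the Simple API: it relies only on the JDK's XML
parser, and it assumes you have already read content.xml out of the .odt
package into a string. The class and method names (ParagraphText,
paragraphs) are made up for illustration.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper, not part of the ODF Toolkit.
class ParagraphText {
    // ODF text namespace; paragraphs in content.xml are text:p elements.
    static final String TEXT_NS = "urn:oasis:names:tc:opendocument:xmlns:text:1.0";

    /** Returns the text of every text:p element, in document order. */
    static List<String> paragraphs(String contentXml) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setNamespaceAware(true); // needed to match elements by namespace
            Document dom = factory.newDocumentBuilder()
                    .parse(new InputSource(new StringReader(contentXml)));
            NodeList nodes = dom.getElementsByTagNameNS(TEXT_NS, "p");
            List<String> result = new ArrayList<>();
            for (int i = 0; i < nodes.getLength(); i++) {
                // getTextContent() concatenates the text of all descendants,
                // so text inside nested text:span elements is included.
                result.add(nodes.item(i).getTextContent());
            }
            return result;
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse content.xml markup", e);
        }
    }

    public static void main(String[] args) {
        String xml = "<office:text"
                + " xmlns:office=\"urn:oasis:names:tc:opendocument:xmlns:office:1.0\""
                + " xmlns:text=\"" + TEXT_NS + "\">"
                + "<text:p>First paragraph.</text:p>"
                + "<text:p>Second <text:span>paragraph</text:span>.</text:p>"
                + "</office:text>";
        for (String p : paragraphs(xml)) {
            System.out.println(p);
        }
    }
}
```

Note the limits of this sketch: headings (text:h) are skipped, whitespace
elements such as text:s and text:tab are not expanded, and a paragraph
nested inside another one (for example in a text box) would be reported
twice, once on its own and once as part of its parent.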


>
> Thx a lot for the code though.
>
>
> On Mon, Sep 26, 2011 at 2:46 AM, Devin Han <[email protected]> wrote:
> > Hi Ram,
> >
> > I suppose you only want to extract the text (header, footer, comments,
> > endnotes, etc.) and don't care about page breaks.
> > Please see the sample code.
> >
> >       TextDocument textdoc =
> >           (TextDocument) TextDocument.loadDocument("textExtractor.odt");
> >       EditableTextExtractor extractorD =
> >           EditableTextExtractor.newOdfEditableTextExtractor(textdoc);
> >       String output = extractorD.getText();
> >       System.out.println(output);
> >
> > This code fragment will return all of the content except the header and
> > footer. For the content of the header and footer, please refer to the
> > following:
> >            Header header = textdoc.getHeader();
> >            output = TextExtractor.getText(header.getOdfElement());
> >            System.out.println(output);
> >
> >            Footer footer = textdoc.getFooter();
> >            output = TextExtractor.getText(footer.getOdfElement());
> >            System.out.println(output);
> >
>



-- 
-Devin
