I process docx for extraction only, but it seems that openxml4j itself
is as far from the ideal of text extraction tool as POI+openxml is :)

Thanks for your reply, I'll think how it is better to  solve my problem!

On 4/9/08, Nick Burch <[EMAIL PROTECTED]> wrote:
> On Wed, 9 Apr 2008, Yury Batrakov wrote:
>  > I've just tried to wrap hyperlinks and comments to XWPF classes but
>  > stumbled on the same problem: in XWPF we should deal with paragraphs,
>  > records, etc to fetch (for example) hyperlink text, instead of using
>  > pretty methods such as CTWorksheet.getHyperlinks() . Could you give me
>  > more convinient-to-read explanation (rather than build.xml) to implement
>  > such methods via xmlbeans?
>
>
> If you're doing serious processing of .docx files, and not just text
>  extraction, you might find docx4j a better fit for you. For now, with poi,
>  we're just concentrating on text extraction for .docx. It seems silly to
>  put lots of work into a full .docx implementation, when there's already
>  one in openxml4j, and another in docx4j!
>
>  The xmlbeans built jar is just a compiled version of the ooxml xsd schema
>  files. You'll need to read the ooxml specifications to figure out how it
>  all fits together :/
>
>
>  Nick
>
>  ---------------------------------------------------------------------
>  To unsubscribe, e-mail: [EMAIL PROTECTED]
>  For additional commands, e-mail: [EMAIL PROTECTED]
>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to