I process docx for extraction only, but it seems that openxml4j itself is as far from the ideal of text extraction tool as POI+openxml is :)
Thanks for your reply, I'll think how it is better to solve my problem! On 4/9/08, Nick Burch <[EMAIL PROTECTED]> wrote: > On Wed, 9 Apr 2008, Yury Batrakov wrote: > > I've just tried to wrap hyperlinks and comments to XWPF classes but > > stumbled on the same problem: in XWPF we should deal with paragraphs, > > records, etc to fetch (for example) hyperlink text, instead of using > > pretty methods such as CTWorksheet.getHyperlinks() . Could you give me > > more convinient-to-read explanation (rather than build.xml) to implement > > such methods via xmlbeans? > > > If you're doing serious processing of .docx files, and not just text > extraction, you might find docx4j a better fit for you. For now, with poi, > we're just concentrating on text extraction for .docx. It seems silly to > put lots of work into a full .docx implementation, when there's already > one in openxml4j, and another in docx4j! > > The xmlbeans built jar is just a compiled version of the ooxml xsd schema > files. You'll need to read the ooxml specifications to figure out how it > all fits together :/ > > > Nick > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > > --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
