Although it is not pure Java, you can use OpenOffice's UNO bridge to extract almost everything from Word, Excel, PowerPoint, ... documents.
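As a rough illustration of letting the office suite do the extraction from Java, here is a minimal sketch. Note that it does not call the UNO API itself: it swaps in the simpler route of launching the office binary headless and asking its command-line converter for plain text. It assumes a LibreOffice-style `soffice` executable on the PATH that accepts `--headless --convert-to txt:Text --outdir`; the class and method names (`OfficeTextExtractor`, `buildCommand`, `outputName`, `extractText`) are made up for this example.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;

public class OfficeTextExtractor {

    // Builds the converter command line. Assumption: a LibreOffice-style
    // "soffice" binary is on the PATH and understands these options.
    static List<String> buildCommand(Path input, Path outDir) {
        return List.of(
            "soffice", "--headless",
            "--convert-to", "txt:Text",
            "--outdir", outDir.toString(),
            input.toString());
    }

    // Name of the text file the converter is expected to write:
    // the input's base name with its extension replaced by ".txt".
    static String outputName(Path input) {
        String name = input.getFileName().toString();
        int dot = name.lastIndexOf('.');
        return (dot > 0 ? name.substring(0, dot) : name) + ".txt";
    }

    // Runs the conversion and returns the extracted plain text.
    static String extractText(Path input, Path outDir)
            throws IOException, InterruptedException {
        Process process = new ProcessBuilder(buildCommand(input, outDir))
            .inheritIO()
            .start();
        int exitCode = process.waitFor();
        if (exitCode != 0) {
            throw new IOException("soffice exited with code " + exitCode);
        }
        // Assumption: the output is UTF-8; the real encoding of the
        // plain-text filter can depend on the platform and locale.
        byte[] bytes = Files.readAllBytes(outDir.resolve(outputName(input)));
        return new String(bytes, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 1) {
            System.err.println("usage: OfficeTextExtractor <document>");
            return;
        }
        Path outDir = Files.createTempDirectory("office-text");
        System.out.println(extractText(Path.of(args[0]), outDir));
    }
}
```

This only yields flat text. For lists, tables and images as structured objects you would still need to go through the UNO interfaces themselves.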
Take a look at: http://api.openoffice.org

Tom

Andreas Höhmann wrote:
> Hi folks,
>
> on http://jakarta.apache.org/poi/hwpf/projectplan.html there is a
> milestone for year 2003 (add support for tables and lists). Is it
> possible right now to "extract" doc lists/tables? Extraction of
> paragraphs is working, I think; I tried some code.
>
> I would like to *push* the doc part of POI ... how can I help?
> Is there a "POI doc project leader" who can answer/discuss some
> questions about POI development? I need more information about the
> main target of POI (especially the doc part of POI). In other words ...
> where are we going with POI? :)
>
> I have done a little investigation to find some *free* libraries with
> the capability to extract text, lists, tables and images from doc and
> ppt files ... but there is no (other) pure-Java solution available.
> Does anyone have an idea how I can do that? Please! It's very urgent :)
>
> Greetings from Leipzig (Germany)
> Andreas
