Thanks Nick and Helfi. I will go through those examples.
On Thu, Sep 22, 2011 at 2:50 PM, Nick Burch <[email protected]> wrote: > On Thu, 22 Sep 2011, Jebarlin Robertson wrote: > >> I am using Apache POI for reading MS Office 2007 documents, so I need some >> sample code to read the text from all the MS Office documents. >> > > If you just want plain text, then POI already has code for this. You'd want > classes such as org.apache.poi.xssf.extractor.**XSSFExcelExtractor and > org.apache.poi.xslf.extractor.**XSLFPowerPointExtractor > > Apache Tika may also help you here, it has code to extract contents from a > wide variety of files (including Microsoft Office ones via POI), all via a > common interface > > Nick > > > ------------------------------**------------------------------**--------- > To unsubscribe, e-mail: > [email protected].**org<[email protected]> > For additional commands, e-mail: [email protected] > >
