PDFBox can extract the text from the PDF files for you, however unpacking the zip file, locating the PDF documents, saving in a different format and rezipping I believe is something you'll have to handle with other other libraries like commons-compress [1].
Hope that helps. Branden [1] https://commons.apache.org/proper/commons-compress/ On Wed, Apr 20, 2016 at 12:51 PM, David Green <[email protected]> wrote: > . . . and save the text files in the same tree structure on another drive ? > this seems a big ask > > -- > Regards > David --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]

