Might want to look at Tika (which uses PDFBox) for that. Let's say you have an <inputdir> that contains your zips.
java -jar tika-app.jar -J -t -i <inputdir> -o <outputdir> See if that gets you close enough. -----Original Message----- From: [email protected] [mailto:[email protected]] On Behalf Of David Green Sent: Wednesday, April 20, 2016 3:51 PM To: [email protected] Subject: is it possible to batch extract text from pdf files within a tree of folders within a zip file ? . . . and save the text files in the same tree structure on another drive ? this seems a big ask -- Regards David

