Might want to look at Tika (which uses PDFBox) for that.

Let's say you have an <inputdir> that contains your zips.

java -jar tika-app.jar -J -t -i <inputdir> -o <outputdir>

See if that gets you close enough.

-----Original Message-----
From: [email protected] [mailto:[email protected]] On Behalf 
Of David Green
Sent: Wednesday, April 20, 2016 3:51 PM
To: [email protected]
Subject: is it possible to batch extract text from pdf files within a tree of 
folders within a zip file ?

. . . and save the text files in the same tree structure on another drive ?
this seems a big ask

-- 
Regards
David

Reply via email to