2009/9/4 Jed Rothwell <[email protected]>: >> Also, did you know you can batch-ocr any number of pdfs at a time? > > With what program?
Acrobat Pro (Advanced > Document Processing > Batch Processing > New sequence...) You can do all sorts of batch processing with this, not just OCR, to all pdfs in the source folder you specify, with the processed files going to the destination folder you specify. BTW, apart from those you described, another simple way to know if a pdf is image only is to do a "select all" in Acrobat Pro, if it finds no text to select it tells you so and asks you if you want to perform an OCR. If/when it does have underlying text, it shows it as blocks. Whether selected or not, right-clicking the image of a word shows you the underlying text for that word at the bottom of the contextual menu: "look up xxxx" Again, thanks for your great library work, I hope you'll get the institutional support you requested. Michel

