On Sat, Mar 31, 2012 at 12:09 PM, Jeff Johnson <[email protected]> wrote:
> I need to convert 300 identical PDF's (different information in the same
> fields) to text.  What is the best way to go about this?  Is there a
> product on the market that does this efficiently?  IOW takes all pdfs in
> a folder and converts them to text with a .txt extension in on process?

If you have a Linux/Unix/OSX box handy, or CygWin installed on your
Windows box, the commandline pdftotext will do the conversion.
However, as the manual page notes, "Some  PDF  files  contain  fonts
whose  encodings  have been mangled beyond recognition.  There is no
way (short of OCR) to extract text from these files."

-- 
Ted Roche
Ted Roche & Associates, LLC
http://www.tedroche.com

_______________________________________________
Post Messages to: [email protected]
Subscription Maintenance: http://leafe.com/mailman/listinfo/profox
OT-free version of this list: http://leafe.com/mailman/listinfo/profoxtech
Searchable Archive: http://leafe.com/archives/search/profox
This message: 
http://leafe.com/archives/byMID/profox/cacw6n4skjbp7c3xybm6gb20e+vdecxu8mabghjdk3oypdrr...@mail.gmail.com
** All postings, unless explicitly stated otherwise, are the opinions of the 
author, and do not constitute legal or medical advice. This statement is added 
to the messages for those lawyers who are too stupid to see the obvious.

Reply via email to