Re: [opensuse] PDF OCR

Kai Ponte Wed, 12 Dec 2007 13:46:42 -0800

On Wednesday 12 December 2007 10:52, Ken Schneider wrote:
> Roger Oberholtzer pecked at the keyboard and wrote:
> > Hello
> >
> > We have a network printer that will scan docs and send them as pdf docs
> > to an e-mail address in the company. Is there any software with OpenSUSE
> > 10.3 that can do OCR from a PDF doc? I am guessing that the doc contains
> > tiff images of the scanned documents. Any and all pointers are welcome.
>
> Have you tried pdftotext ?



I will happily recommend Tesseract.  

http://code.google.com/p/tesseract-ocr/

Here's a how-to on converting PDF to text, though I haven't yet managed to 
convert PDF to TIFF myself...

http://www.groklaw.net/articlebasic.php?story=20061210115516438
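
For what it's worth, the general shape of that pipeline is: render each PDF 
page to a TIFF image, then run tesseract on each image. Below is a rough, 
untested sketch in Python, assuming Ghostscript (gs) and tesseract are both 
installed and on the PATH. The function names, the file names (scan.pdf, 
page-001.tif, ...), the 300 dpi resolution and the tiffgray output device are 
my own placeholder choices, not something taken from the how-to above. Older 
Tesseract builds may be picky about which TIFF variants they accept, so the 
device may need changing.

```python
import glob
import os
import subprocess


def gs_command(pdf_path, tiff_pattern="page-%03d.tif", dpi=300):
    """Build a Ghostscript command that renders each PDF page to its own TIFF."""
    return [
        "gs",
        "-dNOPAUSE", "-dBATCH", "-q",    # run non-interactively and quietly
        "-sDEVICE=tiffgray",             # 8-bit grayscale TIFF (assumed choice)
        "-r%d" % dpi,                    # rendering resolution in dpi
        "-sOutputFile=" + tiff_pattern,  # %03d expands to the page number
        pdf_path,
    ]


def tesseract_command(tiff_path):
    """Build a tesseract command; tesseract itself appends .txt to the base."""
    base = os.path.splitext(tiff_path)[0]
    return ["tesseract", tiff_path, base]


def ocr_pdf(pdf_path):
    """Render the PDF to TIFF pages, then OCR each page into a .txt file."""
    subprocess.check_call(gs_command(pdf_path))
    for tiff in sorted(glob.glob("page-*.tif")):
        subprocess.check_call(tesseract_command(tiff))
```

Calling ocr_pdf("scan.pdf") in the directory holding the scanned file should 
then leave one page-NNN.txt per page, provided both tools cooperate.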

And a few more articles...

http://www.linuxjournal.com/article/9676

http://www.howtoforge.com/ocr_with_tesseract_on_ubuntu704

