I do not have a page of my own built just for speedy-ocr at the moment. The Ubuntu 10.04 Lucid package is hosted on Launchpad in our Vinux Lucid PPA.
To add the Vinux Lucid repository to your system, type:

    sudo add-apt-repository ppa:vinux/vinux-lucid

Then install speedy-ocr with the following two commands:

    sudo apt-get update
    sudo apt-get install speedy-ocr

The download page for this package is at:
https://launchpad.net/~vinux/+archive/vinux-lucid/+sourcepub/1517877/+listing-archive-extra

The source code is also available from git by typing the following at a command line:

    git clone [email protected]:speedy-ocr.git

Don Marang
Vinux Software Development Coordinator (vinux.org.uk)

There is just so much stuff in the world that, to me, is devoid of any real substance, value, and content that I just try to make sure that I am working on things that matter.
Dean Kamen

From: zdenko podobny
Sent: Tuesday, February 22, 2011 2:27 AM
To: [email protected]
Subject: Re: VietOCR v2.0/3.1 & VietOCR.NET v2.0 Releases

Hello,

can you please post a link where I can find the "speedy-ocr bash script"?

Zd.

On Tue, Feb 8, 2011 at 10:06 AM, SpeedyChair <[email protected]> wrote:

Another way to prepare a PDF document for tesseract is to use the 'convert' command from the ImageMagick package to split an image-only PDF file into a series of grayscale TIFF images, one for each page. The convert command can work on just about any image. For PDF conversions, it actually makes Ghostscript do all of the work. The same syntax also works with multi-page TIFF files and PostScript files.

    convert mydoc.pdf -type GrayScale -depth 8 -scene 1 mydoc-%03d.tif

Then you would need to loop through the TIFF files to perform OCR on each page image.

In a day or two, I will update my speedy-ocr bash script, which will then handle PDF image files.

Don Marang
Vinux Software Coordinator - vinux.org.uk

There is just so much stuff in the world that, to me, is devoid of any real substance, value, and content that I just try to make sure that I am working on things that matter.
Dean Kamen

From: KHEM Sochenda
Sent: Monday, February 07, 2011 10:23 PM
To: [email protected]
Subject: Re: VietOCR v2.0/3.1 & VietOCR.NET v2.0 Releases

Dear Quan,

I would like to know how to make tesseract OCR work with PDF documents.

Thank you very much in advance for your kind response.

With Best Regards,
Sochenda

On Tue, Feb 8, 2011 at 7:56 AM, Quan Nguyen <[email protected]> wrote:

A Java/.NET GUI frontend for the Tesseract OCR engine. The releases include the following fixes and improvements:

* Add support for spellcheck suggestion in context menu
* Improve program accessibility and usability
* Add support for downloading and installing language data packs and appropriate spell dictionaries
* Add UI localization for Lithuanian and Slovak
* Update Tesseract OCR engine to 3.01 (r551) (v3.1 only)

http://vietocr.sf.net

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to [email protected].
For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.
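
For anyone following the thread: the two steps Don describes (split the PDF with 'convert', then loop through the TIFF files with tesseract) might be scripted roughly as below. This is only a minimal sketch and is not the speedy-ocr script itself. The function name pdf_to_text and the choice to join all pages into one text file are assumptions made for illustration, and it assumes ImageMagick (with Ghostscript) and tesseract are installed and on the PATH.

```shell
#!/bin/sh
# Sketch only: split an image-only PDF into per-page TIFFs, OCR each page,
# and join the page texts into one file. pdf_to_text is a hypothetical name,
# not part of speedy-ocr.

pdf_to_text() {
    pdf="$1"
    base="${pdf%.*}"            # mydoc.pdf -> mydoc
    out="${base}.txt"

    # One 8-bit grayscale TIFF per page: mydoc-001.tif, mydoc-002.tif, ...
    # (same convert command as in the message above)
    convert "$pdf" -type GrayScale -depth 8 -scene 1 "${base}-%03d.tif" || return 1

    : > "$out"                  # start with an empty combined output file

    # Zero-padded page numbers keep the glob in page order (up to 999 pages).
    for page in "${base}"-*.tif; do
        [ -e "$page" ] || continue      # the glob matched nothing
        # tesseract adds ".txt" to the output base name it is given
        tesseract "$page" "${page%.tif}" || return 1
        cat "${page%.tif}.txt" >> "$out"
    done
}
```

Calling "pdf_to_text mydoc.pdf" would leave the per-page TIFF and text files in place and write the combined text to mydoc.txt. A language other than the default could be selected by adding tesseract's -l option to the tesseract line.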

