Re: [Tesseract 3] English training text

2011-02-22 Thread Dmitry Silaev
Interesting. I was wondering about Cube since its traces began to appear in the source code but had no enough time to investigate it thorougly Zdenko, would you please kindly share your other findings on Cube? Regards, Dmitry On Tue, Feb 22, 2011 at 11:13 AM, zdenko podobny zde...@gmail.com

Re: problem in single word recognition

2011-02-22 Thread Dmitry Silaev
I might not understood you fully, but this is an obvious excerpt from baseapi.h: Each SetRectangle clears the recogntion results so multiple rectangles can be recognized with the same image Indeed, SetRectangle() calls ClearResults() which deletes the pageres and clears the block list ready for

Re: [Tesseract 3] English training text

2011-02-22 Thread zdenko podobny
Dmitry, unfortunately I have not enough of time for tests :-(. I still hope Ray will release more info before final 3.01. At the moment I focus on box editor. BR, Zdenko On Tue, Feb 22, 2011 at 9:27 AM, Dmitry Silaev daemons2...@gmail.comwrote: Interesting. I was wondering about Cube since

Re: VietOCR v2.0/3.1 VietOCR.NET v2.0 Releases

2011-02-22 Thread SpeedyChair
I do not have my own page built just for speedy-ocr at the moment. The Ubuntu 10.0.4 Lucid package is hosted on Launchpad in our Vinux Lucid PPA. To add the Vinux Lucid repository to your system, type: sudo add-apt-repository ppa:vinux/vinux-lucid Then install speedy-ocr with the

Re: Image pre-processing for good OCR results

2011-02-22 Thread Tom Morris
On Feb 20, 9:02 pm, Jon Andersen jande...@gmail.com wrote: My project athttp://RecordAGrave.comis about recording headstones from graves and posting the text and images on the Net so that people can research their family history.  I would appreciate some advice on how to pre-process these

problem in the mftraining part of the tesseract training

2011-02-22 Thread Open sourced nick
I've successfully created a box file with tesseract br now after running the unicharset_extractor br having it creating the unicharset file that looks like: ... n 3 NULL -1 s 3 NULL 23 t 3 NULL 43 ... I've continued with this command mftraining -U unicharset -O

Re: Image pre-processing for good OCR results

2011-02-22 Thread Jon Andersen
Vicky, I may be able to convert your local-minima code to OpenCV code; can you send me the result files as well as the filter? I wrote some Python code that uses OpenCV to crop the headstone images to show just the stone. Its not perfect, but it works OK. The Hough algorithm and the other

RE: Image pre-processing for good OCR results

2011-02-22 Thread Cong Nguyen
Dear Jon, Beginning for analyzing; I try also to detect lines, corners; but results are not good. I think due to images are low contrast. Please try to analyze with some data line profiles: ROI-left-profile: https://picasaweb.google.com/congnguyenba/TesseractBasedOCR#5576706091073985