[tesseract-ocr] How to recognize some specific symbols with Tess4.0

2017-07-31 Thread robertyoung0511
Hello, I'm trying to apply Tess4.0 to recongnize the simplified Chinese with the command as: argc = 13; argv[1] =

Re: [tesseract-ocr] Building tesseract 4.0.0 from master on OS X

2017-07-31 Thread Kevin Schiesser
I used brew to install the dependencies and then ran the following: $ ./autogen.sh $ make $ sudo make install $ make training The last command exits with the following: ld: library not found for -lgobject-2.0 collect2: error: ld returned 1 exit status make[1]: *** [text2image] Error 1 make: ***

Re: [tesseract-ocr] Building tesseract 4.0.0 from master on OS X

2017-07-31 Thread 'Stefan Weil' via tesseract-ocr
Kevin, how did you run the failing builds on macOS? I just tested building with `brew install tesseract --HEAD --with-training-tools` and had no problems. An automake based builds also works with MacPorts. No modifications were needed for Tesseract git master. -- You received this message

[tesseract-ocr] tesseract-ocr-ell, tesseract-ocr-grc: improvements

2017-07-31 Thread dimitrDimitr
At http://www.elspell.gr/myspell there is OpenOffice Greek Dictionary v0.9 with 800.000 greek words encoded with windows-1253, under MPL 1.1/GPL 2.0/LGPL 2.1 License. Polytonic characters aren't used after 1982 and we don't have

[tesseract-ocr] Re: Tesseract AdaptToWordStr usage?

2017-07-31 Thread Syed Uzair
Sorry made a mistake in attached file names. When i uncomment lines 18,19 my console reads like output3.png (attachment). output2.png is the debug file. Thanks On Monday, July 31, 2017 at 5:09:37 PM UTC+5:30, Syed Uzair wrote: > > Hello all > > I am trying to extract text from the attached

Re: [tesseract-ocr] ERROR: Could not find training text file

2017-07-31 Thread ShreeDevi Kumar
add a line similar to following to your training command, pointing to where you have your training text --training_text ../langdata/eng/eng.training_text \ ShreeDevi भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Mon, Jul

[tesseract-ocr] Tesseract AdaptToWordStr usage?

2017-07-31 Thread Syed Uzair
Hello all I am trying to extract text from the attached image (010003.bin.png) using tesserocr (python wrapper for Tesseract 3.04 API). When i used the script TestAdapttoWord.py (attachment) with the lines 18,19 commented my console reads like output1.png (attachment) and when i uncomment

[tesseract-ocr] ERROR: Could not find training text file

2017-07-31 Thread Ava Nimaee
Hi . sorry I used this syntax: training/tesstrain.sh --fonts_dir /usr/share/fonts --lang eng --linedata_only \ --noextract_font_properties --langdata_dir langdata \ --tessdata_dir tessdata \ --fontlist "Times New Roman," --output_dir engtrain Befor that i create boxfile and tif and