[tesseract-ocr] Train Tesseract 4.0 LSTM based on images

2017-04-11 Thread Ahmad Moawad
This is the part from https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00 My question relates to training from images, not training from text. The overall training process is similar to training 3.04

[tesseract-ocr] Help in TrainingTesseract 4.00 Finetune

2017-04-11 Thread Ahmad Moawad
Hello all, I would like help with fine-tuning Tesseract 4.00: https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00---Finetune I want to understand some parameters, such as: 1- langdata_dir: is that the file in https://github.com/tesseract-ocr/langdata training/tesstrain.sh --fonts_dir

Re: [tesseract-ocr] Re: Tesseract Installation

2017-04-11 Thread ShreeDevi Kumar
Also, if you want the training tools, you need to build them separately - see https://github.com/tesseract-ocr/tesseract/wiki/Compiling Run "make training", then "sudo make training-install". ShreeDevi भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

[tesseract-ocr] Re: Tesseract Installation

2017-04-11 Thread shree
On Tuesday, April 11, 2017 at 4:10:26 PM UTC+5:30, Ibr wrote: > Note: I'm using windows 10 bash > I use it too, but via MobaXterm, which makes it easier to use. See http://mobaxterm.mobatek.net/download-home-edition.html

Re: [tesseract-ocr] Tesseract Installation

2017-04-11 Thread ShreeDevi Kumar
You can ignore it. I get it too when using sudo a second time. The host name must be the ID for your computer under Windows 10. Have you tried running tesseract after that? - excuse the brevity, sent from mobile On 11-Apr-2017 4:10 PM, "Ibr" wrote: Hi, I'm trying to install the

[tesseract-ocr] Tesseract Installation

2017-04-11 Thread Ibr
Hi, I'm trying to install Tesseract following the steps from this website. I ran the commands for step 5 and all worked fine except the command *sudo ldconfig*, which returned the error *sudo: unable to

Re: [tesseract-ocr] How to add Armenian language support to tesseract

2017-04-11 Thread ShreeDevi Kumar
I have added this at https://github.com/tesseract-ocr/langdata/issues/67 Please add more information there: which language code (arm or hye); Modern Armenian or Classical Armenian; sources for primary Unicode texts in the Armenian language to use for training; freely available Unicode fonts to

Re: [tesseract-ocr] Help with blurred OCR but "simple text"

2017-04-11 Thread Javier Abascal
Hi, I have tried to use an Otsu threshold and it didn't work very well. I am still not able to recognize the word Carolline, for example. Here is the code I used for it. *Any other ideas, people?* :):) from PIL import Image img = Image.open("example_ocr_1.jpg").convert('L') img_array =
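For readers following this thread, here is a minimal sketch of Otsu thresholding, the technique named in the message above. It is not the poster's code, which is cut off in this preview. The function names `otsu_threshold` and `binarize` are hypothetical, and the sketch assumes the input is a flat sequence of 8-bit grayscale values (0-255). It uses only the standard library.

```python
def otsu_threshold(pixels):
    """Return the threshold t (0-255) that maximizes between-class variance.

    Pixels <= t are treated as background, pixels > t as foreground.
    """
    # Histogram of the 256 possible gray levels.
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_bg = 0      # number of pixels at or below the candidate threshold
    sum_bg = 0    # sum of gray levels of those pixels

    for t in range(256):
        w_bg += hist[t]
        sum_bg += t * hist[t]
        if w_bg == 0:
            continue              # no background pixels yet
        w_fg = total - w_bg
        if w_fg == 0:
            break                 # every pixel is background from here on
        mean_bg = sum_bg / w_bg
        mean_fg = (sum_all - sum_bg) / w_fg
        # Between-class variance, up to a constant factor of 1 / total**2.
        var_between = w_bg * w_fg * (mean_bg - mean_fg) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(pixels, t):
    """Map each pixel to 0 (<= t) or 255 (> t)."""
    return [255 if p > t else 0 for p in pixels]


# Possible use with Pillow (an assumption, not taken from the thread):
#   from PIL import Image
#   img = Image.open("example_ocr_1.jpg").convert("L")
#   t = otsu_threshold(img.tobytes())   # iterating bytes yields ints 0-255
#   img = img.point(lambda p: 255 if p > t else 0)
```

A single global threshold like this tends to struggle on blurred or unevenly lit text, which may be part of why it did not help here.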

[tesseract-ocr] How to add Armenian language support to tesseract

2017-04-11 Thread vner75
Dear all, I have been trying Tesseract recently and it is really a very good product. I would like to ask if there is any tutorial or set of steps on how to add support for a new language to the package, for example the Armenian language. Thank you in advance. Regards, Vahe