[tesseract-ocr] How can i train tesseract from images directly ?

2017-08-01 Thread Harshit Gupta
I am having images from number plate of vehicles which isn't a standard font. I have cropped images of characters in number plate. I tried the following - 1. Created a grid of all images so tesseract can read them for training and generated tif file for it. 2. Then i generated box

Re: [tesseract-ocr] "Can't encode transcript" error when using "lstmtraining" command with Tess4.0

2017-08-01 Thread robertyoung0511
When I use the new traineddata, it will *report **an **error : cannot find the chi_sim.traineddata. Does the new traineddata only support the Tess4.0 alpa release? I use the newest code release.* 在 2017年8月1日星期二 UTC+8下午4:45:07,shree写道: > > Ray has uploaded new traineddata files in >

Re: [tesseract-ocr] "Can't encode transcript" error when using "lstmtraining" command with Tess4.0

2017-08-01 Thread robertyoung0511
OK,I will have a try. Thanks 在 2017年8月1日星期二 UTC+8下午4:45:07,shree写道: > > Ray has uploaded new traineddata files in > https://github.com/tesseract-ocr/tessdata/tree/master/best > > Why don't you first try recognition with that > > ShreeDevi >

Re: [tesseract-ocr] "Can't encode transcript" error when using "lstmtraining" command with Tess4.0

2017-08-01 Thread ShreeDevi Kumar
Ray has uploaded new traineddata files in https://github.com/tesseract-ocr/tessdata/tree/master/best Why don't you first try recognition with that ShreeDevi भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Tue, Aug 1, 2017 at

Re: [tesseract-ocr] "Can't encode transcript" error when using "lstmtraining" command with Tess4.0

2017-08-01 Thread robertyoung0511
Hello, Shree: I'm sorry, but whether can I use more than one unicharset, such as chi_sim and eng and so on, to finetune the training? Maybe some special characters can be in other unicharsets. If I find it/them, maybe I will train my traineddata with more unicharsets, and the special