Re: [tesseract-ocr] Assert failed:in file weightmatrix.cpp, line 244

2018-07-24 Thread Emiliano Isaza Villamizar
If anyone is following this thread and are using OCR-D, I had to modify the .py file because I kept getting a Unicode error, just add these lines to the file: import sys reload(sys) sys.setdefaultencoding('utf-8') On Tuesday, July 24, 2018 at 4:41:45 PM UTC-5, Emiliano Isaza Villamizar wrote:

Re: [tesseract-ocr] Assert failed:in file weightmatrix.cpp, line 244

2018-07-24 Thread Emiliano Isaza Villamizar
I anyone is following this thread and are using OCR-D, I had to change the start of the .py file by adding these lines because I kept getting and unicode error: *import sys* *reload(sys)* *sys.setdefaultencoding('utf-8')* On Tuesday, July 24, 2018 at 4:41:45 PM UTC-5, Emiliano Isaza Villamizar

Re: [tesseract-ocr] Assert failed:in file weightmatrix.cpp, line 244

2018-07-24 Thread Emiliano Isaza Villamizar
It worked maybe I was using another *eng.traineddata. *Thank you for your time Shree and Lorenzo kind regards, Emiliano On Tuesday, July 24, 2018 at 11:40:34 AM UTC-5, shree wrote: > > * --continue_from >>> >>> /home/tulipan1637/Documents/Emiliano/OCR/OCRtraining/ocrd-train/tessdata/eng.l

Re: [tesseract-ocr] Assert failed:in file weightmatrix.cpp, line 244

2018-07-24 Thread shree
> > * --continue_from >> >> /home/tulipan1637/Documents/Emiliano/OCR/OCRtraining/ocrd-train/tessdata/eng.lstm >> >> \* >> * --old_traineddata >> /home/tulipan1637/Documents/Emiliano/OCR/OCRtraining/ocrd-train/tessdata/eng.traineddata >> >> \* >> > Use eng.traineddata from tessdata_best

Re: [tesseract-ocr] Assert failed:in file weightmatrix.cpp, line 244

2018-07-24 Thread Emiliano Isaza Villamizar
I'm using OCR-D I compiled it again changing the .traineddata in the original file but it hasn't worked. I still get the same error. Iteration 0: ALIGNED TRUTH : Zhejiang Huamei Holding Co Ltd Iteration 0: BEST OCR TEXT : ₩Z₩h₩e₩j₩i₩a₩n₩ ₩₩u₩a₩m₩e ₩₩o₩₩d₩i₩n₩ ₩C₩o ₩L₩₩d File /home/tulipan1637/D

Re: [tesseract-ocr] Assert failed:in file weightmatrix.cpp, line 244

2018-07-24 Thread Emiliano Isaza Villamizar
I'm using OCR-D that uses 4.0.0-beta.1 On Tuesday, July 24, 2018 at 12:05:22 AM UTC-5, shree wrote: > > Which version of tesseract are you using? > > Please post output of > > tesseract -v > > On Tue 24 Jul, 2018, 2:26 AM Emiliano Isaza Villamizar, > wrote: > >> Hello everyone, >> >> >> 'm trying

Re: [tesseract-ocr] Assert failed:in file weightmatrix.cpp, line 244

2018-07-24 Thread Lorenzo Bolzani
I had this error when I was mixing best models with non best models. I would try to run again combine_tessdata -e base_model/eng.traineddata base_model/eng.lstm to generate the eng.lstm from the "_best" model (the ones from /usr/share/tessdata are not the "_best" models). Then if the error is s

Re: [tesseract-ocr] Assert failed:in file weightmatrix.cpp, line 244

2018-07-23 Thread Shree Devi Kumar
Which version of tesseract are you using? Please post output of tesseract -v On Tue 24 Jul, 2018, 2:26 AM Emiliano Isaza Villamizar, wrote: > Hello everyone, > > > 'm trying to train tesseract to improve the detection of some prices such > as: CN¥2,400.48. I got got to a point that I keep gett

[tesseract-ocr] Assert failed:in file weightmatrix.cpp, line 244

2018-07-23 Thread Emiliano Isaza Villamizar
Hello everyone, 'm trying to train tesseract to improve the detection of some prices such as: CN¥2,400.48. I got got to a point that I keep getting this error: *total=`cat data/all-lstmf | wc -l` \* * no=`echo "$total * 0.90 / 1" | bc`; \* * head -n "$no" data/all-lstmf > "data/list.train"*