If anyone is following this thread and are using OCR-D, I had to modify the
.py file because I kept getting a Unicode error, just add these lines to
the file:
import sys
reload(sys)
sys.setdefaultencoding('utf-8')
On Tuesday, July 24, 2018 at 4:41:45 PM UTC-5, Emiliano Isaza Villamizar
wrote:
I anyone is following this thread and are using OCR-D, I had to change the
start of the .py file by adding these lines because I kept getting and
unicode error:
*import sys*
*reload(sys)*
*sys.setdefaultencoding('utf-8')*
On Tuesday, July 24, 2018 at 4:41:45 PM UTC-5, Emiliano Isaza Villamizar
It worked maybe I was using another *eng.traineddata. *Thank you for your
time Shree and Lorenzo
kind regards,
Emiliano
On Tuesday, July 24, 2018 at 11:40:34 AM UTC-5, shree wrote:
>
> * --continue_from
>>>
>>> /home/tulipan1637/Documents/Emiliano/OCR/OCRtraining/ocrd-train/tessdata/eng.l
>
> * --continue_from
>>
>> /home/tulipan1637/Documents/Emiliano/OCR/OCRtraining/ocrd-train/tessdata/eng.lstm
>>
>> \*
>> * --old_traineddata
>> /home/tulipan1637/Documents/Emiliano/OCR/OCRtraining/ocrd-train/tessdata/eng.traineddata
>>
>> \*
>>
>
Use eng.traineddata from tessdata_best
I'm using OCR-D I compiled it again changing the .traineddata in the
original file but it hasn't worked. I still get the same error.
Iteration 0: ALIGNED TRUTH : Zhejiang Huamei Holding Co Ltd
Iteration 0: BEST OCR TEXT : ₩Z₩h₩e₩j₩i₩a₩n₩ ₩₩u₩a₩m₩e ₩₩o₩₩d₩i₩n₩ ₩C₩o
₩L₩₩d
File
/home/tulipan1637/D
I'm using OCR-D that uses 4.0.0-beta.1
On Tuesday, July 24, 2018 at 12:05:22 AM UTC-5, shree wrote:
>
> Which version of tesseract are you using?
>
> Please post output of
>
> tesseract -v
>
> On Tue 24 Jul, 2018, 2:26 AM Emiliano Isaza Villamizar, > wrote:
>
>> Hello everyone,
>>
>>
>> 'm trying
I had this error when I was mixing best models with non best models.
I would try to run again
combine_tessdata -e base_model/eng.traineddata base_model/eng.lstm
to generate the eng.lstm from the "_best" model (the ones from
/usr/share/tessdata are not the "_best" models).
Then if the error is s
Which version of tesseract are you using?
Please post output of
tesseract -v
On Tue 24 Jul, 2018, 2:26 AM Emiliano Isaza Villamizar,
wrote:
> Hello everyone,
>
>
> 'm trying to train tesseract to improve the detection of some prices such
> as: CN¥2,400.48. I got got to a point that I keep gett
Hello everyone,
'm trying to train tesseract to improve the detection of some prices such
as: CN¥2,400.48. I got got to a point that I keep getting this error:
*total=`cat data/all-lstmf | wc -l` \*
* no=`echo "$total * 0.90 / 1" | bc`; \*
* head -n "$no" data/all-lstmf > "data/list.train"*
9 matches
Mail list logo