Oh the training started by itself after a long while and still processing. 
Does it normally take that long to train 6 images?



<https://lh3.googleusercontent.com/-S-zqe4mmBWA/W2lFl6LakEI/AAAAAAAAAOY/1g2tCBu6-cUDZjSj8-DsyvMhl3ypueJggCLcBGAs/s1600/Capture.PNG>


On Monday, August 6, 2018 at 11:42:40 PM UTC-7, May wrote:
>
> Thanks a lot Shree. I tried the tesseract 4.0 and the training is working 
> well until it reaches the lstm-training step and got stuck there. I am 
> totally new in the training so hope you don't mind if I am asking silly 
> questions. Do you know why I got stuck? Also, would you call this training 
> fine-tuning? As I just want to improve the accuracy of existing 
> eng.langdata. 
>
>
> <https://lh3.googleusercontent.com/-dWRkYql4AKA/W2k9PoNsndI/AAAAAAAAAOM/zWVkkPvUCT44moZPpvt6xgYFnQ0StwxUQCLcBGAs/s1600/Capture.PNG>
>
>
>
> On Monday, August 6, 2018 at 10:26:12 PM UTC-7, shree wrote:
>>
>> Ocr-d scripts are geared towards tesseract 4.0.x. you are trying to use 
>> it with tesseract 3.05.
>>
>> On Tue 7 Aug, 2018, 10:50 AM May, <[email protected]> wrote:
>>
>>> Hey Shree
>>>
>>> I also tried with the orignal script from the github. But faced the same 
>>> issue with the process stuck at unicharset_output.
>>>
>>>
>>> <https://lh3.googleusercontent.com/-rFB69WQGLIg/W2krzHUjFfI/AAAAAAAAAOA/SZ4CEzUIEGMIhQUWXHfHMS9H4Yxk-ADGwCLcBGAs/s1600/Capture.PNG>
>>>
>>>
>>> These are the versions:
>>> tesseract 3.05.02
>>>  leptonica-1.75.3
>>>   libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 1.5.3) : libpng 1.6.34 : 
>>> libtiff 4.0.9 : zlib 1.2.11 : libwebp 0.6.1 : libopenjp2 2.2.0
>>>
>>>
>>> On Thursday, August 2, 2018 at 8:52:38 PM UTC-7, shree wrote:
>>>>
>>>> Please use latest scripts from https://github.com/OCR-D/ocrd-train
>>>>
>>>> On Fri, Aug 3, 2018 at 4:41 AM May <[email protected]> wrote:
>>>>
>>>>>
>>>>> <https://lh3.googleusercontent.com/-LnwUni4-lLw/W2OPUqJpn_I/AAAAAAAAANs/Xd_-CVCdiMk0cjMmxBpVgfOSU1JeAacAgCLcBGAs/s1600/Capture.PNG>
>>>>>
>>>>>
>>>>>
>>>>> <https://lh3.googleusercontent.com/-j3_B1CmVv9w/W2OPbuUYH3I/AAAAAAAAANw/xmBXrNakKuMHm2L9cj-K3sCXCjFxuF80QCLcBGAs/s1600/Capture.PNG>
>>>>>
>>>>>
>>>>>
>>>>> Here are attached photos
>>>>>
>>>>>
>>>>> On Thursday, August 2, 2018 at 4:08:11 PM UTC-7, May wrote:
>>>>>>
>>>>>> Hey all,
>>>>>>
>>>>>> I am following Shree's script for OCR-d in the google groups for 
>>>>>> ocrd-training (
>>>>>> https://groups.google.com/forum/#!topic/tesseract-ocr/be4-rjvY2tQ). 
>>>>>> I managed to pass the combine tessdata stage but got stuck at the 
>>>>>> unicharset stage:
>>>>>>  
>>>>>>
>>>>>>
>>>>>> I have edited the script to direct it to my path:
>>>>>>
>>>>>> I do find a unicharset file named "unicharset" but not as 
>>>>>> "my.unicharset". Changing the script by removing "my." also did not 
>>>>>> solve 
>>>>>> the problem. Do you know what's causing the issue?
>>>>>>
>>>>>> Best
>>>>>> May
>>>>>>
>>>>> -- 
>>>>> You received this message because you are subscribed to the Google 
>>>>> Groups "tesseract-ocr" group.
>>>>> To unsubscribe from this group and stop receiving emails from it, send 
>>>>> an email to [email protected].
>>>>> To post to this group, send email to [email protected].
>>>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>>>> To view this discussion on the web visit 
>>>>> https://groups.google.com/d/msgid/tesseract-ocr/48347dd8-7b7e-4d0d-9cb5-b21e3ec23f31%40googlegroups.com
>>>>>  
>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/48347dd8-7b7e-4d0d-9cb5-b21e3ec23f31%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>>> .
>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>
>>>>
>>>>
>>>> -- 
>>>>
>>>> ____________________________________________________________
>>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>>>
>>> -- 
>>> You received this message because you are subscribed to the Google 
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send 
>>> an email to [email protected].
>>> To post to this group, send email to [email protected].
>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>> To view this discussion on the web visit 
>>> https://groups.google.com/d/msgid/tesseract-ocr/af43b995-7e24-4dca-827c-080755211544%40googlegroups.com
>>>  
>>> <https://groups.google.com/d/msgid/tesseract-ocr/af43b995-7e24-4dca-827c-080755211544%40googlegroups.com?utm_medium=email&utm_source=footer>
>>> .
>>> For more options, visit https://groups.google.com/d/optout.
>>>
>>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/5a1e3259-e0e4-45aa-8eb5-db28f0eba535%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to