Ray has uploaded new traineddata files in
https://github.com/tesseract-ocr/tessdata/tree/master/best

Why don't you first try recognition with that

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Tue, Aug 1, 2017 at 1:45 PM, <robertyoung0...@gmail.com> wrote:

> Hello, Shree:
>
> I'm sorry, but whether can I use more than one unicharset, such as chi_sim
> and eng and so on, to finetune the training?
> Maybe some special characters can be in other unicharsets. If I find
> it/them, maybe I will train my traineddata with more unicharsets, and the
> special characters will be encoded at that time.
>
> Thanks, and hope for your reply.
>
> 在 2017年7月25日星期二 UTC+8下午3:23:08,shree写道:
>>
>> That error is because some characters in your training text are not part
>> of the unicharset of chi_sim.
>>
>> You are trying finetune training which will give error. Replace top layer
>> will work.
>>
>> I suggest that you wait 2-3 weeks for Ray to upload new traineddata for
>> all languages.
>>
>> You can tell us if there are any specific characters missing from
>> existing traineddata .
>>
>> ShreeDevi
>> ____________________________________________________________
>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>
>> On Tue, Jul 25, 2017 at 12:46 PM, <roberty...@gmail.com> wrote:
>>
>>> Hello,
>>>
>>> I apply the command to train my own traineddata:
>>>
>>> lstmtraining --model_output ~/tesstutorial/chituned_from_chisim/chituned \
>>>   --continue_from ~/tesstutorial/chituned_from_chisim/chi_sim.lstm \
>>>   --train_listfile ~/tesstutorial/chitest/chi.training_files.txt \
>>>   --eval_listfile ~/tesstutorial/chitest/chi.training_files.txt \
>>>   --target_error_rate 0.01
>>>
>>> An error appears by Tess4.0 that shown in the following img. The system 
>>> (Tess4.0) says "Can't encode transcript" for text content such as 
>>> "化简(-x2)3的结果是...".
>>> Why? Can you help me?
>>>
>>>
>>> <https://lh3.googleusercontent.com/-f5tjdv3_nvk/WXbvefZQYrI/AAAAAAAAAAM/COSWa-ewxy46XNkFxUCUl5V2r4K2ZfiQACLcBGAs/s1600/_%2524_WUP8_FXB%2560DR9_I5A8Y%2560L.png>
>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to tesseract-oc...@googlegroups.com.
>>> To post to this group, send email to tesser...@googlegroups.com.
>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>> To view this discussion on the web visit https://groups.google.com/d/ms
>>> gid/tesseract-ocr/e2e1d749-a55d-4355-b128-5d0fe2181e19%40goo
>>> glegroups.com
>>> <https://groups.google.com/d/msgid/tesseract-ocr/e2e1d749-a55d-4355-b128-5d0fe2181e19%40googlegroups.com?utm_medium=email&utm_source=footer>
>>> .
>>> For more options, visit https://groups.google.com/d/optout.
>>>
>>
>> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit https://groups.google.com/d/
> msgid/tesseract-ocr/2753f88a-ba89-4164-8271-9eb13207736f%
> 40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/2753f88a-ba89-4164-8271-9eb13207736f%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUKXSiqsVuQenHf%2BCBJ01-XOeGGM8FKNn-G0xH%2B47QCTw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to