I have posted a couple of test models for Farsi at
https://github.com/Shreeshrii/tessdata_shreetest

These have not been trained on text with diacritics, because the
normalization and training process was giving errors on the combining marks.

Please give them a try and see if they provide better recognition for
numbers and text without combining marks.

FYI, I do not know the Persian language, so it is difficult for me to gauge
whether the results are OK.

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Tue, May 15, 2018 at 6:47 PM, reza <[email protected]> wrote:

> Hi again,
> Thanks for your reply.
>
> I need more fonts, for example:
> B Koodak
> B Lotus
> B Titr
> B Zar
> B Yekan
> Iran Nastaliq
>
> If needed, should I send the .ttf files for those fonts?
>
> Thanks
>
>
> On Tuesday, May 15, 2018 at 5:35:10 PM UTC+4:30, shree wrote:
>>
>> I will try to put together complete steps.
>>
>> I am doing a test run for training Persian.
>>
>> Are the following fonts ok for it?
>>
>>   '55_Sarchia_Kurdish' \
>>   '56_Sarchia_Kurdish_Bold Bold' \
>>   'Amiri' \
>>   'Arabic Typesetting' \
>>   'Arial' \
>>   'Arial Unicode MS' \
>>   'B Nazanin' \
>>   'B Nazanin Bold' \
>>   'Calibri' \
>>   'Courier New' \
>>   'Microsoft Sans Serif' \
>>   'Scheherazade' \
>>   'Tahoma' \
>>   'Times New Roman,' \
>>   'Traditional Arabic' \
>>
>> ShreeDevi
>>
>> On Tue, May 15, 2018 at 3:59 PM, reza <[email protected]> wrote:
>>
>>> I tested it on Ubuntu, and it raised an error there too.
>>>
>>> Could you help me and send me a new bash script for fine-tuning with new
>>> fonts?
>>>
>>> I put the "eng.traineddata" file in the tessdata_best folder,
>>> and "eng.training_text" and "eng.traineddata" in langdata\eng.
>>>
>>> Is this correct and sufficient, or are more files needed?
>>>
>>>
>>> Thanks

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUXfFe4wtOWbgk7yA%2Bsz0NQeRGXAcKp2q%3DfjmYLc9FomA%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.
