I have posted a couple of test models for Farsi at https://github.com/Shreeshrii/tessdata_shreetest
These have not been trained on text with diacritics as the normalization and training process was giving error on the combining marks. Please give them a try and see if they provide better recognition for numbers and text without combining marks. FYI, I do not know the Persian language so it is difficult for me to gauge if results are ok or not. ShreeDevi ____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Tue, May 15, 2018 at 6:47 PM, reza <[email protected]> wrote: > hi again > thanks for your reply. > > i need more fonts. for examples : > B Koodak > B Lotus > B Titr > B Zar > B Yekan > Iran Nastaliq > > if needs, i send the .ttf files of that fonts ? > > thanks > > > On Tuesday, May 15, 2018 at 5:35:10 PM UTC+4:30, shree wrote: >> >> I will try to put together complete steps. >> >> I am doing a test run for training persian. >> >> Are the following fonts ok for it? >> >> '55_Sarchia_Kurdish' \ >> '56_Sarchia_Kurdish_Bold Bold' \ >> 'Amiri' \ >> 'Arabic Typesetting' \ >> 'Arial' \ >> 'Arial Unicode MS' \ >> 'B Nazanin' \ >> 'B Nazanin Bold' \ >> 'Calibri' \ >> 'Courier New' \ >> 'Microsoft Sans Serif' \ >> 'Scheherazade' \ >> 'Tahoma' \ >> 'Times New Roman,' \ >> 'Traditional Arabic' \ >> >> ShreeDevi >> ____________________________________________________________ >> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >> >> On Tue, May 15, 2018 at 3:59 PM, reza <[email protected]> wrote: >> >>> i test it on ubuntu , that raised error too. >>> >>> could u help me and send me a new bash file for fine tuning with new >>> fonts ? >>> >>> i put "eng.traineddata" fil in tessdata_best folder >>> and "eng.training_text" and "eng.traineddata" in langdata\eng >>> >>> is it true and sufficient ? or need more file ? >>> >>> >>> thanks >>> >>> -- >>> You received this message because you are subscribed to the Google >>> Groups "tesseract-ocr" group. >>> To unsubscribe from this group and stop receiving emails from it, send >>> an email to [email protected]. >>> To post to this group, send email to [email protected]. >>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>> To view this discussion on the web visit https://groups.google.com/d/ms >>> gid/tesseract-ocr/885e3e15-e08f-4489-a0bc-2162f913495a%40goo >>> glegroups.com >>> <https://groups.google.com/d/msgid/tesseract-ocr/885e3e15-e08f-4489-a0bc-2162f913495a%40googlegroups.com?utm_medium=email&utm_source=footer> >>> . >>> >>> For more options, visit https://groups.google.com/d/optout. >>> >> >> -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit https://groups.google.com/d/ > msgid/tesseract-ocr/e43db8d0-731e-4268-8791-9e243646f49d% > 40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/e43db8d0-731e-4268-8791-9e243646f49d%40googlegroups.com?utm_medium=email&utm_source=footer> > . > > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUXfFe4wtOWbgk7yA%2Bsz0NQeRGXAcKp2q%3DfjmYLc9FomA%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

