Common fonts do not cover every unicode symbol (about 100000). If one font works and another does not the text is correct and you just need to find fonts covering that language.
Lorenzo Il sab 14 mar 2020, 23:34 aby tesh <[email protected]> ha scritto: > Even google's Noto font doesn't show glyphs while opening it with Gnome > Fonts, does that mean it is not a unicode font? > > On Saturday, March 14, 2020 at 8:45:46 PM UTC+3, shree wrote: >> >> Are all these Unicode fonts? >> >> What about training text in utf-8 Unicode encoding? >> >> On Sat, Mar 14, 2020, 22:37 aby tesh <[email protected]> wrote: >> >>> Hey shree, I have compiled all relevant fonts and attached them below. I >>> am not sure know how i can generate text data with it. >>> >>> On Tuesday, March 10, 2020 at 5:35:26 AM UTC+3, shree wrote: >>>> >>>> If you can share a large enough training text and fonts, I can rerun >>>> the training. >>>> >>>> On Tue, Mar 10, 2020, 03:41 aby tesh <[email protected]> wrote: >>>> >>>>> Hey, >>>>> >>>>> I followed the steps in the readme file, and i started the >>>>> lstmtraining, but it seems my current computer's processor can't handle >>>>> the >>>>> training for a longer period of time. >>>>> >>>>> What can i do about it? When should i abort the training to get a good >>>>> trainedata file? or is there one which is accurate that you can share ? >>>>> >>>>> Thanks >>>>> >>>>> -- >>>>> You received this message because you are subscribed to the Google >>>>> Groups "tesseract-ocr" group. >>>>> To unsubscribe from this group and stop receiving emails from it, send >>>>> an email to [email protected]. >>>>> To view this discussion on the web visit >>>>> https://groups.google.com/d/msgid/tesseract-ocr/e727f106-d668-44b5-9bba-8fad29fc1587%40googlegroups.com >>>>> <https://groups.google.com/d/msgid/tesseract-ocr/e727f106-d668-44b5-9bba-8fad29fc1587%40googlegroups.com?utm_medium=email&utm_source=footer> >>>>> . >>>>> >>>> -- >>> You received this message because you are subscribed to the Google >>> Groups "tesseract-ocr" group. >>> To unsubscribe from this group and stop receiving emails from it, send >>> an email to [email protected]. >>> To view this discussion on the web visit >>> https://groups.google.com/d/msgid/tesseract-ocr/efa79761-20a5-4d20-b0c1-40eb2523c289%40googlegroups.com >>> <https://groups.google.com/d/msgid/tesseract-ocr/efa79761-20a5-4d20-b0c1-40eb2523c289%40googlegroups.com?utm_medium=email&utm_source=footer> >>> . >>> >> -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/373dfeed-d09f-49cc-9f3e-8b0d55661f1c%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/373dfeed-d09f-49cc-9f3e-8b0d55661f1c%40googlegroups.com?utm_medium=email&utm_source=footer> > . > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAMgOLLwb8wx85FYX8OXRmDTR-xq20edL%2BNgUo%2Bc0%2B2ycTUqXPg%40mail.gmail.com.

