There is no online corpus for xsa that I could find.

Two of the fonts you sent are legacy fonts, that is they map English
letters to ancient Arabic characters.

Are there any converters that convert from the legacy mapping to Unicode?

If there is existing text in legacy fonts, it can be converted to Unicode
and that can be used for training.

On Sun, Mar 15, 2020, 17:57 aby tesh <[email protected]> wrote:

> Where can i get the training text, or can i create a new one. I have a
> problem writing with fonts which some of included in the attachment i sent
> you.
>
> On Sunday, March 15, 2020 at 4:32:08 AM UTC+3, shree wrote:
>>
>> I had used the findfonts feature of text2image and found only two fonts
>> that rendered the xsa text. I will check the fonts that you sent. What
>> about training text? Unless you have some more text, it will be difficult
>> to do training.
>>
>> Quivira
>> Segoe UI Historic
>>
>> On Sun, Mar 15, 2020, 04:01 aby tesh <[email protected]> wrote:
>>
>>> That is what i am not getting, i don't think they all are unicode fonts,
>>> i couldn't get one. Some render on my machine (Linux) some don't.
>>>
>>> On Saturday, March 14, 2020 at 8:45:46 PM UTC+3, shree wrote:
>>>>
>>>> Are all these Unicode fonts?
>>>>
>>>> What about training text in utf-8 Unicode encoding?
>>>>
>>>> On Sat, Mar 14, 2020, 22:37 aby tesh <[email protected]> wrote:
>>>>
>>>>> Hey shree, I have compiled all relevant fonts and attached them below.
>>>>> I am not sure know how i can generate text data with it.
>>>>>
>>>>> On Tuesday, March 10, 2020 at 5:35:26 AM UTC+3, shree wrote:
>>>>>>
>>>>>> If you can share a large enough training text and fonts, I can rerun
>>>>>> the training.
>>>>>>
>>>>>> On Tue, Mar 10, 2020, 03:41 aby tesh <[email protected]> wrote:
>>>>>>
>>>>>>> Hey,
>>>>>>>
>>>>>>> I followed the steps in the readme file, and i started the
>>>>>>> lstmtraining, but it seems my current computer's processor can't handle 
>>>>>>> the
>>>>>>> training for a longer period of time.
>>>>>>>
>>>>>>> What can i do about it? When should i abort the training to get a
>>>>>>> good trainedata file? or is there one which is accurate that you can 
>>>>>>> share ?
>>>>>>>
>>>>>>> Thanks
>>>>>>>
>>>>>>> --
>>>>>>> You received this message because you are subscribed to the Google
>>>>>>> Groups "tesseract-ocr" group.
>>>>>>> To unsubscribe from this group and stop receiving emails from it,
>>>>>>> send an email to [email protected].
>>>>>>> To view this discussion on the web visit
>>>>>>> https://groups.google.com/d/msgid/tesseract-ocr/e727f106-d668-44b5-9bba-8fad29fc1587%40googlegroups.com
>>>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/e727f106-d668-44b5-9bba-8fad29fc1587%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>>>>> .
>>>>>>>
>>>>>> --
>>>>> You received this message because you are subscribed to the Google
>>>>> Groups "tesseract-ocr" group.
>>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>>> an email to [email protected].
>>>>> To view this discussion on the web visit
>>>>> https://groups.google.com/d/msgid/tesseract-ocr/efa79761-20a5-4d20-b0c1-40eb2523c289%40googlegroups.com
>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/efa79761-20a5-4d20-b0c1-40eb2523c289%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>>> .
>>>>>
>>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to [email protected].
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/tesseract-ocr/1d3e54cc-3f53-4ad3-b870-171bb26fc6eb%40googlegroups.com
>>> <https://groups.google.com/d/msgid/tesseract-ocr/1d3e54cc-3f53-4ad3-b870-171bb26fc6eb%40googlegroups.com?utm_medium=email&utm_source=footer>
>>> .
>>>
>> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/88bfa189-4a1e-4528-857c-013248b5ee4b%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/88bfa189-4a1e-4528-857c-013248b5ee4b%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduVrD9Vo8HUFWe_dr6c6Gs2EPOB2bh9DfkmAtA85cKp8fQ%40mail.gmail.com.

Reply via email to