Hello Wincent,

Thanks for the new version of package.
No errors regarding font now and not slow either.

Tested on Ubuntu.

On Mon, Feb 10, 2020 at 12:28 AM Wincent Balin <[email protected]>
wrote:

> Hello Shree,
>
> I just uploaded new version of the package. About the fixes:
>
> 1. --fonts_dir: I added the default value of the fonts directory on
> different platforms.
>
> 2. Amount of threads: I also capped the maximal amount of threads to the
> number of CPUs.
>
> Would you like to re-test it, please?
>
>
>
> Am Dienstag, 4. Februar 2020 12:21:49 UTC+1 schrieb shree:
>>
>> By the way, I added a create_ground_truth utility, which creates .gt.txt
>>> files as well as the associated .tif files for every specified font, to
>>> the package. I think it could be useful for anyone who does not have a
>>> ground truth collection yet.
>>>
>>> Thanks, I tried it with latest tesseract code.
>>
>> 1. Error when --fonts_dir is not specified, works ok, when specified.
>>
>> 2. Very slow (10 mins), started 20 text2image processes in parallel for
>> training_text with 20 lines.
>>
>>  create_ground_truth --fonts_dir ~/.fonts --fonts "Arial Unicode MS"
>> corpora ground-truth
>> 2020-02-04 11:01:19,135 INFO     Processing .txt files
>> 2020-02-04 11:01:19,137 INFO     Generating .tif files
>> 2020-02-04 11:10:24,855 INFO     Done
>>
>> Much faster (1 second) after setting  export OMP_THREAD_LIMIT=1
>>
>>  export OMP_THREAD_LIMIT=1
>>  create_ground_truth --fonts_dir ~/.fonts --fonts "Arial Unicode MS"
>> corpora ground-truth
>> 2020-02-04 11:12:18,713 INFO     Processing .txt files
>> 2020-02-04 11:12:18,715 INFO     Generating .tif files
>> 2020-02-04 11:12:19,398 INFO     Done
>>
>> You can update the documenation.
>>
>> <http://bhajans.ramparivar.com>
>>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/ec83d722-4bac-46cf-b501-d4d990816596%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/ec83d722-4bac-46cf-b501-d4d990816596%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
>


-- 

____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduW_H7-wKW1csCMU1S_grTxmV8noo6Dd5q_KCC%2BBH-apTQ%40mail.gmail.com.

Reply via email to