https://apple.stackexchange.com/questions/128091/where-can-i-find-default-microsoft-fonts-calibri-cambria


On Thu, Oct 3, 2019 at 1:33 PM Dustin Theobald <d.theo1...@gmail.com> wrote:

> Ok. Thank you very much for your help! I'll get it to work somehow!
>
> Cheers,
> Dustin
>
> Am Mittwoch, 2. Oktober 2019 16:46:25 UTC+2 schrieb shree:
>>
>> Sorry, don't know how to add those fonts for Mac.
>>
>> The tutorial uses the following set of fonts:
>>
>> https://github.com/tesseract-ocr/tesseract/blob/master/src/training/language-specific.sh#L42
>>
>>
>> You could use a similar set of fonts available on the Mac and assign via
>> fontlist.
>>
>> On Wed, Oct 2, 2019 at 7:38 PM Dustin Theobald <d.th...@gmail.com> wrote:
>>
>>> Hey shree,
>>>
>>> do you know how to manually install the missing fonts for MAC, like in
>>> your docu for linux:
>>>
>>> sudo apt update
>>> sudo apt install ttf-mscorefonts-installer
>>> sudo apt install fonts-dejavu
>>> fc-cache -vf
>>>
>>> Thank you in advance!
>>>
>>> Best regards,
>>> Dustin
>>>
>>> Am Mittwoch, 2. Oktober 2019 11:26:28 UTC+2 schrieb shree:
>>>>
>>>> >This doesn't work on my MAC. I can't find some of the fonts, so I
>>>> only try to create trainingdata for Arial, if use the
>>>> 5-makedata-plusminus.sh, he is only rendering (creating 2 pages), which
>>>> seems odd.
>>>>
>>>> 2 pages should be ok because it uses the training_text from langdata
>>>> repo which is around 80 lines plus the extra lines added with plusminus.
>>>>
>>>> On Wed, Oct 2, 2019 at 2:53 PM Shree Devi Kumar <shree...@gmail.com>
>>>> wrote:
>>>>
>>>>> 1. You could install on linux using the appropriate package from
>>>>> https://github.com/tesseract-ocr/tesseract/wiki#tesseract-4-packages-with-lstm-engine-and-related-traineddata
>>>>>
>>>>> OR
>>>>>
>>>>> 2. When building tesseract from git source, follow
>>>>> https://github.com/tesseract-ocr/tesseract/wiki/Compiling-%E2%80%93-GitInstallation#build-with-training-tools
>>>>>
>>>>> You seem to be missing some steps there.
>>>>>
>>>>> On Wed, Oct 2, 2019 at 2:32 PM Dustin Theobald <d.th...@gmail.com>
>>>>> wrote:
>>>>>
>>>>>> Hey Shree,
>>>>>>
>>>>>> Thank you for your help!
>>>>>>
>>>>>> This doesn't work on my MAC. I can't find some of the fonts, so I
>>>>>> only try to create trainingdata for Arial, if use the
>>>>>> 5-makedata-plusminus.sh, he is only rendering (creating 2 pages), which
>>>>>> seems odd.
>>>>>>
>>>>>> I'm switching to my linux now, but I have problems installing
>>>>>> tesseract.
>>>>>>
>>>>>> I'm following the documentation:
>>>>>>
>>>>>> sudo apt install tesseract-ocr
>>>>>>
>>>>>> After, I try to find the folder to run
>>>>>>
>>>>>> make
>>>>>> make training
>>>>>> make training-install
>>>>>>
>>>>>>  But I cannot find the folder (on ubuntu)
>>>>>>
>>>>>> So, I clone the GitHub Repository:
>>>>>> https://github.com/tesseract-ocr/tesseract
>>>>>> to my Desktop and run ./autogen.sh ./configure, make, make training,
>>>>>> sudo make trainng-install
>>>>>>
>>>>>> But then I'll get the following error when running
>>>>>> 5-makedata-plusminus.sh:
>>>>>>
>>>>>> /usr/local/bin/text2image: error while loading shared libraries:
>>>>>> libtesseract.so.5: cannot open shared object file: No such file or 
>>>>>> directory
>>>>>> ERROR: Program text2image failed. Abort.
>>>>>>
>>>>>> Thank you very much for your help!
>>>>>>
>>>>>> Am Dienstag, 1. Oktober 2019 17:41:36 UTC+2 schrieb shree:
>>>>>>>
>>>>>>> specifically
>>>>>>> https://github.com/Shreeshrii/tess4training/blob/master/6-plusminus.log#L429
>>>>>>>
>>>>>>> On Tue, Oct 1, 2019 at 9:09 PM Shree Devi Kumar <shree...@gmail.com>
>>>>>>> wrote:
>>>>>>>
>>>>>>>> See https://github.com/Shreeshrii/tess4training
>>>>>>>>
>>>>>>>> On Tue, Oct 1, 2019 at 7:53 PM Dustin Theobald <d.th...@gmail.com>
>>>>>>>> wrote:
>>>>>>>>
>>>>>>>>> Changed my evaluation to:
>>>>>>>>>
>>>>>>>>> ~/../../usr/local/bin/lstmeval \
>>>>>>>>>   --model ~/Desktop/tesstutorial/trainplusminus/
>>>>>>>>> *plusminus_checkpoint* \
>>>>>>>>>   --traineddata
>>>>>>>>> ~/Desktop/tesstutorial/trainplusminus/eng/eng.traineddata \
>>>>>>>>>   --eval_listfile
>>>>>>>>> ~/Desktop/tesstutorial/trainplusminus/eng.training_files.txt 2>&1 | 
>>>>>>>>> grep ±
>>>>>>>>>
>>>>>>>>> Still doesn't work.
>>>>>>>>>
>>>>>>>>> Am Dienstag, 1. Oktober 2019 14:39:48 UTC+2 schrieb Dustin
>>>>>>>>> Theobald:
>>>>>>>>>>
>>>>>>>>>> Hey guys,
>>>>>>>>>>
>>>>>>>>>> I have a Problem when Finetuning Characters (trying the ± approach
>>>>>>>>>> on
>>>>>>>>>> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00
>>>>>>>>>> )
>>>>>>>>>>
>>>>>>>>>> (I'm working on a MAC)
>>>>>>>>>>
>>>>>>>>>> My tesseract version:
>>>>>>>>>>
>>>>>>>>>> tesseract 5.0.0-alpha-457-gb3b74
>>>>>>>>>>
>>>>>>>>>>  leptonica-1.78.0
>>>>>>>>>>
>>>>>>>>>>   libgif 5.1.4 : libjpeg 9c : libpng 1.6.37 : libtiff 4.0.10 :
>>>>>>>>>> zlib 1.2.11 : libwebp 1.0.3 : libopenjp2 2.3.1
>>>>>>>>>>
>>>>>>>>>>  Found AVX2
>>>>>>>>>>
>>>>>>>>>>  Found AVX
>>>>>>>>>>
>>>>>>>>>>  Found FMA
>>>>>>>>>>
>>>>>>>>>>  Found SSE
>>>>>>>>>>
>>>>>>>>>>  Found libarchive 3.4.0 zlib/1.2.11 liblzma/5.2.4 bz2lib/1.0.6
>>>>>>>>>>
>>>>>>>>>> My bashscript looks at follows: https://pastebin.com/XK4CkuM2
>>>>>>>>>>
>>>>>>>>>> When I evaluate via:
>>>>>>>>>>
>>>>>>>>>> ~/../../usr/local/bin/lstmeval \
>>>>>>>>>>   --model ~/Desktop/tesstutorial/trainplusminus/eng.traineddata \
>>>>>>>>>>   --traineddata
>>>>>>>>>> ~/Desktop/tesstutorial/trainplusminus/eng/eng.traineddata \
>>>>>>>>>>   --eval_listfile
>>>>>>>>>> ~/Desktop/tesstutorial/trainplusminus/eng.training_files.txt 2>&1 | 
>>>>>>>>>> grep ±
>>>>>>>>>>
>>>>>>>>>> I don't get any OCR Line correctly.
>>>>>>>>>>
>>>>>>>>>> Does someone see a mistake in my code?
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> --
>>>>>>>>> You received this message because you are subscribed to the Google
>>>>>>>>> Groups "tesseract-ocr" group.
>>>>>>>>> To unsubscribe from this group and stop receiving emails from it,
>>>>>>>>> send an email to tesser...@googlegroups.com.
>>>>>>>>> To view this discussion on the web visit
>>>>>>>>> https://groups.google.com/d/msgid/tesseract-ocr/e9ba2635-6308-41a8-8150-e5d4da520269%40googlegroups.com
>>>>>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/e9ba2635-6308-41a8-8150-e5d4da520269%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>>>>>>> .
>>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> --
>>>>>>>>
>>>>>>>> ____________________________________________________________
>>>>>>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> --
>>>>>>>
>>>>>>> ____________________________________________________________
>>>>>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>>>>>>
>>>>>> --
>>>>>> You received this message because you are subscribed to the Google
>>>>>> Groups "tesseract-ocr" group.
>>>>>> To unsubscribe from this group and stop receiving emails from it,
>>>>>> send an email to tesser...@googlegroups.com.
>>>>>> To view this discussion on the web visit
>>>>>> https://groups.google.com/d/msgid/tesseract-ocr/d44cd443-da72-4df4-9a7c-aae082726010%40googlegroups.com
>>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/d44cd443-da72-4df4-9a7c-aae082726010%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>>>> .
>>>>>>
>>>>>
>>>>>
>>>>> --
>>>>>
>>>>> ____________________________________________________________
>>>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>>>>
>>>>
>>>>
>>>> --
>>>>
>>>> ____________________________________________________________
>>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to tesser...@googlegroups.com.
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/tesseract-ocr/0a2e9693-553a-4340-832d-79a31da74314%40googlegroups.com
>>> <https://groups.google.com/d/msgid/tesseract-ocr/0a2e9693-553a-4340-832d-79a31da74314%40googlegroups.com?utm_medium=email&utm_source=footer>
>>> .
>>>
>>
>>
>> --
>>
>> ____________________________________________________________
>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/ca6dd8f3-27d1-4ab5-bfe1-45011e63223e%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/ca6dd8f3-27d1-4ab5-bfe1-45011e63223e%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
>


-- 

____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduU91KVJu0ebWOohsdGZU4B8AUvGPdZrc44f%2BFAM0mTzwQ%40mail.gmail.com.

Reply via email to