>This doesn't work on my MAC. I can't find some of the fonts, so I only try
to create trainingdata for Arial, if use the 5-makedata-plusminus.sh, he is
only rendering (creating 2 pages), which seems odd.

2 pages should be ok because it uses the training_text from langdata repo
which is around 80 lines plus the extra lines added with plusminus.

On Wed, Oct 2, 2019 at 2:53 PM Shree Devi Kumar <shreesh...@gmail.com>
wrote:

> 1. You could install on linux using the appropriate package from
> https://github.com/tesseract-ocr/tesseract/wiki#tesseract-4-packages-with-lstm-engine-and-related-traineddata
>
> OR
>
> 2. When building tesseract from git source, follow
> https://github.com/tesseract-ocr/tesseract/wiki/Compiling-%E2%80%93-GitInstallation#build-with-training-tools
>
> You seem to be missing some steps there.
>
> On Wed, Oct 2, 2019 at 2:32 PM Dustin Theobald <d.theo1...@gmail.com>
> wrote:
>
>> Hey Shree,
>>
>> Thank you for your help!
>>
>> This doesn't work on my MAC. I can't find some of the fonts, so I only
>> try to create trainingdata for Arial, if use the 5-makedata-plusminus.sh,
>> he is only rendering (creating 2 pages), which seems odd.
>>
>> I'm switching to my linux now, but I have problems installing tesseract.
>>
>> I'm following the documentation:
>>
>> sudo apt install tesseract-ocr
>>
>> After, I try to find the folder to run
>>
>> make
>> make training
>> make training-install
>>
>>  But I cannot find the folder (on ubuntu)
>>
>> So, I clone the GitHub Repository:
>> https://github.com/tesseract-ocr/tesseract
>> to my Desktop and run ./autogen.sh ./configure, make, make training, sudo
>> make trainng-install
>>
>> But then I'll get the following error when running
>> 5-makedata-plusminus.sh:
>>
>> /usr/local/bin/text2image: error while loading shared libraries:
>> libtesseract.so.5: cannot open shared object file: No such file or directory
>> ERROR: Program text2image failed. Abort.
>>
>> Thank you very much for your help!
>>
>> Am Dienstag, 1. Oktober 2019 17:41:36 UTC+2 schrieb shree:
>>>
>>> specifically
>>> https://github.com/Shreeshrii/tess4training/blob/master/6-plusminus.log#L429
>>>
>>> On Tue, Oct 1, 2019 at 9:09 PM Shree Devi Kumar <shree...@gmail.com>
>>> wrote:
>>>
>>>> See https://github.com/Shreeshrii/tess4training
>>>>
>>>> On Tue, Oct 1, 2019 at 7:53 PM Dustin Theobald <d.th...@gmail.com>
>>>> wrote:
>>>>
>>>>> Changed my evaluation to:
>>>>>
>>>>> ~/../../usr/local/bin/lstmeval \
>>>>>   --model ~/Desktop/tesstutorial/trainplusminus/*plusminus_checkpoint*
>>>>> \
>>>>>   --traineddata
>>>>> ~/Desktop/tesstutorial/trainplusminus/eng/eng.traineddata \
>>>>>   --eval_listfile
>>>>> ~/Desktop/tesstutorial/trainplusminus/eng.training_files.txt 2>&1 | grep ±
>>>>>
>>>>> Still doesn't work.
>>>>>
>>>>> Am Dienstag, 1. Oktober 2019 14:39:48 UTC+2 schrieb Dustin Theobald:
>>>>>>
>>>>>> Hey guys,
>>>>>>
>>>>>> I have a Problem when Finetuning Characters (trying the ± approach
>>>>>> on
>>>>>> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00
>>>>>> )
>>>>>>
>>>>>> (I'm working on a MAC)
>>>>>>
>>>>>> My tesseract version:
>>>>>>
>>>>>> tesseract 5.0.0-alpha-457-gb3b74
>>>>>>
>>>>>>  leptonica-1.78.0
>>>>>>
>>>>>>   libgif 5.1.4 : libjpeg 9c : libpng 1.6.37 : libtiff 4.0.10 : zlib
>>>>>> 1.2.11 : libwebp 1.0.3 : libopenjp2 2.3.1
>>>>>>
>>>>>>  Found AVX2
>>>>>>
>>>>>>  Found AVX
>>>>>>
>>>>>>  Found FMA
>>>>>>
>>>>>>  Found SSE
>>>>>>
>>>>>>  Found libarchive 3.4.0 zlib/1.2.11 liblzma/5.2.4 bz2lib/1.0.6
>>>>>>
>>>>>> My bashscript looks at follows: https://pastebin.com/XK4CkuM2
>>>>>>
>>>>>> When I evaluate via:
>>>>>>
>>>>>> ~/../../usr/local/bin/lstmeval \
>>>>>>   --model ~/Desktop/tesstutorial/trainplusminus/eng.traineddata \
>>>>>>   --traineddata
>>>>>> ~/Desktop/tesstutorial/trainplusminus/eng/eng.traineddata \
>>>>>>   --eval_listfile
>>>>>> ~/Desktop/tesstutorial/trainplusminus/eng.training_files.txt 2>&1 | grep 
>>>>>> ±
>>>>>>
>>>>>> I don't get any OCR Line correctly.
>>>>>>
>>>>>> Does someone see a mistake in my code?
>>>>>>
>>>>>>
>>>>>>
>>>>>> --
>>>>> You received this message because you are subscribed to the Google
>>>>> Groups "tesseract-ocr" group.
>>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>>> an email to tesser...@googlegroups.com.
>>>>> To view this discussion on the web visit
>>>>> https://groups.google.com/d/msgid/tesseract-ocr/e9ba2635-6308-41a8-8150-e5d4da520269%40googlegroups.com
>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/e9ba2635-6308-41a8-8150-e5d4da520269%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>>> .
>>>>>
>>>>
>>>>
>>>> --
>>>>
>>>> ____________________________________________________________
>>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>>>
>>>
>>>
>>> --
>>>
>>> ____________________________________________________________
>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to tesseract-ocr+unsubscr...@googlegroups.com.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/tesseract-ocr/d44cd443-da72-4df4-9a7c-aae082726010%40googlegroups.com
>> <https://groups.google.com/d/msgid/tesseract-ocr/d44cd443-da72-4df4-9a7c-aae082726010%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>>
>
>
> --
>
> ____________________________________________________________
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>


-- 

____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduW61GdJGHrZP_9KW%3D-7t2ZE3YRrbZ2tkqvCdDshQu94XA%40mail.gmail.com.

Reply via email to