Hi Shreeshrii,

Can you please tell me the training command you used? Also, how can I create
the graphs and the other attached documents?

On Sat, 26 Dec 2020, 18:37 Shree Devi Kumar, <[email protected]> wrote:

> Soumik,
>
> I used your ground truth and trained using ben as the START_MODEL. I got the
> best results on the validation set of images at around 5000 iterations. See
> the attached accuracy report and CER graph.
>
>
>
> On Thu, Dec 24, 2020 at 8:36 PM Soumik Ranjan Dasgupta <
> [email protected]> wrote:
>
>> Hi everyone,
>> I wanted to fine-tune the ben.traineddata model using some ancient texts
>> that were supposedly typeset. I have roughly 1k lines of text and tried
>> the normal fine-tuning approach with around 25k iterations.
>> What surprised me most was that, even after packing the traineddata (the
>> character error rate was around 4%) and testing on an unseen image, the
>> result was exactly the same as before. Not a single character was different!
>> You can find the traineddata, training data, the logs and the source code
>> at this link:
>> https://github.com/srdg/unarchived_ben_tess/releases/tag/v0.0.4-alpha
>>
>> Can anyone tell me exactly what I am doing wrong here? Do I need to
>> change any training parameters, increase my training data, or do something
>> else entirely?
>>
>> Best regards,
>> Soumik
>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to [email protected].
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/tesseract-ocr/1fc044d1-b0ae-45d5-9041-e6fbf8ec5089n%40googlegroups.com
>> <https://groups.google.com/d/msgid/tesseract-ocr/1fc044d1-b0ae-45d5-9041-e6fbf8ec5089n%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>>
>
>
> --
>
> ____________________________________________________________
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
