Hi Shreeshrii, Can you please tell me the training command used? Also, how can I create the graphs and these other documents?
On Sat, 26 Dec 2020, 18:37 Shree Devi Kumar, <[email protected]> wrote: > Soumik, > > I used your groundtruth and trained using ben as the START_MODEL. I got > best results on the validation set of images at around 5000 iterations. see > attached Accuracy report and CER graph. > > > > On Thu, Dec 24, 2020 at 8:36 PM Soumik Ranjan Dasgupta < > [email protected]> wrote: > >> Hi everyone, >> I wanted to do fine-tune the ben.traineddata model by using some ancient >> text that were supposedly printed with typeset. I have roughly around 1k >> lines of text and tried the normal fine-tuning approach with around 25k >> iterations. >> The thing that surprised me the most was even after packing the >> traineddata (character error was around 4%) and testing an unseen image, >> the performance was exactly the same. Not a single character was different! >> You can find the traineddata, training data, the logs and the source code >> at this link: >> https://github.com/srdg/unarchived_ben_tess/releases/tag/v0.0.4-alpha >> >> Can anyone tell me exactly what I am doing wrong here? Do I need to >> change any training parameter, increase my training data, or anything else >> completely? >> >> Best regards, >> Soumik >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected]. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/1fc044d1-b0ae-45d5-9041-e6fbf8ec5089n%40googlegroups.com >> <https://groups.google.com/d/msgid/tesseract-ocr/1fc044d1-b0ae-45d5-9041-e6fbf8ec5089n%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> > > > -- > > ____________________________________________________________ > भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduVZ3A7CUEqw29Gxu6r1-cLHPTLFt%3D%3D0C0109D_6x6C7Kw%40mail.gmail.com > <https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduVZ3A7CUEqw29Gxu6r1-cLHPTLFt%3D%3D0C0109D_6x6C7Kw%40mail.gmail.com?utm_medium=email&utm_source=footer> > . > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAM-%2BFN%3DZggnH4wV5vUhY9nsSqjKg9xZ5TQDoCMwSqf7H0oPogQ%40mail.gmail.com.

