Dear shree, am having a problem training the model, When I added more samples ... the result got worse, is there a best practice to add training data to train the model ?
Regards On Thu, Jul 11, 2019 at 3:15 PM fady taher <fadytahe...@gmail.com> wrote: > so ... I added "Cr⁶⁺" 66 times but am getting "Cr³+" instead ... should > I increase the training data with more samples ?? > > On Wed, Jul 10, 2019 at 4:56 PM Shree Devi Kumar <shreesh...@gmail.com> > wrote: > >> No. It just means that you have ~25 (136-111) more characters in your new >> unicharset that you are training on. >> >> *given outputs 111 not equal to unicharset of 136.* >> >> >> >> On Wed, Jul 10, 2019 at 8:01 PM fady taher <fadytahe...@gmail.com> wrote: >> >>> should I worry regarding the below error ? >>> >>> Warning: LSTMTrainer deserialized an LSTMRecognizer! >>> Continuing from ../tesstutorial/eng_layer_eng/eng.lstm >>> *Appending a new network to an old one!!Warning: given outputs 111 not >>> equal to unicharset of 136.* >>> Num outputs,weights in Series: >>> Lfx256:256, 361472 >>> Fc136:136, 34952 >>> Total weights = 396424 >>> Built network:[1,36,0,1[C3,3Ft16]Mp3,3Lfys64Lfx96Lrx96Lfx256Fc136] from >>> request [Lfx256 O1c111] >>> Training parameters: >>> Debug interval = 0, weights = 0.1, learning rate = 0.001, momentum=0.5 >>> null char=135 >>> >>> On Tue, Jul 9, 2019 at 1:41 PM fady taher <fadytahe...@gmail.com> wrote: >>> >>>> will try and feed you back, thanks alot >>>> >>>> On Tue, Jul 9, 2019 at 1:40 PM Shree Devi Kumar <shreesh...@gmail.com> >>>> wrote: >>>> >>>>> I don't think I had any (or enough) plus superscript in my >>>>> training_text. >>>>> >>>>> Treat this as an example and train as per the data you expect. >>>>> >>>>> On Tue, 9 Jul 2019, 17:01 fady taher, <fadytahe...@gmail.com> wrote: >>>>> >>>>>> Dear Shree, thanks for you quick response ... I gave a try to the >>>>>> submodule ... it gave results to *Cr⁶⁶ *while it should have been *Cr >>>>>> *⁶*⁺ *any ideas if this is solvable *?* >>>>>> >>>>>> >>>>>> Regards >>>>>> >>>>>> On Tue, Jul 9, 2019 at 1:14 PM Shree Devi Kumar <shreesh...@gmail.com> >>>>>> wrote: >>>>>> >>>>>>> If you use the submodule you will save time taken in running the >>>>>>> 8-makedata_layernew.sh script. However, if you have modified >>>>>>> training_text or want to checkout the full process, run the script. >>>>>>> >>>>>>> On Tue, Jul 9, 2019 at 4:33 PM fady taher <fadytahe...@gmail.com> >>>>>>> wrote: >>>>>>> >>>>>>>> I can see that you have mentioned >>>>>>>>> >>>>>>>> "IT IS NOT REQUIRED TO RUN THIS SCRIPT AS THE OUTPUT FOLDERS ARE >>>>>>>> PROVIDED AS A SUBMODULE IN THE REPO. Use git submodule update --init to >>>>>>>> download the files (approx 600MB)." >>>>>>>> so, should I just use the eng.traineddata found in tessdata folder ? >>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>>>> You received this message because you are subscribed to the Google >>>>>>>> Groups "tesseract-ocr" group. >>>>>>>> To unsubscribe from this group and stop receiving emails from it, >>>>>>>> send an email to tesseract-ocr+unsubscr...@googlegroups.com. >>>>>>>> To post to this group, send email to tesseract-ocr@googlegroups.com >>>>>>>> . >>>>>>>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>>>>>>> To view this discussion on the web visit >>>>>>>> https://groups.google.com/d/msgid/tesseract-ocr/CADhGFTzKp3_jx_yxT7YkvabM8g5WnAjXoMWXM5UL6or5W4uz3A%40mail.gmail.com >>>>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/CADhGFTzKp3_jx_yxT7YkvabM8g5WnAjXoMWXM5UL6or5W4uz3A%40mail.gmail.com?utm_medium=email&utm_source=footer> >>>>>>>> . >>>>>>>> For more options, visit https://groups.google.com/d/optout. >>>>>>>> >>>>>>> >>>>>>> >>>>>>> -- >>>>>>> >>>>>>> ____________________________________________________________ >>>>>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >>>>>>> >>>>>>> -- >>>>>>> You received this message because you are subscribed to the Google >>>>>>> Groups "tesseract-ocr" group. >>>>>>> To unsubscribe from this group and stop receiving emails from it, >>>>>>> send an email to tesseract-ocr+unsubscr...@googlegroups.com. >>>>>>> To post to this group, send email to tesseract-ocr@googlegroups.com. >>>>>>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>>>>>> To view this discussion on the web visit >>>>>>> https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduURF7oAZWeRqTWj%3DC%3D9D2kxmTnJNVV2GjUHR9jZH82iiQ%40mail.gmail.com >>>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduURF7oAZWeRqTWj%3DC%3D9D2kxmTnJNVV2GjUHR9jZH82iiQ%40mail.gmail.com?utm_medium=email&utm_source=footer> >>>>>>> . >>>>>>> For more options, visit https://groups.google.com/d/optout. >>>>>>> >>>>>> -- >>>>>> You received this message because you are subscribed to the Google >>>>>> Groups "tesseract-ocr" group. >>>>>> To unsubscribe from this group and stop receiving emails from it, >>>>>> send an email to tesseract-ocr+unsubscr...@googlegroups.com. >>>>>> To post to this group, send email to tesseract-ocr@googlegroups.com. >>>>>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>>>>> To view this discussion on the web visit >>>>>> https://groups.google.com/d/msgid/tesseract-ocr/CADhGFTwwQUp1G-PtjUr7mVy4pGM0%3Do%2BrMNyhxEvOjP-ThzGDrg%40mail.gmail.com >>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/CADhGFTwwQUp1G-PtjUr7mVy4pGM0%3Do%2BrMNyhxEvOjP-ThzGDrg%40mail.gmail.com?utm_medium=email&utm_source=footer> >>>>>> . >>>>>> For more options, visit https://groups.google.com/d/optout. >>>>>> >>>>> -- >>>>> You received this message because you are subscribed to the Google >>>>> Groups "tesseract-ocr" group. >>>>> To unsubscribe from this group and stop receiving emails from it, send >>>>> an email to tesseract-ocr+unsubscr...@googlegroups.com. >>>>> To post to this group, send email to tesseract-ocr@googlegroups.com. >>>>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>>>> To view this discussion on the web visit >>>>> https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUB7evdPLdvVOxvZ2q87vYVKgmbKk9H-H6SakqzXTc8jA%40mail.gmail.com >>>>> <https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUB7evdPLdvVOxvZ2q87vYVKgmbKk9H-H6SakqzXTc8jA%40mail.gmail.com?utm_medium=email&utm_source=footer> >>>>> . >>>>> For more options, visit https://groups.google.com/d/optout. >>>>> >>>> -- >>> You received this message because you are subscribed to the Google >>> Groups "tesseract-ocr" group. >>> To unsubscribe from this group and stop receiving emails from it, send >>> an email to tesseract-ocr+unsubscr...@googlegroups.com. >>> To post to this group, send email to tesseract-ocr@googlegroups.com. >>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>> To view this discussion on the web visit >>> https://groups.google.com/d/msgid/tesseract-ocr/CADhGFTyW1s1LbewRGuFS8GDF70QxTdzGJGaO%2ByyA2OvCaw0d7w%40mail.gmail.com >>> <https://groups.google.com/d/msgid/tesseract-ocr/CADhGFTyW1s1LbewRGuFS8GDF70QxTdzGJGaO%2ByyA2OvCaw0d7w%40mail.gmail.com?utm_medium=email&utm_source=footer> >>> . >>> For more options, visit https://groups.google.com/d/optout. >>> >> >> >> -- >> >> ____________________________________________________________ >> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to tesseract-ocr+unsubscr...@googlegroups.com. >> To post to this group, send email to tesseract-ocr@googlegroups.com. >> Visit this group at https://groups.google.com/group/tesseract-ocr. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduV8zk0mVMTepBUD-QyUvphVX0bf0PN5Go%3D%3Dra8bHOp9YA%40mail.gmail.com >> <https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduV8zk0mVMTepBUD-QyUvphVX0bf0PN5Go%3D%3Dra8bHOp9YA%40mail.gmail.com?utm_medium=email&utm_source=footer> >> . >> For more options, visit https://groups.google.com/d/optout. >> > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CADhGFTygrVSKiE_rXmnYG%2BR-VoEn-rm%2BfhgFSm2y1Dh8%3DA2-cA%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.