Please look at tesstrain.sh It is setting max-pages to 3 for text2image invocation. You can change it there.
On Tue 13 Mar, 2018, 6:54 AM , <[email protected]> wrote: > Dear all, > > I'm trying to train lstm using a large training text, different fonts, > colors etc. I'm trying to use text2image to generate my tif / box file > combinations, however text2image appears to be limited to 3 pages and thus > truncates my training text. How should I solve this? Call text2image in a > loop on the remaining training text and generate hundreds, if not > thousands, of tif / box file combos for all of my training text, fonts etc? > > Thanks for the help! > > John. > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/10bc983a-83a5-4434-afca-18cc2d5d1ce4%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/10bc983a-83a5-4434-afca-18cc2d5d1ce4%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduWdb-u-bDnc7QPS5W8-dXDsFN3vma6W5cA0emg1duExbw%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

