Disregard my last question. I figured out how to modify the batch size, and found that the run hangs indefinitely after processing the first batch of files whenever the specified batch size is smaller than the number of files I want to process. I set the batch size to 9999 and everything seems to be working fine now. Odd.
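To illustrate why 9999 sidesteps the hang, here is a minimal sketch of how a file list splits into batches. This is not the actual tesstrain.sh logic: `split_into_batches`, the file names, and the batch size of 100 are all hypothetical, chosen only to show that a batch size larger than the file count yields a single batch, so nothing is left to process after the first one.

```python
from typing import List, Sequence


def split_into_batches(files: Sequence[str], batch_size: int) -> List[List[str]]:
    """Split files into consecutive batches of at most batch_size items.

    Hypothetical helper for illustration; not part of tesstrain.sh.
    """
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    return [list(files[i:i + batch_size]) for i in range(0, len(files), batch_size)]


# Placeholder names standing in for the ~1400 tif/box pairs.
files = [f"page_{n:04d}.tif" for n in range(1400)]

# A batch size above the file count puts everything in one batch.
print(len(split_into_batches(files, 9999)))

# A smaller (hypothetical) batch size produces several batches,
# which is the situation where I saw the hang after the first batch.
print(len(split_into_batches(files, 100)))
```

With a single batch there is no second batch to hand off to, which is presumably why the hang goes away. That is a guess about the cause, not something I have confirmed in the scripts.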
On Friday, January 4, 2019 at 11:27:37 AM UTC-5, [email protected] wrote:
> Hey all,
>
> I'm currently working on a program that explores the handwritten OCR capabilities of Tesseract.
>
> I have ~1400 images with ~8 handwritten text lines per image, each with an accompanying BOX file. Additionally, I've got a couple of handwritten fonts that I'm using to bootstrap the training process.
>
> One problem I'm having is that when I invoke tesstrain.sh, it will consistently fail at some point (mostly around Phase E) when more than 7 box/tif pairs or fonts are provided as input. I've tried combinations where all the inputs are font files, all the inputs are handwritten tif/box pairs, and the inputs are a mix of the two.
>
> I had originally tried using Shree's modified boxtrain files but was receiving an error that had to do with failing to read in a unicharset file. So, I modified tesstrain.sh and tesstrain_utils.sh myself (referencing Shree's modified scripts) to work with my own tif/box pairs.
>
> Is there a limit to the number of inputs to tesstrain.sh that should be followed, or should I be able to give tesstrain.sh all 1400 of my images with no problem?
>
> Thanks,
> Tim Snyder

