Hi shree, Thanks for your reply. Is there any option to use tesstrain.sh in tesseract 4.0 to generate the traineddata and lstm files using the image and boxfiles? Or do I still have to go through the process as listed in the Tesseract 3.0 instructions? In which case, I would be able to generate the traineddata file (and the unicharset file, I think), but not the lstm file. How can I generate this lstm file? Is there a tool I can use?