[tesseract-ocr] Re: Could tesseract return the index for the picture??

2019-12-23 Thread luffy monky
Hi all I find a good command that contain the info I need. pytesseract.image_to_data that will show a lot of information about the pic So now I just need try to soft the "left""top" "width" "height" by text info but with this command will show that a string

Re: [tesseract-ocr] Interrupting and restarting lstmtraining

2019-12-23 Thread Shree Devi Kumar
You can create traineddata with the --stop-training while lstmtraining continues to run. If you are using tesstrain makefile then it has a target called traineddata which will generate traineddata file for each intermediate checkpoint. You can stop and start training but I have a feeling that tra

[tesseract-ocr] validation loss during training phase of tesseract

2019-12-23 Thread Laetitia Josse
Hi! Is there any validation step during the training phase of tesseract? Is it possible to extract informations about it? Thanks in advance -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving ema

[tesseract-ocr] Could tesseract return the index for the picture??

2019-12-23 Thread luffy monky
Hi all thanks for you. I had finish my homework~~ Now I can use the Pytesseracr.image_to_string(image) that will show all the text.. but I had an other question.. If I make sure my image had some match word like "*nothing*" and my pic by the command will show *you know nothing , John snow.*

[tesseract-ocr] Re: Potential bug report

2019-12-23 Thread Nicholas Rees
I think I'm going to submit this report at the end of the day if there are no objections here. I can't find any documentation suggesting that this shouldn't work and both variables work as intended individually, just not together. I don't have much coding experience, but I'd be more than happy

[tesseract-ocr] Re: Potential bug report

2019-12-23 Thread Nicholas Rees
> hi, > $ tesseract 205c.tif 205c --psm 6 -c > tessedit_char_whitelist=ABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789=+& > as per my knowledge tessedit_char_whitelist works with tesseract 3 not > with tesseract 4. > > - show quoted text - > >> - show quoted text - >> -- >> You received this message bec

[tesseract-ocr] Interrupting and restarting lstmtraining

2019-12-23 Thread Adam Funk
Hi, I have an lstmtraining job running without --max_iterations; it's been going for a couple of weeks now (in a docker container in screen on a server that I ssh into). Can I safely use ctrl-C to stop it, use lstmtraining --stop_training (with appropriate settings for --continue_from --trainedda

Re: [tesseract-ocr] Potential bug report

2019-12-23 Thread Ashwini Nande
hi, $ tesseract 205c.tif 205c --psm 6 -c tessedit_char_whitelist=ABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789=+& as per my knowledge tessedit_char_whitelist works with tesseract 3 not with tesseract 4. On Mon, Dec 23, 2019 at 3:10 PM Nicholas Rees wrote: > Below is a bug report that I'm considering maki

[tesseract-ocr] Potential bug report

2019-12-23 Thread Nicholas Rees
Below is a bug report that I'm considering making. However, I'm not entirely positive that its a bug and I'd like someone who knows more about this to check this and make sure that this is a bug so I'm not wasting anyone's time. The following is the bug report that I'll post if you guys think i