Re: [tesseract-ocr] train tesseract OCR 4.0

2018-10-22 Thread Shree Devi Kumar
Please see https://github.com/tesseract-ocr/tesseract/wiki and https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00#fine-tuning-for-impact On Mon, 22 Oct 2018, 06:59 kislay bajpai, wrote: > Hello, > > Sorry to disturb you, actually i am very new with tesseract and getting no >

Re: [tesseract-ocr] train tesseract OCR 4.0

2018-10-22 Thread kislay bajpai
Hello, Sorry to disturb you, actually i am very new with tesseract and getting no idea, how to train it. Please help me out. I am in big trouble. version - tesseract4.0 alpha OS - ubuntu16.04 and RHEL 7.3 (any one i can use) On Tue, Oct 16, 2018 at 7:10 PM Shree Devi Kumar wrote: > Please do

Re: [tesseract-ocr] train tesseract OCR 4.0

2018-10-16 Thread kislay bajpai
Hello Shree, I am confused how to train tesseract 4.0 alpha for new font (E 13B). Please help me for it. On Thursday, March 23, 2017 at 5:24:59 PM UTC+5:30, shree wrote: > > To read characters from an image, it is not necessary to train it. Just > use an appropriate traineddata. > > Training

Re: [tesseract-ocr] train tesseract OCR 4.0

2017-04-05 Thread srnsp92
You can use *.* when identifying the files.. but you should be careful only image files are only supplied... as it can take all available files, because * means it takes input for all the files. 1)I request you can help me with posts i had posted today.. 2) And please guide how can i generate

Re: [tesseract-ocr] train tesseract OCR 4.0

2017-04-04 Thread Saurabh Srivastav
thank you shree , you always help me. but i still have one problem that i wrote a bash script which trace the all images with .jpg extension and make their output files as the name of image. but i want that when i run script it trace more images with some different extensions like .jpg , .jpeg

Re: [tesseract-ocr] train tesseract OCR 4.0

2017-04-04 Thread ShreeDevi Kumar
Tesstrain.sh generates a file called eng.training_files.txt You are using command without .text extension Check the name of generated file and use that. I have found that editing that file also gives errors. - excuse the brevity, sent from mobile On 04-Apr-2017 7:01 PM,

Re: [tesseract-ocr] train tesseract OCR 4.0

2017-04-04 Thread srnsp92
I am trying to tesseract 4,, and i am getting folowing error,, command used: mkdir -p /home/p/Documents/T/engoutput /home/p/Documents/T/tesseract-master/training/lstmtraining -U /home/p/Documents/T/img_frm_3/unicharset \ --script_dir /home/p/Documents/T/TESS_4_ALPHA/langdata-master

Re: [tesseract-ocr] train tesseract OCR 4.0

2017-04-04 Thread ShreeDevi Kumar
See https://github.com/tesseract-ocr/tesseract/blob/master/training/tesstrain.sh https://github.com/tesseract-ocr/tesseract/blob/master/training/tesstrain_utils.sh https://github.com/tesseract-ocr/tesseract/blob/master/training/language-specific.sh -- You received this message because you are

Re: [tesseract-ocr] train tesseract OCR 4.0

2017-04-04 Thread srnsp92
Hello ShreeDevi, https://medium.com/apegroup-texts/training-tesseract-for-labels-receipts-and-such-690f452e8f79 In the link, we can see a full fledged tutorial of tesseract 3.0 version, of using it and training it. Can you please clarify the below points...?

Re: [tesseract-ocr] train tesseract OCR 4.0

2017-04-03 Thread Saurabh Srivastav
shree, actually i want a bash script which run tesseract and store ouput file in a folder.. kindly help me to make this type of bash script. thank you. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this

Re: [tesseract-ocr] train tesseract OCR 4.0

2017-04-03 Thread ShreeDevi Kumar
Saurabh, It depends on what you want to do with the bash script. Here is a sample of a script I used to compare results using diff tessdata files by looping thru a set of image files. Google the bash commands to figure out what they do! #!/bin/bash set -vx export

Re: [tesseract-ocr] train tesseract OCR 4.0

2017-04-03 Thread Saurabh Srivastav
hello shree ! thank you for your help. may you please help me how can i write a bash script for tesseract. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to

Re: [tesseract-ocr] train tesseract OCR 4.0

2017-03-22 Thread Saurabh Srivastav
Thank you shree for your valuable reply. But now i have created box files for a particuler image and trained it..but still i am missing something, may you please help me what i have to do after creating box file for that image and make tesseract to read the characters from that image. thanks

Re: [tesseract-ocr] train tesseract OCR 4.0

2017-03-02 Thread ShreeDevi Kumar
screenshot of warning means that your image does not have resolution info. Your OCR output file should have been created. Training 4.0 is not easy. Please see https://github.com/tesseract-ocr/tesseract/wiki/4.0-with-LSTM ShreeDevi भजन