That was very crystal clear explanation. Thank you for explaining shree. I 
got it now.
On Thursday, June 20, 2019 at 1:55:20 PM UTC+5:30, shree wrote:
>
> if [[ ${MY_BOXTIFF_DIR} != "" ]]; then
>     tlog "\n=== Copy existing box/tiff pairs from '${MY_BOXTIFF_DIR}'"
>     cp  ${MY_BOXTIFF_DIR}/*.box ${TRAINING_DIR} | true
>     cp  ${MY_BOXTIFF_DIR}/*.tif ${TRAINING_DIR} | true
>     ls -l  ${TRAINING_DIR}
> fi
>
> copies the files to training directory
>
> phase_I_generate_image 8
>
> generates box/tiff pairs from the training text and fonts specified. 
> Please note that if you had same name files copied from my_boxtiff_dir, 
> they will get overwritten,
>
> phase_UP_generate_unicharset
>
> generates unicharset from all box files in training directory (meeting the 
> file naming convention lang.xxx.exp0.box)
>
> phase_E_extract_features " --psm 6 lstm.train " 8 "lstmf"
>
> this created lstmf files from all the box/tiff pairs
>
> make__lstmdata
>
> creates the list of lstmf files
> moves all required files from tmp directory to output directory
>
>
> On Thu, Jun 20, 2019 at 10:55 AM hrishikesh kaulwar <hpka...@gmail.com 
> <javascript:>> wrote:
>
>>
>> Hey shree could you tell me what line in tesstrain.sh takes care of user 
>> provided tiff box pairs. Like what is the line which creates lstmf files 
>> from those pairs and then puts the name of lstmf files in training_list. 
>> Thanks in advance.
>> On Tuesday, June 18, 2019 at 2:54:09 PM UTC+5:30, hrishikesh kaulwar 
>> wrote:
>>>
>>> Greetings,
>>>     I just got to know that tesstrain.sh is modified to support user 
>>> provided box/tiff pairs by adding a tiff/box directory flag. I used that 
>>> version of tesseract source to use my own tiff/box pairs. But when I ran 
>>> tesstrain.sh I got to know that it just copies tiff/box pairs provided by 
>>> me to training directory but .lstmf file is generated from 
>>> eng.training_text file. My tiff/box pairs are not getting used in creating 
>>> training data. Can someone point out what mistake I am making? or some way 
>>> to only use user provided tiff/box pairs to create training data?
>>>  Thanks in advance.
>>>
>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to tesser...@googlegroups.com <javascript:>.
>> To post to this group, send email to tesser...@googlegroups.com 
>> <javascript:>.
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/166bfb15-52d9-4cc1-8f28-bb20e7ff3797%40googlegroups.com
>>  
>> <https://groups.google.com/d/msgid/tesseract-ocr/166bfb15-52d9-4cc1-8f28-bb20e7ff3797%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>
> -- 
>
> ____________________________________________________________
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/002a186a-cefc-41fc-97dc-6d7c24882abf%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to