Hello,

I have used the uncombine option of combine_tessdata to uncombine eng,

The command was:  combine_tessdata -u eng.traineddata 
/home/carlos/tesseract.training/experimentos/sep/eng.

This separated the files into:

eng.bigram-dawg  
eng.cube-unicharset  
eng.freq-dawg  
eng.normproto    
eng.pffmtable  
eng.shapetable     
eng.unicharset
eng.config       
eng.cube-word-dawg   
eng.inttemp    
eng.number-dawg  
eng.punc-dawg  
eng.unicharambigs  
eng.word-dawg

This will be handy to extract the word list dictionaries 
using dawg2wordlist, so beautiful to look at dictionaries.

But, what I would like to do is to retrieve the original TR files used for 
this trained set of data.

So, eng.normproto is the result of executing cntraining over the tr 
sources. Is there a reverse function? The man page does not show anything 
and I've tried to search for it online before asking.

The next step, is probably searching inside the source code if such 
possibility exists in case no one here knows about it.

Thanks in advance!


-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.


Reply via email to