Hello, I have used the uncombine option of combine_tessdata to uncombine eng,
The command was: combine_tessdata -u eng.traineddata /home/carlos/tesseract.training/experimentos/sep/eng. This separated the files into: eng.bigram-dawg eng.cube-unicharset eng.freq-dawg eng.normproto eng.pffmtable eng.shapetable eng.unicharset eng.config eng.cube-word-dawg eng.inttemp eng.number-dawg eng.punc-dawg eng.unicharambigs eng.word-dawg This will be handy to extract the word list dictionaries using dawg2wordlist, so beautiful to look at dictionaries. But, what I would like to do is to retrieve the original TR files used for this trained set of data. So, eng.normproto is the result of executing cntraining over the tr sources. Is there a reverse function? The man page does not show anything and I've tried to search for it online before asking. The next step, is probably searching inside the source code if such possibility exists in case no one here knows about it. Thanks in advance! -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

