Re: [tesseract-ocr] Error:Assert failed:in file ../lstm/lstmtrainer.h, line 110

2017-08-28 Thread ShreeDevi Kumar
Please see
https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00#fine-tuning-for-impact

The following command extracts the .lstm file from the .traineddata file.

training/combine_tessdata -e tessdata/best/eng.traineddata \
  ~/tesstutorial/impact_from_full/eng.lstm


ShreeDevi

भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Mon, Aug 28, 2017 at 3:01 PM, Ava Nimaee  wrote:

> Hi shree
> I read instructions on the training wiki page but i dont have eng.lstm
> non of the syntaxs create eng.lstm. how can i create it. even i check
> langdata which i download it form git amd there is't there.
> i spend alot of time but i don't khonw how i can create it.
> can you tell me.
>
> On Monday, August 21, 2017 at 7:41:41 PM UTC+4:30, shree wrote:
>>
>> lstm file is the language model. It is saved in traineddata file.
>>
>> dawgs are a kind of compressed files, created from lists of words,
>> punctuation or numbers.
>>
>> You can use dawg2wordlist to unpack them.
>>
>> Please follow the instructions on the training wiki page.
>>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit https://groups.google.com/d/
> msgid/tesseract-ocr/b671d71c-181d-4cac-8def-122c74a0af12%
> 40googlegroups.com
> 
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUAtupdX4spDuC092W6FPzjf2XcmLjSvTEiWadjD9_8ig%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] Error:Assert failed:in file ../lstm/lstmtrainer.h, line 110

2017-08-28 Thread Ava Nimaee
Hi shree
I read instructions on the training wiki page but i dont have eng.lstm
non of the syntaxs create eng.lstm. how can i create it. even i check 
langdata which i download it form git amd there is't there.
i spend alot of time but i don't khonw how i can create it.
can you tell me.

On Monday, August 21, 2017 at 7:41:41 PM UTC+4:30, shree wrote:
>
> lstm file is the language model. It is saved in traineddata file.
>
> dawgs are a kind of compressed files, created from lists of words, 
> punctuation or numbers.
>
> You can use dawg2wordlist to unpack them.
>
> Please follow the instructions on the training wiki page.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/b671d71c-181d-4cac-8def-122c74a0af12%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] Error:Assert failed:in file ../lstm/lstmtrainer.h, line 110

2017-08-21 Thread ShreeDevi Kumar
lstm file is the language model. It is saved in traineddata file.

dawgs are a kind of compressed files, created from lists of words,
punctuation or numbers.

You can use dawg2wordlist to unpack them.

Please follow the instructions on the training wiki page.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUHTAxNsd81rf6Pm3yL1kZpnsDqcocm2%2BRAdWuGde5hRQ%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] Error:Assert failed:in file ../lstm/lstmtrainer.h, line 110

2017-08-21 Thread ShreeDevi Kumar
training/combine_tessdata -e tessdata/best/eng.traineddata \
  ~/tesstutorial/impact_from_full/eng.lstm


On 04-Aug-2017 12:03 PM,  wrote:

> Hello,
>
> I use the 'git pull' command to update the code from the link
> https://github.com/tesseract-ocr/tesseract.git, and I recompile,
> reinstall the Tess4.0.
>
> But when I execute the command (showed in below) to finetune the
> traineddata, an error appears: 
> "mgr_.Init(traineddata_path.c_str()):Error:Assert
> failed:in file ../lstm/lstmtrainer.h, line 110"
>
> lstmtraining --model_output ~/tesstutorial/chituned_from_chisim/chituned \
> --continue_from ~/tesstutorial/chituned_from_chisim/chi_sim.lstm \
> --train_listfile ~/tesstutorial/chitest/chi_sim.training_files.txt \
> --eval_listfile ~/tesstutorial/chitest/chi_sim.training_files.txt \
> --target_error_rate 0.01
>
>
>
> There is nothing wrong with the Tess before updating the code. But now, An
> assertion error crashes. Why? Can you help me?
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit https://groups.google.com/d/
> msgid/tesseract-ocr/75ba4766-370a-46c0-88b0-a15456aa7c9f%
> 40googlegroups.com
> 
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduX50NmMX9hk5Mjm4MijuSC8Rb%2BD4M9hxTUWh8Kmrtz1qg%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.