Re: [tesseract-ocr] Re: Doubt on "--eval_listfile"

2018-04-10 Thread ShreeDevi Kumar
Yes, and you can use different text files for training and eval.



ShreeDevi

भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Tue, Apr 10, 2018 at 10:01 PM, Fanatico  wrote:

> wen I asked about passing the ".training_text" as a param, I meant in the
> creation of the training data "training/tesstrain.sh"
>
> On Tuesday, 10 April 2018 13:30:05 UTC-3, Fanatico wrote:
>>
>> I just thought, but can I pass only the ".training_text" file as a param ?
>> like --training_text
>>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit https://groups.google.com/d/
> msgid/tesseract-ocr/3b918a9a-0d49-4b28-b624-0e2e9df03f1a%
> 40googlegroups.com
> 
> .
>
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduVv8%3DVBvSJX7KXCJEazZjT%2Bfisj2efxB1mq2ApNGygz3g%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Re: Doubt on "--eval_listfile"

2018-04-10 Thread Fanatico
wen I asked about passing the ".training_text" as a param, I meant in the 
creation of the training data "training/tesstrain.sh"

On Tuesday, 10 April 2018 13:30:05 UTC-3, Fanatico wrote:
>
> I just thought, but can I pass only the ".training_text" file as a param ?
> like --training_text
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/3b918a9a-0d49-4b28-b624-0e2e9df03f1a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Re: Doubt on "--eval_listfile"

2018-04-10 Thread Fanatico
I see, thanks for the reply.

On Tuesday, 10 April 2018 11:45:59 UTC-3, Fanatico wrote:
>
> Platform: MAC OS X
> Tesseract: 4.0.0-beta.1-69-g10f4
>
> Wen I execute a command like:
>
> SCROLLVIEW_PATH=~/projects/tesseract/java \
>   ~/projects/tesseract/training/lstmtraining \
> --debug_interval 100 \
> --continue_from 
> ~/projects/ocr/training/kortrain/kor_from_full/kor.lstm \
> --traineddata 
> ~/projects/ocr/training/kortrain/new_train/kor/kor.traineddata \
> --append_index 5 \
> --net_spec '[Lfx256 O1c111]' \
> --model_output ~/projects/ocr/training/kortrain/kor_from_full/base \
> --train_listfile 
> ~/projects/ocr/training/kortrain/new_train/kor.training_files.txt \
> --eval_listfile 
> ~/projects/ocr/training/kortrain/eval/kor.training_files.txt \
> --target_error_rate 1 
> &>~/projects/ocr/training/kortrain/kor_from_full/basetrain.log
>
> I have "--train_listfile" that tells the location of my training files 
> for each font and I have "--eval_listfile" that I suppose is the location 
> for the training files used to test the result of the training.
>
> So my doubt is:
> 1 - Why I'm training with the fonts "A", "B" and "C" but testing with the 
> fonts "D", "E" and "F"?
> 2 - And if I need to test using the same fonts, then why do I need to pass 
> 2 times the same file?
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/c219cf38-9c05-44d9-9cf9-ab8e05b960d6%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.