Hello shree, Thank you for your valuable reply.. Are there any changes i 
need to follow for the steps below.. I request you to suggest the changes 
for the below commands, these are for tess 3.0

tesseract ara.arial.exp4.tif ara.arial.exp4 nobatch box.train
unicharset_extractor ara.arial.exp4.box
echo "arial 0 0 1 0 0" > font_properties # tell Tesseract informations 
about the font
mftraining -F font_properties -U unicharset -O ara.unicharset ara.arial.exp4
.tr
shapeclustering -F unicharset ara.arial.exp4.tr
cntraining ara.arial.exp4.tr

mv inttemp ara.inttemp
mv normproto ara.normproto
mv pffmtable ara.pffmtable
mv shapetable ara.shapetable
combine_tessdata ara.


Please suggest changes for the above steps. I plan to publish a rigorous 
explanative tutorial after getting overview of all the steps.
Thank you.


On Wednesday, April 12, 2017 at 3:38:11 PM UTC+5:30, shree wrote:
>
> see 
> https://github.com/tesseract-ocr/tesseract/blob/master/training/tesstrain.sh
>
>
> if ((LINEDATA)); then
>   phase_E_extract_features "lstm.train" 8 "lstmf"
>   make__lstmdata
> else
>   phase_E_extract_features "box.train" 8 "tr"
>   phase_C_cluster_prototypes "${TRAINING_DIR}/${LANG_CODE}.normproto"
>   if [[ "${ENABLE_SHAPE_CLUSTERING}" == "y" ]]; then
>       phase_S_cluster_shapes
>   fi
>   phase_M_cluster_microfeatures
>   phase_B_generate_ambiguities
>   make__traineddata
> fi
>
> --------------------
>
> lstm.train is for LSTM training
>
> box.train is for 3.0 Tesseract legacy engine training
>
> Please note that current master code is for alpha testing for 4.0 LSTM and 
> will most probably drop support for legacy engine.
>
> If you want the legacy tesseract engine and train for it, please use the 
> 3.05 branch of the github repo.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/70a9d13b-a28b-4e6f-9c78-ec1c41361d96%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to