You are missing langdata files Failed to load script unicharset from:/home/adarsh/tesseract/ langdata/Latin.unicharset
Failed to read data from: /home/adarsh/tesseract/langdata/radical-stroke.txt Error reading radical code table /home/adarsh/tesseract/ langdata/radical-stroke.txt Even after you fix the above, this is only first step of LSTM training process. It creates a starter traineddata and lstmf files to be used by lstmtraining. The starter traineddata cannot be used to OCR. Please read wiki pages regarding training 4.0 ShreeDevi ____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Thu, Feb 15, 2018 at 12:52 PM, <[email protected]> wrote: > adarsh@adarsh-X555LJ:~/tesseract$ training/tesstrain.sh --fonts_dir > /usr/share/fonts --lang eng --noextract_font_properties --langdata_dir > /home/adarsh/tesseract/langdata --training_text > /home/adarsh/tesseract/langdata/eng/eng.training_text > --linedata_only --tessdata_dir /home/tessdata/tessdata --output_dir > ~/tesstutorial/engtrain --overwrite > > === Starting training for language 'eng' > [Thu Feb 15 11:56:06 IST 2018] /usr/local/bin/text2image > --fonts_dir=/usr/share/fonts --font=Arial Bold > --outputbase=/tmp/font_tmp.zQ3JffkHYN/sample_text.txt > --text=/tmp/font_tmp.zQ3JffkHYN/sample_text.txt > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN > Rendered page 0 to file /tmp/font_tmp.zQ3JffkHYN/sample_text.txt.tif > > === Phase I: Generating training images === > Rendering using Arial Bold > Rendering using Arial Italic > Rendering using Arial > Rendering using Courier New Bold Italic > Rendering using Courier New > Rendering using Courier New Italic > Rendering using Courier New Bold > Rendering using Arial Bold Italic > [Thu Feb 15 11:56:27 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold_Italic.exp0 > --max_pages=3 --font=Courier New Bold Italic --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:27 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Arial.exp0 --max_pages=3 > --font=Arial --text=/home/adarsh/tesseract/langdata/eng/eng.training_text > [Thu Feb 15 11:56:27 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Italic.exp0 --max_pages=3 > --font=Arial Italic --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:27 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold.exp0 --max_pages=3 > --font=Arial Bold --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:27 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New.exp0 --max_pages=3 > --font=Courier New --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:27 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold.exp0 > --max_pages=3 --font=Courier New Bold --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:27 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold_Italic.exp0 > --max_pages=3 --font=Arial Bold Italic --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:27 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Italic.exp0 > --max_pages=3 --font=Courier New Italic --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Italic.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Courier_New_Bold_Italic.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Arial.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Arial_Bold_Italic.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Courier_New_Italic.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Courier_New_Bold.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Italic.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Courier_New_Bold_Italic.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Arial_Bold_Italic.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Arial.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Courier_New_Italic.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Courier_New_Bold.exp0.tif > Rendering using Times New Roman, Bold Italic > Rendering using Times New Roman, Italic > Rendering using Times New Roman, > Rendering using Times New Roman, Bold > Rendering using Georgia Bold > Rendering using Georgia Bold Italic > Rendering using Georgia Italic > Rendering using Georgia > [Thu Feb 15 11:56:35 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold_Italic.exp0 > --max_pages=3 --font=Times New Roman, Bold Italic > --text=/home/adarsh/tesseract/langdata/eng/eng.training_text > [Thu Feb 15 11:56:35 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman.exp0 > --max_pages=3 --font=Times New Roman, --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:35 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Italic.exp0 > --max_pages=3 --font=Times New Roman, Italic --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:35 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold.exp0 --max_pages=3 > --font=Georgia Bold --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:35 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold_Italic.exp0 > --max_pages=3 --font=Georgia Bold Italic --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:35 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Italic.exp0 > --max_pages=3 --font=Georgia Italic --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:35 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Georgia.exp0 --max_pages=3 > --font=Georgia --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:35 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold.exp0 > --max_pages=3 --font=Times New Roman, Bold --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Georgia_Bold_Italic.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Times_New_Roman.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Times_New_Roman_Bold_Italic.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Times_New_Roman_Bold.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Times_New_Roman_Italic.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Georgia_Italic.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Georgia_Bold_Italic.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Times_New_Roman.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Times_New_Roman_Italic.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Times_New_Roman_Bold.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Times_New_Roman_Bold_Italic.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Georgia_Italic.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold.exp0.tif > Rendering using Trebuchet MS Bold > Rendering using Trebuchet MS Bold Italic > Rendering using Verdana Bold > Rendering using Verdana Bold Italic > Rendering using Trebuchet MS > Rendering using Trebuchet MS Italic > Rendering using Verdana > Rendering using Verdana Italic > [Thu Feb 15 11:56:43 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold.exp0 > --max_pages=3 --font=Trebuchet MS Bold --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:43 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Italic.exp0 > --max_pages=3 --font=Trebuchet MS Italic --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:43 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold_Italic.exp0 > --max_pages=3 --font=Trebuchet MS Bold Italic --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:43 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS.exp0 --max_pages=3 > --font=Trebuchet MS --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:43 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold_Italic.exp0 > --max_pages=3 --font=Verdana Bold Italic --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:43 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold.exp0 --max_pages=3 > --font=Verdana Bold --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:43 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Verdana.exp0 --max_pages=3 > --font=Verdana --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:43 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Italic.exp0 > --max_pages=3 --font=Verdana Italic --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Trebuchet_MS_Italic.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Trebuchet_MS_Bold_Italic.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Trebuchet_MS_Bold.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Verdana_Bold_Italic.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Verdana_Italic.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Trebuchet_MS_Bold_Italic.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Trebuchet_MS_Italic.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Trebuchet_MS_Bold.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Verdana_Bold_Italic.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Verdana_Italic.exp0.tif > Rendering using URW Bookman L Italic > Rendering using Century Schoolbook L Italic > Rendering using URW Bookman L Bold Italic > Rendering using Century Schoolbook L Bold Italic > Rendering using URW Bookman L Bold > Rendering using Century Schoolbook L Bold > Rendering using Century Schoolbook L Medium > Rendering using DejaVu Sans Ultra-Light > [Thu Feb 15 11:56:51 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Italic.exp0 > --max_pages=3 --font=Century Schoolbook L Italic > --text=/home/adarsh/tesseract/langdata/eng/eng.training_text > [Thu Feb 15 11:56:51 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold_Italic.exp0 > --max_pages=3 --font=URW Bookman L Bold Italic > --text=/home/adarsh/tesseract/langdata/eng/eng.training_text > [Thu Feb 15 11:56:51 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold.exp0 > --max_pages=3 --font=Century Schoolbook L Bold > --text=/home/adarsh/tesseract/langdata/eng/eng.training_text > [Thu Feb 15 11:56:51 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Italic.exp0 > --max_pages=3 --font=URW Bookman L Italic --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:51 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold.exp0 > --max_pages=3 --font=URW Bookman L Bold --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > [Thu Feb 15 11:56:51 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Medium.exp0 > --max_pages=3 --font=Century Schoolbook L Medium > --text=/home/adarsh/tesseract/langdata/eng/eng.training_text > [Thu Feb 15 11:56:51 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0 > --max_pages=3 --font=Century Schoolbook L Bold Italic > --text=/home/adarsh/tesseract/langdata/eng/eng.training_text > [Thu Feb 15 11:56:51 IST 2018] /usr/local/bin/text2image > --fontconfig_tmpdir=/tmp/font_tmp.zQ3JffkHYN --fonts_dir=/usr/share/fonts > --strip_unrenderable_words --leading=32 --char_spacing=0.0 --exposure=0 > --outputbase=/tmp/tmp.kisZVM4Xbo/eng/eng.DejaVu_Sans_Ultra-Light.exp0 > --max_pages=3 --font=DejaVu Sans Ultra-Light --text=/home/adarsh/tesseract/ > langdata/eng/eng.training_text > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > URW_Bookman_L_Bold_Italic.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Century_Schoolbook_L_Bold.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Century_Schoolbook_L_Medium.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > DejaVu_Sans_Ultra-Light.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > URW_Bookman_L_Italic.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Century_Schoolbook_L_Bold_Italic.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > URW_Bookman_L_Bold.exp0.tif > Rendered page 0 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Century_Schoolbook_L_Italic.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > URW_Bookman_L_Bold_Italic.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > DejaVu_Sans_Ultra-Light.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Century_Schoolbook_L_Bold.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Century_Schoolbook_L_Medium.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > URW_Bookman_L_Italic.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Century_Schoolbook_L_Italic.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > Century_Schoolbook_L_Bold_Italic.exp0.tif > Rendered page 1 to file /tmp/tmp.kisZVM4Xbo/eng/eng. > URW_Bookman_L_Bold.exp0.tif > > === Phase UP: Generating unicharset and unichar properties files === > [Thu Feb 15 11:57:00 IST 2018] /usr/local/bin/unicharset_extractor > --output_unicharset /tmp/tmp.kisZVM4Xbo/eng/eng.unicharset --norm_mode 1 > /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold_Italic.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Arial.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Italic.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Italic.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Medium.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold_Italic.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Italic.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.DejaVu_Sans_Ultra-Light.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold_Italic.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Italic.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold_Italic.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Italic.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold_Italic.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Italic.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold_Italic.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Italic.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold_Italic.exp0.box > /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana.exp0.box /tmp/tmp.kisZVM4Xbo/eng/eng. > Verdana_Italic.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Arial_Bold.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Arial_Bold_Italic.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Arial.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Arial_Italic.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Century_Schoolbook_L_Bold.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Century_Schoolbook_L_Bold_Italic.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Century_Schoolbook_L_Italic.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Century_Schoolbook_L_Medium.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Courier_New_Bold.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Courier_New_Bold_Italic.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Courier_New.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Courier_New_Italic.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > DejaVu_Sans_Ultra-Light.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Georgia_Bold.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Georgia_Bold_Italic.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Georgia.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Georgia_Italic.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Times_New_Roman_Bold.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Times_New_Roman_Bold_Italic.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Times_New_Roman.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Times_New_Roman_Italic.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Trebuchet_MS_Bold.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Trebuchet_MS_Bold_Italic.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Trebuchet_MS.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Trebuchet_MS_Italic.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > URW_Bookman_L_Bold.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > URW_Bookman_L_Bold_Italic.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > URW_Bookman_L_Italic.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Verdana_Bold.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Verdana_Bold_Italic.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Verdana.exp0.box > Extracting unicharset from box file /tmp/tmp.kisZVM4Xbo/eng/eng. > Verdana_Italic.exp0.box > Other case É of é is not in unicharset > Wrote unicharset file /tmp/tmp.kisZVM4Xbo/eng/eng.unicharset > [Thu Feb 15 11:57:00 IST 2018] /usr/local/bin/set_unicharset_properties > -U /tmp/tmp.kisZVM4Xbo/eng/eng.unicharset -O > /tmp/tmp.kisZVM4Xbo/eng/eng.unicharset > -X /tmp/tmp.kisZVM4Xbo/eng/eng.xheights --script_dir=/home/adarsh/ > tesseract/langdata > Loaded unicharset of size 111 from file /tmp/tmp.kisZVM4Xbo/eng/eng. > unicharset > Setting unichar properties > Other case É of é is not in unicharset > Setting script properties > Failed to load script unicharset from:/home/adarsh/tesseract/ > langdata/Latin.unicharset > Warning: properties incomplete for index 3 = d > Warning: properties incomplete for index 4 = i > Warning: properties incomplete for index 5 = f > Warning: properties incomplete for index 6 = e > Warning: properties incomplete for index 7 = r > Warning: properties incomplete for index 8 = n > Warning: properties incomplete for index 9 = t > Warning: properties incomplete for index 10 = N > Warning: properties incomplete for index 11 = w > Warning: properties incomplete for index 12 = A > Warning: properties incomplete for index 13 = c > Warning: properties incomplete for index 14 = l > Warning: properties incomplete for index 15 = s > Warning: properties incomplete for index 16 = p > Warning: properties incomplete for index 17 = a > Warning: properties incomplete for index 18 = g > Warning: properties incomplete for index 19 = 2 > Warning: properties incomplete for index 20 = 3 > Warning: properties incomplete for index 21 = T > Warning: properties incomplete for index 22 = o > Warning: properties incomplete for index 23 = S > Warning: properties incomplete for index 24 = v > Warning: properties incomplete for index 25 = ~ > Warning: properties incomplete for index 26 = D > Warning: properties incomplete for index 27 = C > Warning: properties incomplete for index 28 = h > Warning: properties incomplete for index 29 = ' > Warning: properties incomplete for index 30 = 7 > Warning: properties incomplete for index 31 = « > Warning: properties incomplete for index 32 = : > Warning: properties incomplete for index 33 = # > Warning: properties incomplete for index 34 = 1 > Warning: properties incomplete for index 35 = Z > Warning: properties incomplete for index 36 = _ > Warning: properties incomplete for index 37 = M > Warning: properties incomplete for index 38 = u > Warning: properties incomplete for index 39 = m > Warning: properties incomplete for index 40 = P > Warning: properties incomplete for index 41 = H > Warning: properties incomplete for index 42 = O > Warning: properties incomplete for index 43 = ( > Warning: properties incomplete for index 44 = ) > Warning: properties incomplete for index 45 = q > Warning: properties incomplete for index 46 = y > Warning: properties incomplete for index 47 = | > Warning: properties incomplete for index 48 = U > Warning: properties incomplete for index 49 = 0 > Warning: properties incomplete for index 50 = % > Warning: properties incomplete for index 51 = x > Warning: properties incomplete for index 52 = F > Warning: properties incomplete for index 53 = R > Warning: properties incomplete for index 54 = I > Warning: properties incomplete for index 55 = , > Warning: properties incomplete for index 56 = ! > Warning: properties incomplete for index 57 = E > Warning: properties incomplete for index 58 = b > Warning: properties incomplete for index 59 = \ > Warning: properties incomplete for index 60 = 8 > Warning: properties incomplete for index 61 = ? > Warning: properties incomplete for index 62 = & > Warning: properties incomplete for index 63 = ; > Warning: properties incomplete for index 64 = B > Warning: properties incomplete for index 65 = k > Warning: properties incomplete for index 66 = - > Warning: properties incomplete for index 67 = > > Warning: properties incomplete for index 68 = L > Warning: properties incomplete for index 69 = . > Warning: properties incomplete for index 70 = — > Warning: properties incomplete for index 71 = 4 > Warning: properties incomplete for index 72 = » > Warning: properties incomplete for index 73 = € > Warning: properties incomplete for index 74 = W > Warning: properties incomplete for index 75 = J > Warning: properties incomplete for index 76 = é > Warning: properties incomplete for index 77 = 9 > Warning: properties incomplete for index 78 = ® > Warning: properties incomplete for index 79 = $ > Warning: properties incomplete for index 80 = 5 > Warning: properties incomplete for index 81 = } > Warning: properties incomplete for index 82 = [ > Warning: properties incomplete for index 83 = Y > Warning: properties incomplete for index 84 = § > Warning: properties incomplete for index 85 = " > Warning: properties incomplete for index 86 = { > Warning: properties incomplete for index 87 = ¢ > Warning: properties incomplete for index 88 = / > Warning: properties incomplete for index 89 = Q > Warning: properties incomplete for index 90 = 6 > Warning: properties incomplete for index 91 = G > Warning: properties incomplete for index 92 = ” > Warning: properties incomplete for index 93 = ° > Warning: properties incomplete for index 94 = K > Warning: properties incomplete for index 95 = ¥ > Warning: properties incomplete for index 96 = V > Warning: properties incomplete for index 97 = © > Warning: properties incomplete for index 98 = z > Warning: properties incomplete for index 99 = + > Warning: properties incomplete for index 100 = = > Warning: properties incomplete for index 101 = £ > Warning: properties incomplete for index 102 = < > Warning: properties incomplete for index 103 = ’ > Warning: properties incomplete for index 104 = ‘ > Warning: properties incomplete for index 105 = j > Warning: properties incomplete for index 106 = X > Warning: properties incomplete for index 107 = ] > Warning: properties incomplete for index 108 = * > Warning: properties incomplete for index 109 = “ > Warning: properties incomplete for index 110 = @ > Writing unicharset to file /tmp/tmp.kisZVM4Xbo/eng/eng.unicharset > > === Phase E: Generating lstmf files === > Using TESSDATA_PREFIX=/home/tessdata/tessdata > [Thu Feb 15 11:57:00 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold.exp0 lstm.train > [Thu Feb 15 11:57:00 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold_Italic.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold_Italic.exp0 lstm.train > [Thu Feb 15 11:57:00 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Italic.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Italic.exp0 lstm.train > [Thu Feb 15 11:57:00 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Arial.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Arial.exp0 > lstm.train > [Thu Feb 15 11:57:00 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold.exp0 lstm.train > [Thu Feb 15 11:57:00 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0 > lstm.train > [Thu Feb 15 11:57:00 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Medium.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Medium.exp0 lstm.train > [Thu Feb 15 11:57:00 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Italic.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Italic.exp0 lstm.train > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Page 1 > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Page 1 > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Page 1 > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Page 2 > Page 2 > Page 2 > Page 2 > Page 2 > Page 2 > Page 2 > Page 2 > Loaded 45/45 pages (1-45) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Century_Schoolbook_L_Bold.exp0.lstmf > Loaded 46/46 pages (1-46) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Century_Schoolbook_L_Bold_Italic.exp0.lstmf > Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Arial_Italic.exp0.lstmf > Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Arial.exp0.lstmf > Loaded 46/46 pages (1-46) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Century_Schoolbook_L_Italic.exp0.lstmf > Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Arial_Bold.exp0.lstmf > Loaded 47/47 pages (1-47) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Century_Schoolbook_L_Medium.exp0.lstmf > Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Arial_Bold_Italic.exp0.lstmf > [Thu Feb 15 11:57:09 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold.exp0 lstm.train > [Thu Feb 15 11:57:09 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold_Italic.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold_Italic.exp0 lstm.train > [Thu Feb 15 11:57:09 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New.exp0 lstm.train > [Thu Feb 15 11:57:09 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.DejaVu_Sans_Ultra-Light.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.DejaVu_Sans_Ultra-Light.exp0 lstm.train > [Thu Feb 15 11:57:09 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold_Italic.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold_Italic.exp0 lstm.train > [Thu Feb 15 11:57:09 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Italic.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Italic.exp0 lstm.train > [Thu Feb 15 11:57:09 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold.exp0 lstm.train > [Thu Feb 15 11:57:09 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia.exp0 > lstm.train > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Page 1 > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Page 1 > Page 1 > Page 1 > Page 2 > Page 2 > Page 2 > Page 2 > Page 2 > Page 2 > Page 2 > Page 2 > Loaded 51/51 pages (1-51) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Courier_New_Bold.exp0.lstmf > Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Georgia.exp0.lstmf > Loaded 51/51 pages (1-51) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Courier_New.exp0.lstmf > Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Georgia_Bold.exp0.lstmf > Loaded 51/51 pages (1-51) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Courier_New_Italic.exp0.lstmf > Loaded 51/51 pages (1-51) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > DejaVu_Sans_Ultra-Light.exp0.lstmf > Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Georgia_Bold_Italic.exp0.lstmf > Loaded 51/51 pages (1-51) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Courier_New_Bold_Italic.exp0.lstmf > [Thu Feb 15 11:57:18 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold.exp0 lstm.train > [Thu Feb 15 11:57:18 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold.exp0 lstm.train > [Thu Feb 15 11:57:18 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Italic.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Italic.exp0 lstm.train > [Thu Feb 15 11:57:18 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold_Italic.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold_Italic.exp0 lstm.train > [Thu Feb 15 11:57:18 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold_Italic.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold_Italic.exp0 lstm.train > [Thu Feb 15 11:57:18 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS.exp0 lstm.train > [Thu Feb 15 11:57:18 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman.exp0 lstm.train > [Thu Feb 15 11:57:18 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Italic.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Italic.exp0 lstm.train > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Page 1 > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Page 1 > Page 2 > Page 2 > Page 2 > Page 2 > Page 2 > Page 2 > Page 2 > Page 2 > Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Times_New_Roman.exp0.lstmf > Loaded 51/51 pages (1-51) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Trebuchet_MS_Bold.exp0.lstmf > Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Times_New_Roman_Bold.exp0.lstmf > Loaded 51/51 pages (1-51) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Trebuchet_MS.exp0.lstmf > Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Times_New_Roman_Italic.exp0.lstmf > Loaded 51/51 pages (1-51) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Trebuchet_MS_Bold_Italic.exp0.lstmf > Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Times_New_Roman_Bold_Italic.exp0.lstmf > Loaded 52/52 pages (1-52) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Georgia_Italic.exp0.lstmf > [Thu Feb 15 11:57:27 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Italic.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Italic.exp0 lstm.train > [Thu Feb 15 11:57:27 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold_Italic.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold_Italic.exp0 lstm.train > [Thu Feb 15 11:57:27 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold_Italic.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold_Italic.exp0 lstm.train > [Thu Feb 15 11:57:27 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Italic.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Italic.exp0 lstm.train > [Thu Feb 15 11:57:27 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Italic.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Italic.exp0 lstm.train > [Thu Feb 15 11:57:27 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold.exp0 lstm.train > [Thu Feb 15 11:57:27 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana.exp0 > lstm.train > [Thu Feb 15 11:57:27 IST 2018] /usr/bin/tesseract > /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold.exp0.tif > /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold.exp0 lstm.train > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Page 1 > Page 1 > Page 1 > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica > Page 1 > Page 1 > Page 2 > Page 2 > Page 2 > Page 2 > Page 2 > Page 2 > Page 2 > Page 2 > Loaded 49/49 pages (1-49) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Verdana.exp0.lstmf > Loaded 47/47 pages (1-47) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > URW_Bookman_L_Bold.exp0.lstmf > Loaded 48/48 pages (1-48) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > URW_Bookman_L_Italic.exp0.lstmf > Loaded 46/46 pages (1-46) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > URW_Bookman_L_Bold_Italic.exp0.lstmf > Loaded 49/49 pages (1-49) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Verdana_Bold_Italic.exp0.lstmf > Loaded 49/49 pages (1-49) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Verdana_Italic.exp0.lstmf > Loaded 49/49 pages (1-49) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Verdana_Bold.exp0.lstmf > Loaded 51/51 pages (1-51) of document /tmp/tmp.kisZVM4Xbo/eng/eng. > Trebuchet_MS_Italic.exp0.lstmf > > === Constructing LSTM training data === > [Thu Feb 15 11:57:36 IST 2018] /usr/local/bin/combine_lang_model > --input_unicharset /tmp/tmp.kisZVM4Xbo/eng/eng.unicharset --script_dir > /home/adarsh/tesseract/langdata --words > /home/adarsh/tesseract/langdata/eng/eng.wordlist > --numbers /home/adarsh/tesseract/langdata/eng/eng.numbers --puncs > /home/adarsh/tesseract/langdata/eng/eng.punc --output_dir > /home/adarsh/tesstutorial/engtrain --lang eng > Loaded unicharset of size 111 from file /tmp/tmp.kisZVM4Xbo/eng/eng. > unicharset > Setting unichar properties > Other case É of é is not in unicharset > Setting script properties > Failed to load script unicharset from:/home/adarsh/tesseract/ > langdata/Latin.unicharset > Warning: properties incomplete for index 3 = d > Warning: properties incomplete for index 4 = i > Warning: properties incomplete for index 5 = f > Warning: properties incomplete for index 6 = e > Warning: properties incomplete for index 7 = r > Warning: properties incomplete for index 8 = n > Warning: properties incomplete for index 9 = t > Warning: properties incomplete for index 10 = N > Warning: properties incomplete for index 11 = w > Warning: properties incomplete for index 12 = A > Warning: properties incomplete for index 13 = c > Warning: properties incomplete for index 14 = l > Warning: properties incomplete for index 15 = s > Warning: properties incomplete for index 16 = p > Warning: properties incomplete for index 17 = a > Warning: properties incomplete for index 18 = g > Warning: properties incomplete for index 19 = 2 > Warning: properties incomplete for index 20 = 3 > Warning: properties incomplete for index 21 = T > Warning: properties incomplete for index 22 = o > Warning: properties incomplete for index 23 = S > Warning: properties incomplete for index 24 = v > Warning: properties incomplete for index 25 = ~ > Warning: properties incomplete for index 26 = D > Warning: properties incomplete for index 27 = C > Warning: properties incomplete for index 28 = h > Warning: properties incomplete for index 29 = ' > Warning: properties incomplete for index 30 = 7 > Warning: properties incomplete for index 31 = « > Warning: properties incomplete for index 32 = : > Warning: properties incomplete for index 33 = # > Warning: properties incomplete for index 34 = 1 > Warning: properties incomplete for index 35 = Z > Warning: properties incomplete for index 36 = _ > Warning: properties incomplete for index 37 = M > Warning: properties incomplete for index 38 = u > Warning: properties incomplete for index 39 = m > Warning: properties incomplete for index 40 = P > Warning: properties incomplete for index 41 = H > Warning: properties incomplete for index 42 = O > Warning: properties incomplete for index 43 = ( > Warning: properties incomplete for index 44 = ) > Warning: properties incomplete for index 45 = q > Warning: properties incomplete for index 46 = y > Warning: properties incomplete for index 47 = | > Warning: properties incomplete for index 48 = U > Warning: properties incomplete for index 49 = 0 > Warning: properties incomplete for index 50 = % > Warning: properties incomplete for index 51 = x > Warning: properties incomplete for index 52 = F > Warning: properties incomplete for index 53 = R > Warning: properties incomplete for index 54 = I > Warning: properties incomplete for index 55 = , > Warning: properties incomplete for index 56 = ! > Warning: properties incomplete for index 57 = E > Warning: properties incomplete for index 58 = b > Warning: properties incomplete for index 59 = \ > Warning: properties incomplete for index 60 = 8 > Warning: properties incomplete for index 61 = ? > Warning: properties incomplete for index 62 = & > Warning: properties incomplete for index 63 = ; > Warning: properties incomplete for index 64 = B > Warning: properties incomplete for index 65 = k > Warning: properties incomplete for index 66 = - > Warning: properties incomplete for index 67 = > > Warning: properties incomplete for index 68 = L > Warning: properties incomplete for index 69 = . > Warning: properties incomplete for index 70 = — > Warning: properties incomplete for index 71 = 4 > Warning: properties incomplete for index 72 = » > Warning: properties incomplete for index 73 = € > Warning: properties incomplete for index 74 = W > Warning: properties incomplete for index 75 = J > Warning: properties incomplete for index 76 = é > Warning: properties incomplete for index 77 = 9 > Warning: properties incomplete for index 78 = ® > Warning: properties incomplete for index 79 = $ > Warning: properties incomplete for index 80 = 5 > Warning: properties incomplete for index 81 = } > Warning: properties incomplete for index 82 = [ > Warning: properties incomplete for index 83 = Y > Warning: properties incomplete for index 84 = § > Warning: properties incomplete for index 85 = " > Warning: properties incomplete for index 86 = { > Warning: properties incomplete for index 87 = ¢ > Warning: properties incomplete for index 88 = / > Warning: properties incomplete for index 89 = Q > Warning: properties incomplete for index 90 = 6 > Warning: properties incomplete for index 91 = G > Warning: properties incomplete for index 92 = ” > Warning: properties incomplete for index 93 = ° > Warning: properties incomplete for index 94 = K > Warning: properties incomplete for index 95 = ¥ > Warning: properties incomplete for index 96 = V > Warning: properties incomplete for index 97 = © > Warning: properties incomplete for index 98 = z > Warning: properties incomplete for index 99 = + > Warning: properties incomplete for index 100 = = > Warning: properties incomplete for index 101 = £ > Warning: properties incomplete for index 102 = < > Warning: properties incomplete for index 103 = ’ > Warning: properties incomplete for index 104 = ‘ > Warning: properties incomplete for index 105 = j > Warning: properties incomplete for index 106 = X > Warning: properties incomplete for index 107 = ] > Warning: properties incomplete for index 108 = * > Warning: properties incomplete for index 109 = “ > Warning: properties incomplete for index 110 = @ > Config file is optional, continuing... > Failed to read data from: /home/adarsh/tesseract/langdata/eng/eng.config > Failed to read data from: /home/adarsh/tesseract/ > langdata/radical-stroke.txt > Error reading radical code table /home/adarsh/tesseract/ > langdata/radical-stroke.txt > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Bold_Italic.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Arial.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Arial_Italic.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold.exp0.lstmf > to /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Bold_Italic.exp0.lstmf > to /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Italic.exp0.lstmf > to /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Century_Schoolbook_L_Medium.exp0.lstmf > to /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Bold_Italic.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Courier_New_Italic.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.DejaVu_Sans_Ultra-Light.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Bold_Italic.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Georgia_Italic.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Bold_Italic.exp0.lstmf > to /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Times_New_Roman_Italic.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Bold_Italic.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Trebuchet_MS_Italic.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Bold_Italic.exp0.lstmf > to /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.URW_Bookman_L_Italic.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Bold_Italic.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > Moving /tmp/tmp.kisZVM4Xbo/eng/eng.Verdana_Italic.exp0.lstmf to > /home/adarsh/tesstutorial/engtrain > > Completed training for language 'eng' > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit https://groups.google.com/d/ > msgid/tesseract-ocr/baa11e4f-b5a8-42cf-827c-6901073af746% > 40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/baa11e4f-b5a8-42cf-827c-6901073af746%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduWViWEUQ1zQW%2BnPnJ3kow0FK4GBmoCeranQOOakiSvd4Q%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

