sorry I made a typo in the question. I used "hocr " in my config file
在 2018年6月23日星期六 UTC-4上午3:14:38,shree写道: > > tesseract test.png result horc > > You used wrong config file. It should be hocr not horc > > On Sat, Jun 23, 2018 at 12:23 PM Ben Zhang <[email protected] > <javascript:>> wrote: > >> Hi, All, >> I used tesseract 3.05, and type 'tesseract test.png result horc' in >> command line, get result.horc, in this file it has: >> >> *Provider* *Networks* Precertification 808.791.7505 direct 888.941.4622 >> x302toll-free 808.535.8398 fax >> >> Medical *&* Dental *-* *Hawaii* Medical *-* Mainland 888.941 .HMAA >> (4622) *V* Cigna PPO *‘* hmaa.com/providers *HWMG* cigna.com 4F Clgna >> Submit claims directly to HWMG: Submit claims directly to Cigna: PO Box >> 32580 PO Box 188061 Honolulu, HI 96803-2580 Chattanooga, TN 37422-8061 >> Payer ID 48330 Payer ID 62308 8 Drug *-* *Hawaii* *&* Mainland *Vision* >> *-* *Hawaii* *&* Mainland 855.785.6960 .._ Vision Choice {.3 >> Express—Scripts.com fl; amass *SCRIPTSE* 800.877.7195 VS V" VS p *I* CO m >> 9; *care* for Me Submit claims directly to Express Scripts Submit claims >> directly to VSP. >> >> >> \ or call 800.922.1557 for pharmacy help. >> >> Why no info like >> >> LibTesseract.simple_read(config_line_with_hocr, 'phrase.png') >> <div class='ocr_page' id='page_1' title='image ""; bbox 0 0 319 33; >> ppageno 0'> >> <div class='ocr_carea' id='block_1_1' title="bbox 0 0 319 33"> >> <p class='ocr_par' dir='ltr' id='par_1_1' title="bbox 10 13 276 25"> >> <span class='ocr_line' id='line_1_1' title="bbox 10 13 276 25; baseline >> 0 0"><span class='ocrx_word' id='word_1_1' title='bbox 10 14 41 25; >> x_wconf 75' lang='eng' dir='ltr'><strong>the</strong></span> <span >> class='ocrx_word' id='word_1_2' title='bbox 53 13 97 25; x_wconf 84' >> lang='eng' dir='ltr'><strong>book</strong></span> <span class='ocrx_word' >> id='word_1_3' title='bbox 111 13 129 25; x_wconf 79' lang='eng' >> dir='ltr'><strong>is</strong></span> <span class='ocrx_word' id='word_1_4' >> title='bbox 143 17 164 25; x_wconf 83' lang='eng' dir='ltr'>on</span> <span >> class='ocrx_word' id='word_1_5' title='bbox 178 14 209 25; x_wconf 75' >> lang='eng' dir='ltr'><strong>the</strong></span> <span class='ocrx_word' >> id='word_1_6' title='bbox 223 14 276 25; x_wconf 76' lang='eng' >> dir='ltr'><strong>table</strong></span> >> </span> >> </p> >> </div> >> </div> >> >> I am new to tesseract. Thanks for your help >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected] <javascript:>. >> To post to this group, send email to [email protected] >> <javascript:>. >> Visit this group at https://groups.google.com/group/tesseract-ocr. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/37fb723a-750f-434d-a12e-f597a80b59e7%40googlegroups.com >> >> <https://groups.google.com/d/msgid/tesseract-ocr/37fb723a-750f-434d-a12e-f597a80b59e7%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> For more options, visit https://groups.google.com/d/optout. >> > > > -- > > ____________________________________________________________ > भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/65360eb8-cfcd-43ed-9a93-08362c80a549%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

