Thank you so much! I got it working. Didn't think about inverting the images. Best, Cailey
On Sunday, November 1, 2020 at 11:59:00 AM UTC-5 Cailey McVay wrote: > How did you invert the image? And is there a code I can use to invert the > rest of my images to try with more sample data? > > On Sunday, November 1, 2020 at 10:55:00 AM UTC-5 shree wrote: > >> Invert the image. Results using tessdata_best/eng - LSTM engine >> >> $ tesseract legacy-invert.jpg - --psm 6 >> 063.433 >> $ tesseract legacy-300.jpg - --psm 6 >> 063.433 >> $ tesseract legacy-144.jpg - --psm 6 >> 063.433 >> >> >> >> On Sun, Nov 1, 2020 at 8:37 PM Cailey McVay <cailey.m...@dartmouth.edu> >> wrote: >> >>> Here is an example of the sample image. I believe we are using the >>> legacy engine. Does this help? >>> >>> On Saturday, October 31, 2020 at 11:15:46 PM UTC-4 shree wrote: >>> >>>> >When we use tesseract on the images without the trained language we >>>> receive outputs that are accurate about 50% of the time. >>>> >>>> You haven't shared a sample image. Sometimes preprocessing the images, >>>> using a whitelist in case of limited character set can be the solution >>>> rather than training. >>>> >>>> On Sun, Nov 1, 2020, 03:29 Cailey McVay <cailey.m...@dartmouth.edu> >>>> wrote: >>>> >>>>> Hello! >>>>> I am working on a project that is trying to read borehole video >>>>> depths. We trained a new language to read these numbers called NTS. When >>>>> we >>>>> use tesseract on the images without the trained language we receive >>>>> outputs >>>>> that are accurate about 50% of the time. However when we use the new >>>>> language, we receive no output at all. Is it possible that we overtrained >>>>> tesseract to not recognize any of the images? I will attach below our box >>>>> file, unicharset file, box trained file, pffmtable file, and normproto >>>>> file. Our shapetable file processes but then returns an empty file. Could >>>>> something be wrong with our shapetable? And if so, how could we fix that? >>>>> >>>>> Box File for the first five images: >>>>> 0 3 1 14 19 0 >>>>> 9 18 0 29 20 0 >>>>> 3 33 1 46 19 0 >>>>> . 50 1 56 19 0 >>>>> 2 64 1 75 19 0 >>>>> 5 76 1 93 19 0 >>>>> 2 92 1 111 19 0 >>>>> 0 4 1 15 19 1 >>>>> 8 19 1 30 19 1 >>>>> 3 34 1 46 19 1 >>>>> . 54 1 57 5 1 >>>>> 4 65 1 77 19 1 >>>>> 1 82 1 91 19 1 >>>>> 4 96 1 107 19 1 >>>>> 0 3 1 15 19 2 >>>>> 8 19 1 30 19 2 >>>>> 6 34 1 46 19 2 >>>>> . 53 1 57 5 2 >>>>> 8 65 1 77 19 2 >>>>> 3 80 1 91 19 2 >>>>> 9 95 1 107 19 2 >>>>> 0 4 1 15 19 3 >>>>> 8 17 1 31 19 3 >>>>> 8 32 1 46 19 3 >>>>> . 52 2 58 8 3 >>>>> 1 64 0 77 20 3 >>>>> 8 80 1 91 19 3 >>>>> 5 96 1 107 19 3 >>>>> 0 3 1 15 19 4 >>>>> 8 19 1 30 19 4 >>>>> 7 34 1 47 19 4 >>>>> . 53 1 58 9 4 >>>>> 5 65 1 77 19 4 >>>>> 6 80 1 92 19 4 >>>>> 4 95 0 109 20 4 >>>>> 0 4 1 15 19 5 >>>>> 7 19 1 30 19 5 >>>>> 5 34 1 46 19 5 >>>>> . 53 1 57 5 5 >>>>> 3 65 1 76 19 5 >>>>> 1 82 1 90 19 5 >>>>> 3 96 1 107 19 5 >>>>> >>>>> >>>>> Unicharset: >>>>> 14 >>>>> NULL 0 Common 0 >>>>> Joined 7 0,255,0,255,0,0,0,0,0,0 Latin 1 0 1 Joined # Joined [4a 6f 69 >>>>> 6e 65 64 ]a >>>>> |Broken|0|1 21 0,255,0,255,0,0,0,0,0,0 Common 2 10 2 |Broken|0|1 # >>>>> Broken >>>>> 0 8 0,255,0,255,0,0,0,0,0,0 Common 3 2 3 0 # 0 [30 ]0 >>>>> 9 8 0,255,0,255,0,0,0,0,0,0 Common 4 2 4 9 # 9 [39 ]0 >>>>> 3 8 0,255,0,255,0,0,0,0,0,0 Common 5 2 5 3 # 3 [33 ]0 >>>>> . 22 0,255,0,255,0,0,0,0,0,0 Common 6 6 6 . # . [2e ]p >>>>> 2 8 0,255,0,255,0,0,0,0,0,0 Common 7 2 7 2 # 2 [32 ]0 >>>>> 5 8 0,255,0,255,0,0,0,0,0,0 Common 8 2 8 5 # 5 [35 ]0 >>>>> 8 8 0,255,0,255,0,0,0,0,0,0 Common 9 2 9 8 # 8 [38 ]0 >>>>> 4 8 0,255,0,255,0,0,0,0,0,0 Common 10 2 10 4 # 4 [34 ]0 >>>>> 1 8 0,255,0,255,0,0,0,0,0,0 Common 11 2 11 1 # 1 [31 ]0 >>>>> 6 8 0,255,0,255,0,0,0,0,0,0 Common 12 2 12 6 # 6 [36 ]0 >>>>> 7 8 0,255,0,255,0,0,0,0,0,0 Common 13 2 13 7 # 7 [37 ]0 >>>>> >>>>> >>>>> NTS.font.exp0.tr file: >>>>> font 0 3 1 14 19 0 >>>>> 4 >>>>> mf 16 >>>>> -0.085041896 0.30783021 0.27617577 0 0 0 >>>>> -0.25234067 0.27376649 0.089746617 0.13718249 0 0 >>>>> -0.28155157 0.0045010448 0.47040343 0.25 0 0 >>>>> -0.25234067 -0.26476437 0.08974655 0.36281759 0 0 >>>>> -0.085041896 -0.29882804 0.27617577 0.5 0 0 >>>>> -0.031931162 -0.21447986 0.1730229 0.96998096 0 0 >>>>> -0.11690831 0.020721853 0.43796182 0.75 0 0 >>>>> -0.031931162 0.23970276 0.1699543 0.5 0 0 >>>>> 0.24424461 0.072628468 0.47339222 0.76789355 0 0 >>>>> 0.1353676 0.30783021 0.16464323 0 0 0 >>>>> 0.10615671 0.18941826 0.14627755 0.37934926 0 0 >>>>> 0.15926743 -0.011719763 0.30170703 0.25 0 0 >>>>> 0.10615671 -0.19663697 0.12619166 0.090763755 0 0 >>>>> 0.1353676 -0.29882804 0.16464323 0.5 0 0 >>>>> 0.27079996 -0.26476437 0.12619169 0.59076369 0 0 >>>>> 0.29735535 -0.19663697 0.086383387 0.85538673 0 0 >>>>> cn 1 >>>>> 0.36328125 0.35781249 0.2421875 0.1484375 >>>>> if 73 >>>>> 133 69 248 >>>>> 119 72 248 >>>>> 104 75 248 >>>>> 97 82 192 >>>>> 97 95 192 >>>>> 97 107 192 >>>>> 97 120 192 >>>>> 97 132 192 >>>>> 97 145 192 >>>>> 97 157 192 >>>>> 97 170 192 >>>>> 97 182 192 >>>>> 104 188 128 >>>>> 119 188 128 >>>>> 133 188 128 >>>>> 135 206 0 >>>>> 123 206 0 >>>>> 111 206 0 >>>>> 99 206 0 >>>>> 88 206 0 >>>>> 76 206 0 >>>>> 66 201 35 >>>>> 59 193 35 >>>>> 55 182 64 >>>>> 55 168 64 >>>>> 55 155 64 >>>>> 55 142 64 >>>>> 55 128 64 >>>>> 55 115 64 >>>>> 55 101 64 >>>>> 55 88 64 >>>>> 55 75 64 >>>>> 59 64 93 >>>>> 66 55 93 >>>>> 76 51 128 >>>>> 88 51 128 >>>>> 99 51 128 >>>>> 111 51 128 >>>>> 123 51 128 >>>>> 135 51 128 >>>>> 145 184 97 >>>>> 154 175 97 >>>>> 163 167 97 >>>>> 168 156 64 >>>>> 168 143 64 >>>>> 168 130 64 >>>>> 168 118 64 >>>>> 168 105 64 >>>>> 168 92 64 >>>>> 163 82 23 >>>>> 154 77 23 >>>>> 145 71 23 >>>>> 148 51 128 >>>>> 162 51 128 >>>>> 176 51 128 >>>>> 187 53 151 >>>>> 196 59 151 >>>>> 205 65 151 >>>>> 207 72 219 >>>>> 200 81 219 >>>>> 196 92 192 >>>>> 196 105 192 >>>>> 196 118 192 >>>>> 196 130 192 >>>>> 196 143 192 >>>>> 196 156 192 >>>>> 195 168 204 >>>>> 191 179 204 >>>>> 188 190 204 >>>>> 184 200 204 >>>>> 176 206 0 >>>>> 162 206 0 >>>>> 148 206 0 >>>>> tb 1 >>>>> 64 251 114 >>>>> >>>>> >>>>> pffmtable: >>>>> NULL 0 >>>>> Joined 0 >>>>> |Broken|0|1 0 >>>>> 0 0 >>>>> 9 0 >>>>> 3 0 >>>>> . 0 >>>>> 2 0 >>>>> 5 0 >>>>> 8 0 >>>>> 4 0 >>>>> 1 0 >>>>> 6 0 >>>>> 7 0 >>>>> >>>>> NTS.normproto file: >>>>> linear essential -0.250000 0.750000 >>>>> linear non-essential 0.000000 1.000000 >>>>> linear essential 0.000000 1.000000 >>>>> linear essential 0.000000 1.000000 >>>>> >>>>> 0 1 >>>>> significant elliptical 34 >>>>> 0.364775 0.371404 0.241039 0.150391 >>>>> 0.000400 0.000416 0.000400 0.000400 >>>>> >>>>> 9 1 >>>>> significant elliptical 13 >>>>> 0.372897 0.418750 0.241286 0.157752 >>>>> 0.000400 0.004734 0.000400 0.001087 >>>>> >>>>> 3 1 >>>>> significant elliptical 16 >>>>> 0.365479 0.385596 0.247070 0.143799 >>>>> 0.000400 0.003148 0.000400 0.000702 >>>>> >>>>> . 1 >>>>> significant elliptical 27 >>>>> 0.081019 0.055483 0.060619 0.050492 >>>>> 0.000400 0.000400 0.000400 0.000400 >>>>> >>>>> 2 1 >>>>> significant elliptical 10 >>>>> 0.354297 0.359492 0.248828 0.138672 >>>>> 0.000400 0.000400 0.000400 0.000400 >>>>> >>>>> 5 1 >>>>> significant elliptical 10 >>>>> 0.363672 0.350859 0.248047 0.144922 >>>>> 0.000400 0.000400 0.000400 0.000400 >>>>> >>>>> 8 1 >>>>> significant elliptical 19 >>>>> 0.365543 0.378536 0.234786 0.141653 >>>>> 0.000400 0.000400 0.000400 0.000400 >>>>> >>>>> 4 1 >>>>> significant elliptical 9 >>>>> 0.325521 0.274219 0.215278 0.128038 >>>>> 0.000400 0.000400 0.000400 0.000400 >>>>> >>>>> 1 1 >>>>> significant elliptical 11 >>>>> 0.320312 0.217259 0.248580 0.091974 >>>>> 0.000400 0.000400 0.000400 0.000400 >>>>> >>>>> 6 1 >>>>> significant elliptical 20 >>>>> 0.360156 0.370703 0.238281 0.143164 >>>>> 0.000400 0.000400 0.000400 0.000400 >>>>> >>>>> 7 1 >>>>> significant elliptical 20 >>>>> 0.448633 0.243359 0.242969 0.113477 >>>>> 0.000400 0.000400 0.000400 0.000400 >>>>> >>>>> -- >>>>> >>>> You received this message because you are subscribed to the Google >>>>> Groups "tesseract-ocr" group. >>>>> To unsubscribe from this group and stop receiving emails from it, send >>>>> an email to tesseract-oc...@googlegroups.com. >>>>> To view this discussion on the web visit >>>>> https://groups.google.com/d/msgid/tesseract-ocr/9e3a6851-0311-4148-af1f-b61999f38977n%40googlegroups.com >>>>> >>>>> <https://groups.google.com/d/msgid/tesseract-ocr/9e3a6851-0311-4148-af1f-b61999f38977n%40googlegroups.com?utm_medium=email&utm_source=footer> >>>>> . >>>>> >>>> -- >>> You received this message because you are subscribed to the Google >>> Groups "tesseract-ocr" group. >>> To unsubscribe from this group and stop receiving emails from it, send >>> an email to tesseract-oc...@googlegroups.com. >>> >> To view this discussion on the web visit >>> https://groups.google.com/d/msgid/tesseract-ocr/c09d4786-595e-4e49-b5c6-b7ded4bee47fn%40googlegroups.com >>> >>> <https://groups.google.com/d/msgid/tesseract-ocr/c09d4786-595e-4e49-b5c6-b7ded4bee47fn%40googlegroups.com?utm_medium=email&utm_source=footer> >>> . >>> >> >> >> -- >> >> ____________________________________________________________ >> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >> > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/421dd165-6e9e-41cc-83e4-72db7aad30f8n%40googlegroups.com.