Check here: https://lottoticketscanner.iclips.co.za/assets/tesseract/tessdata/ mycustom.traineddata definitely points to the correct location. It must be something else. Would you help me figure it out or at least give me a working solution. Thanks already
On Saturday, August 24, 2019 at 7:38:11 PM UTC+2, Clint William Theron wrote: > > Where do I find the value of TESSDATA_PREFIX? > > On Saturday, August 24, 2019 at 7:34:00 PM UTC+2, Clint William Theron > wrote: >> >> I don't know where to find the TESSDATA_PREFIX value since I'm using >> tesseract CDN on a http web server. What did you do? Did you create a >> node.js app? I install tesseract in windows 10 and I replaced the >> traineddata file in the tessdata directory and it worked. I'm looking to >> build an online solution though. I got started and found out about the >> custom traineddata idea from the following link: >> >> >> https://ourcodeworld.com/articles/read/580/how-to-convert-images-to-text-with-pure-javascript-using-tesseract-js >> >> It's not necessary to use the CDN or even a html web page. The solution >> should just work online and use my custom traineddata language >> >> On Saturday, August 24, 2019 at 7:13:01 PM UTC+2, shree wrote: >>> >>> I have not used tesseract CDN on a http web server in a html web page. >>> >>> The error says that the traineddata file cannot be found. You need to >>> check the value of TESSDATA_PREFIX and put your custom traineddata there. >>> >>> On Sat, 24 Aug 2019, 22:28 Clint William Theron, <[email protected]> >>> wrote: >>> >>>> The traineddata file is available in that location. I took a screenshot >>>> of the current error: >>>> >>>> [image: Untitled.png] >>>> I changed the name to custom.traineddata but it's not relevant to the >>>> problem. I also changed the location of the file to see if that makes a >>>> different but didn't: >>>> >>>> const worker = new Tesseract.TesseractWorker({ >>>> langPath: 'https://iclips.co.za/images/tessdata/', >>>> }); >>>> >>>> worker.recognize(cameraSensor2.toDataURL('image/png'), 'custom')... >>>> >>>> Do you see what I'm missing here? >>>> Thanks already >>>> On Saturday, August 24, 2019 at 4:44:12 PM UTC+2, shree wrote: >>>>> >>>>> Check that mycustom.traineddata is available in >>>>> https://lottoticketscanner.iclips.co.za/assets/tesseract/tessdata/ >>>>> >>>>> On Sat, Aug 24, 2019 at 7:02 PM Clint William Theron < >>>>> [email protected]> wrote: >>>>> >>>>>> Thanks for your answer. I have a different error though. I'm not sure >>>>>> what you mean by tessdata folder. I'm using the tesseract CDN on a http >>>>>> web >>>>>> server in a html web page. The following images illustrates my current >>>>>> problem: >>>>>> >>>>>> [image: Untitled.jpg] >>>>>> >>>>>> [image: Untitled.png] >>>>>> >>>>>> >>>>>> The first image illustrates the type of server and directory >>>>>> structure I'm using and the second image shows the error. What is the >>>>>> resolution? I know I'm close now though, thanks to you. >>>>>> >>>>>> My current code looks like so: >>>>>> >>>>>> html >>>>>> <script src=" >>>>>> https://unpkg.com/[email protected]/dist/tesseract.min.js" >>>>>> ></script> >>>>>> >>>>>> js >>>>>> const worker = new Tesseract.TesseractWorker({ >>>>>> langPath: ' >>>>>> https://lottoticketscanner.iclips.co.za/assets/tesseract/tessdata/', >>>>>> }); >>>>>> >>>>>> >>>>>> worker.recognize(cameraSensor2.toDataURL('image/png'), 'mycustom') >>>>>> .progress(progress => console.log('progress', >>>>>> progress)) >>>>>> .then(result => console.log('result', result >>>>>> )) >>>>>> .finally(() => worker.terminate()); >>>>>> >>>>>> Thank you. >>>>>> >>>>>> On Saturday, August 24, 2019 at 5:56:30 AM UTC+2, shree wrote: >>>>>>> >>>>>>> You can name your custom traineddata file with a different name eg. >>>>>>> mycustom.traineddata, copy the file to your tessdata folder (referred >>>>>>> by >>>>>>> tessdata_prefix) and then use 'mycustom' instead of 'eng' in your >>>>>>> program. >>>>>>> >>>>>>> On Sat, 24 Aug 2019, 09:13 Clint William Theron, < >>>>>>> [email protected]> wrote: >>>>>>> >>>>>>>> Hi. I have a web-app and I'm using the Tesseract CDN like so: >>>>>>>> >>>>>>>> <script src=" >>>>>>>> https://unpkg.com/[email protected]/dist/tesseract.min.js >>>>>>>> "></script> >>>>>>>> >>>>>>>> >>>>>>>> I then try to add the path to my own created traineddata language >>>>>>>> like so: >>>>>>>> >>>>>>>> const worker = new TesseractWorker({ >>>>>>>> langPath: ' >>>>>>>> https://lottoticketscanner.iclips.co.za/assets/tesseract/langs-folder/ >>>>>>>> ', >>>>>>>> }); >>>>>>>> >>>>>>>> and finally I call the recognize method like so: >>>>>>>> >>>>>>>> worker.recognize(canvas.toDataURL('image/png'), 'eng') >>>>>>>> .progress(progress => console.log( >>>>>>>> 'progress', progress)) >>>>>>>> .then(result => console.log('result', >>>>>>>> result.text)) >>>>>>>> .finally(() => worker.terminate()); >>>>>>>> >>>>>>>> This doesn't seem to be using my language (that I created myself). >>>>>>>> I get the same output as I do when I remove the langPath. I mean I can >>>>>>>> cleary see my language is not being used in the recognized process. I >>>>>>>> know >>>>>>>> it's not because I tested my language (traineddata) in windows desktop >>>>>>>> and >>>>>>>> I get the results that I actually want. What needs to change in my >>>>>>>> code so >>>>>>>> my (custom) traineddata, and only mine, is being used? >>>>>>>> >>>>>>>> Thanks. >>>>>>>> >>>>>>>> -- >>>>>>>> You received this message because you are subscribed to the Google >>>>>>>> Groups "tesseract-ocr" group. >>>>>>>> To unsubscribe from this group and stop receiving emails from it, >>>>>>>> send an email to [email protected]. >>>>>>>> To view this discussion on the web visit >>>>>>>> https://groups.google.com/d/msgid/tesseract-ocr/8a823afb-fde5-43aa-a643-2cf69462c2f6%40googlegroups.com >>>>>>>> >>>>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/8a823afb-fde5-43aa-a643-2cf69462c2f6%40googlegroups.com?utm_medium=email&utm_source=footer> >>>>>>>> . >>>>>>>> >>>>>>> -- >>>>>> You received this message because you are subscribed to the Google >>>>>> Groups "tesseract-ocr" group. >>>>>> To unsubscribe from this group and stop receiving emails from it, >>>>>> send an email to [email protected]. >>>>>> To view this discussion on the web visit >>>>>> https://groups.google.com/d/msgid/tesseract-ocr/4d8dcf15-293c-4b3e-9d67-8b0ba8be9a27%40googlegroups.com >>>>>> >>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/4d8dcf15-293c-4b3e-9d67-8b0ba8be9a27%40googlegroups.com?utm_medium=email&utm_source=footer> >>>>>> . >>>>>> >>>>> >>>>> >>>>> -- >>>>> >>>>> ____________________________________________________________ >>>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >>>>> >>>> -- >>>> You received this message because you are subscribed to the Google >>>> Groups "tesseract-ocr" group. >>>> To unsubscribe from this group and stop receiving emails from it, send >>>> an email to [email protected]. >>>> To view this discussion on the web visit >>>> https://groups.google.com/d/msgid/tesseract-ocr/75347274-7bbe-4f66-a313-7fb51ad9d253%40googlegroups.com >>>> >>>> <https://groups.google.com/d/msgid/tesseract-ocr/75347274-7bbe-4f66-a313-7fb51ad9d253%40googlegroups.com?utm_medium=email&utm_source=footer> >>>> . >>>> >>> -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/e6cfa2ed-a33a-40a8-9ac0-cfb50cec2aa9%40googlegroups.com.

