Ahh i see, i will report back once i have the output file if i can't figure out the reason why. You've been very helpful, thanks again :)
On 9 August 2018 at 11:28, Shree Devi Kumar <[email protected]> wrote: > output tesseract.log file should be produced in the directory from where > you are running the command, usually where your OCR output is created. > > On Thu, Aug 9, 2018 at 3:48 PM <[email protected]> wrote: > >> Hello Shree, thank you for your prompt reply. >> >> I have now changed the logfile as instructed. Where can i find the output >> tesseract.log file? will it be produced in the same location as the >> logfile? in C:\Program Files (x86)\Tesseract-OCR\tessdata\configs ? I'm >> guessing the tesseract.log file will be produced once i've used logfile in >> the commands. >> >> Kind Regards, >> >> Damon >> >> >> On Wednesday, 8 August 2018 19:07:02 UTC+1, shree wrote: >>> >>> i think this could be if your new traineddats is not trained to as high >>> a accuracy level as the eng traineddata. >>> >>> You can setup a debug log to verify this. see https://github.com/ >>> tesseract-ocr/tesseract/issues/1275#issuecomment-360367865 for details >>> >>> On Wed, Aug 8, 2018 at 6:04 PM <[email protected]> wrote: >>> >>>> i'm trying to use the combination of two traineddata dictionaries >>>> together due to one of them being able to recognise specific numbers better >>>> than the other. >>>> >>>> Here is an example of the code line. >>>> >>>> $codeLine .= '<br>magick convert "'.$filePath.'" >>>> -quality 90 -density 300x300 -units PixelsPerInch "'.$output.'.jpg"'; // >>>> $codeLine .= '<br>tesseract "'.$output.'.jpg" >>>> "'.$output.'" -l fo+eng txt pdf'; >>>> >>>> Despite the fact i put "fo" in front (this is the one that recognises >>>> the number 4 better), it still gives me an output text file that is exactly >>>> identical to the "eng" dictionary output when i run that solo on it's own. >>>> >>>> For some reason, it chooses to not just prioritise eng but also >>>> completely ignoring the fo traineddata file completely. >>>> >>>> The "fo" file definitely works as i've tested it solo. >>>> >>>> I have attached an image example of the text i'd like to OCR and the >>>> two relevant traineddata files. >>>> >>>> -- >>>> You received this message because you are subscribed to the Google >>>> Groups "tesseract-ocr" group. >>>> To unsubscribe from this group and stop receiving emails from it, send >>>> an email to [email protected]. >>>> To post to this group, send email to [email protected]. >>>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>>> To view this discussion on the web visit https://groups.google.com/d/ >>>> msgid/tesseract-ocr/1a5a6768-baeb-4ba9-9cbd-adda6cba957c% >>>> 40googlegroups.com >>>> <https://groups.google.com/d/msgid/tesseract-ocr/1a5a6768-baeb-4ba9-9cbd-adda6cba957c%40googlegroups.com?utm_medium=email&utm_source=footer> >>>> . >>>> For more options, visit https://groups.google.com/d/optout. >>>> >>> >>> >>> -- >>> >>> ____________________________________________________________ >>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >>> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected]. >> To post to this group, send email to [email protected]. >> Visit this group at https://groups.google.com/group/tesseract-ocr. >> To view this discussion on the web visit https://groups.google.com/d/ >> msgid/tesseract-ocr/befd629e-e433-45dd-bf1a-7a5c955e9a61% >> 40googlegroups.com >> <https://groups.google.com/d/msgid/tesseract-ocr/befd629e-e433-45dd-bf1a-7a5c955e9a61%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> For more options, visit https://groups.google.com/d/optout. >> > > > -- > > ____________________________________________________________ > भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com > > -- > You received this message because you are subscribed to a topic in the > Google Groups "tesseract-ocr" group. > To unsubscribe from this topic, visit https://groups.google.com/d/ > topic/tesseract-ocr/k5fU3wQzXmY/unsubscribe. > To unsubscribe from this group and all its topics, send an email to > [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit https://groups.google.com/d/ > msgid/tesseract-ocr/CAG2NduWK2gdGYGq_BX21YAAo5tuAFcs_ > eFkaLho9Hz0T4OegpQ%40mail.gmail.com > <https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduWK2gdGYGq_BX21YAAo5tuAFcs_eFkaLho9Hz0T4OegpQ%40mail.gmail.com?utm_medium=email&utm_source=footer> > . > > For more options, visit https://groups.google.com/d/optout. > -- Damon Kwong Developer Development Team *Max Communications* 3E, Chislehurst High Street <https://maps.google.com/?q=3E,+Chislehurst+High+Street&entry=gmail&source=g> Kent BR7 5AB 020 8309 5445 Cannon House 25, Cadogan Road <https://maps.google.com/?q=25,+Cadogan+RoadLondon+SE18+6LB&entry=gmail&source=g> London SE18 6LB <https://maps.google.com/?q=25,+Cadogan+RoadLondon+SE18+6LB&entry=gmail&source=g> 020 3617 8835 www.maxcommunications.co.uk [image: Max Logo - ISO 9001 Accreditation] -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAD%3DFO-ELDEmE_LUCwpTDEw%3DU91FJqBVB4cphHwiXxtObrPr6mw%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

