i think this could be if your new traineddats is not trained to as high a accuracy level as the eng traineddata.
You can setup a debug log to verify this. see https://github.com/tesseract-ocr/tesseract/issues/1275#issuecomment-360367865 for details On Wed, Aug 8, 2018 at 6:04 PM <[email protected]> wrote: > i'm trying to use the combination of two traineddata dictionaries together > due to one of them being able to recognise specific numbers better than the > other. > > Here is an example of the code line. > > $codeLine .= '<br>magick convert "'.$filePath.'" -quality > 90 -density 300x300 -units PixelsPerInch "'.$output.'.jpg"'; // > $codeLine .= '<br>tesseract "'.$output.'.jpg" > "'.$output.'" -l fo+eng txt pdf'; > > Despite the fact i put "fo" in front (this is the one that recognises the > number 4 better), it still gives me an output text file that is exactly > identical to the "eng" dictionary output when i run that solo on it's own. > > For some reason, it chooses to not just prioritise eng but also completely > ignoring the fo traineddata file completely. > > The "fo" file definitely works as i've tested it solo. > > I have attached an image example of the text i'd like to OCR and the two > relevant traineddata files. > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/1a5a6768-baeb-4ba9-9cbd-adda6cba957c%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/1a5a6768-baeb-4ba9-9cbd-adda6cba957c%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- ____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduXF6eSn8cfFLUJrTjJ-ojDuATy_wogH-5ugS4CHt5PFQQ%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

