Ahh i see, i will report back once i have the output file if i can't figure
out the reason why. You've been very helpful, thanks again :)

On 9 August 2018 at 11:28, Shree Devi Kumar <[email protected]> wrote:

> output tesseract.log file should be produced in the directory from where
> you are running the command, usually where your OCR output is created.
>
> On Thu, Aug 9, 2018 at 3:48 PM <[email protected]> wrote:
>
>> Hello Shree, thank you for your prompt reply.
>>
>> I have now changed the logfile as instructed. Where can i find the output
>> tesseract.log file? will it be produced in the same location as the
>> logfile? in C:\Program Files (x86)\Tesseract-OCR\tessdata\configs ? I'm
>> guessing the tesseract.log file will be produced once i've used logfile in
>> the commands.
>>
>> Kind Regards,
>>
>> Damon
>>
>>
>> On Wednesday, 8 August 2018 19:07:02 UTC+1, shree wrote:
>>>
>>> i think this could be if your new traineddats is not trained to as high
>>> a accuracy level as the eng traineddata.
>>>
>>> You can setup a debug log to verify this. see https://github.com/
>>> tesseract-ocr/tesseract/issues/1275#issuecomment-360367865 for details
>>>
>>> On Wed, Aug 8, 2018 at 6:04 PM <[email protected]> wrote:
>>>
>>>> i'm trying to use the combination of two traineddata dictionaries
>>>> together due to one of them being able to recognise specific numbers better
>>>> than the other.
>>>>
>>>> Here is an example of the code line.
>>>>
>>>>                  $codeLine .= '<br>magick convert "'.$filePath.'"
>>>> -quality 90 -density 300x300  -units PixelsPerInch "'.$output.'.jpg"'; //
>>>>                  $codeLine .= '<br>tesseract "'.$output.'.jpg"
>>>> "'.$output.'" -l fo+eng txt pdf';
>>>>
>>>> Despite the fact i put "fo" in front (this is the one that recognises
>>>> the number 4 better), it still gives me an output text file that is exactly
>>>> identical to the "eng" dictionary output when i run that solo on it's own.
>>>>
>>>> For some reason, it chooses to not just prioritise eng but also
>>>> completely ignoring the fo traineddata file completely.
>>>>
>>>> The "fo" file definitely works as i've tested it solo.
>>>>
>>>> I have attached an image example of the text i'd like to OCR and the
>>>> two relevant traineddata files.
>>>>
>>>> --
>>>> You received this message because you are subscribed to the Google
>>>> Groups "tesseract-ocr" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>> an email to [email protected].
>>>> To post to this group, send email to [email protected].
>>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>>> To view this discussion on the web visit https://groups.google.com/d/
>>>> msgid/tesseract-ocr/1a5a6768-baeb-4ba9-9cbd-adda6cba957c%
>>>> 40googlegroups.com
>>>> <https://groups.google.com/d/msgid/tesseract-ocr/1a5a6768-baeb-4ba9-9cbd-adda6cba957c%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>> .
>>>> For more options, visit https://groups.google.com/d/optout.
>>>>
>>>
>>>
>>> --
>>>
>>> ____________________________________________________________
>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to [email protected].
>> To post to this group, send email to [email protected].
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit https://groups.google.com/d/
>> msgid/tesseract-ocr/befd629e-e433-45dd-bf1a-7a5c955e9a61%
>> 40googlegroups.com
>> <https://groups.google.com/d/msgid/tesseract-ocr/befd629e-e433-45dd-bf1a-7a5c955e9a61%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>
> --
>
> ____________________________________________________________
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>
> --
> You received this message because you are subscribed to a topic in the
> Google Groups "tesseract-ocr" group.
> To unsubscribe from this topic, visit https://groups.google.com/d/
> topic/tesseract-ocr/k5fU3wQzXmY/unsubscribe.
> To unsubscribe from this group and all its topics, send an email to
> [email protected].
> To post to this group, send email to [email protected].
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit https://groups.google.com/d/
> msgid/tesseract-ocr/CAG2NduWK2gdGYGq_BX21YAAo5tuAFcs_
> eFkaLho9Hz0T4OegpQ%40mail.gmail.com
> <https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduWK2gdGYGq_BX21YAAo5tuAFcs_eFkaLho9Hz0T4OegpQ%40mail.gmail.com?utm_medium=email&utm_source=footer>
> .
>
> For more options, visit https://groups.google.com/d/optout.
>



-- 
Damon Kwong
Developer
Development Team

*Max Communications*
3E, Chislehurst High Street
<https://maps.google.com/?q=3E,+Chislehurst+High+Street&entry=gmail&source=g>
Kent BR7 5AB
020 8309 5445

Cannon House
25, Cadogan Road
<https://maps.google.com/?q=25,+Cadogan+RoadLondon+SE18+6LB&entry=gmail&source=g>
London SE18 6LB
<https://maps.google.com/?q=25,+Cadogan+RoadLondon+SE18+6LB&entry=gmail&source=g>
020 3617 8835




www.maxcommunications.co.uk

[image: Max Logo - ISO 9001 Accreditation]

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAD%3DFO-ELDEmE_LUCwpTDEw%3DU91FJqBVB4cphHwiXxtObrPr6mw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to