Hi Shree, I've tried to run my commands again by having logfile as the last
variable which has been changed to:
*debug_file tesseract.log*
*multilang_debug_level 3*
*stopper_debug_level 3*
When i entered the command with logfile at the end, it gives an output in
cmd saying: http://puu.sh/BbTla/a34624a9a4.png

The problem is that the files do exist because i tried running the command
again without logfile and the files were being produced... very
confusing... any idea why it can't find the files? as you can see, the
directories are in speech marks too.

On 9 August 2018 at 11:55, Damon Kwong <[email protected]>
wrote:

> Ahh i see, i will report back once i have the output file if i can't
> figure out the reason why. You've been very helpful, thanks again :)
>
> On 9 August 2018 at 11:28, Shree Devi Kumar <[email protected]> wrote:
>
>> output tesseract.log file should be produced in the directory from where
>> you are running the command, usually where your OCR output is created.
>>
>> On Thu, Aug 9, 2018 at 3:48 PM <[email protected]> wrote:
>>
>>> Hello Shree, thank you for your prompt reply.
>>>
>>> I have now changed the logfile as instructed. Where can i find the
>>> output tesseract.log file? will it be produced in the same location as the
>>> logfile? in C:\Program Files (x86)\Tesseract-OCR\tessdata\configs ? I'm
>>> guessing the tesseract.log file will be produced once i've used logfile in
>>> the commands.
>>>
>>> Kind Regards,
>>>
>>> Damon
>>>
>>>
>>> On Wednesday, 8 August 2018 19:07:02 UTC+1, shree wrote:
>>>>
>>>> i think this could be if your new traineddats is not trained to as high
>>>> a accuracy level as the eng traineddata.
>>>>
>>>> You can setup a debug log to verify this. see
>>>> https://github.com/tesseract-ocr/tesseract/issues/1275#
>>>> issuecomment-360367865 for details
>>>>
>>>> On Wed, Aug 8, 2018 at 6:04 PM <[email protected]> wrote:
>>>>
>>>>> i'm trying to use the combination of two traineddata dictionaries
>>>>> together due to one of them being able to recognise specific numbers 
>>>>> better
>>>>> than the other.
>>>>>
>>>>> Here is an example of the code line.
>>>>>
>>>>>                  $codeLine .= '<br>magick convert "'.$filePath.'"
>>>>> -quality 90 -density 300x300  -units PixelsPerInch "'.$output.'.jpg"'; //
>>>>>                  $codeLine .= '<br>tesseract "'.$output.'.jpg"
>>>>> "'.$output.'" -l fo+eng txt pdf';
>>>>>
>>>>> Despite the fact i put "fo" in front (this is the one that recognises
>>>>> the number 4 better), it still gives me an output text file that is 
>>>>> exactly
>>>>> identical to the "eng" dictionary output when i run that solo on it's own.
>>>>>
>>>>> For some reason, it chooses to not just prioritise eng but also
>>>>> completely ignoring the fo traineddata file completely.
>>>>>
>>>>> The "fo" file definitely works as i've tested it solo.
>>>>>
>>>>> I have attached an image example of the text i'd like to OCR and the
>>>>> two relevant traineddata files.
>>>>>
>>>>> --
>>>>> You received this message because you are subscribed to the Google
>>>>> Groups "tesseract-ocr" group.
>>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>>> an email to [email protected].
>>>>> To post to this group, send email to [email protected].
>>>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>>>> To view this discussion on the web visit
>>>>> https://groups.google.com/d/msgid/tesseract-ocr/1a5a6768-bae
>>>>> b-4ba9-9cbd-adda6cba957c%40googlegroups.com
>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/1a5a6768-baeb-4ba9-9cbd-adda6cba957c%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>>> .
>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>
>>>>
>>>>
>>>> --
>>>>
>>>> ____________________________________________________________
>>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to [email protected].
>>> To post to this group, send email to [email protected].
>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>> To view this discussion on the web visit https://groups.google.com/d/ms
>>> gid/tesseract-ocr/befd629e-e433-45dd-bf1a-7a5c955e9a61%40goo
>>> glegroups.com
>>> <https://groups.google.com/d/msgid/tesseract-ocr/befd629e-e433-45dd-bf1a-7a5c955e9a61%40googlegroups.com?utm_medium=email&utm_source=footer>
>>> .
>>> For more options, visit https://groups.google.com/d/optout.
>>>
>>
>>
>> --
>>
>> ____________________________________________________________
>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>
>> --
>> You received this message because you are subscribed to a topic in the
>> Google Groups "tesseract-ocr" group.
>> To unsubscribe from this topic, visit https://groups.google.com/d/to
>> pic/tesseract-ocr/k5fU3wQzXmY/unsubscribe.
>> To unsubscribe from this group and all its topics, send an email to
>> [email protected].
>> To post to this group, send email to [email protected].
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit https://groups.google.com/d/ms
>> gid/tesseract-ocr/CAG2NduWK2gdGYGq_BX21YAAo5tuAFcs_eFkaLho9H
>> z0T4OegpQ%40mail.gmail.com
>> <https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduWK2gdGYGq_BX21YAAo5tuAFcs_eFkaLho9Hz0T4OegpQ%40mail.gmail.com?utm_medium=email&utm_source=footer>
>> .
>>
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>
>
> --
> Damon Kwong
> Developer
> Development Team
>
> *Max Communications*
> 3E, Chislehurst High Street
> <https://maps.google.com/?q=3E,+Chislehurst+High+Street&entry=gmail&source=g>
> Kent BR7 5AB
> 020 8309 5445
>
> Cannon House
> 25, Cadogan Road
> <https://maps.google.com/?q=25,+Cadogan+RoadLondon+SE18+6LB&entry=gmail&source=g>
> London SE18 6LB
> <https://maps.google.com/?q=25,+Cadogan+RoadLondon+SE18+6LB&entry=gmail&source=g>
> 020 3617 8835
>
>
>
>
> www.maxcommunications.co.uk
>
> [image: Max Logo - ISO 9001 Accreditation]
>
>
>
>
>


-- 
Damon Kwong
Developer
Development Team

*Max Communications*
3E, Chislehurst High Street
<https://maps.google.com/?q=3E,+Chislehurst+High+Street&entry=gmail&source=g>
Kent BR7 5AB
020 8309 5445

Cannon House
25, Cadogan Road
<https://maps.google.com/?q=25,+Cadogan+RoadLondon+SE18+6LB&entry=gmail&source=g>
London SE18 6LB
<https://maps.google.com/?q=25,+Cadogan+RoadLondon+SE18+6LB&entry=gmail&source=g>
020 3617 8835




www.maxcommunications.co.uk

[image: Max Logo - ISO 9001 Accreditation]

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAD%3DFO-E85BQLwi0-ptkUWeHswkdC-ZgWvad57mH90uazQMdbaw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to