I was using Automatic page segmentation mode. Why automatic mode does not
work? Here is sample command
tesseract.exe "E:\Projects\NeOCR_rev1\Text Image
Segmenter\bin\Debug\tesseract\tmp_20180111201447661_page-6.png" out
--tessdata-dir "E:\Projects\NeOCR_rev1\Text Image
Segmenter\bin\Debug\tesseract\tessdata" -l nep --psm 1 --oem 1
On Friday, 12 January 2018 09:59:32 UTC+5:45, shree wrote:
>
> Works fine for me. What traineddata and options did you use?
>
> Attaching the output from the following, I did not change dpi of image.
>
> #!/bin/bash
> img_files=$(ls ./nepali*.png)
> for img_file in ${img_files}; do
> echo "****************************" ${img_file} oem
> 1"**********************************"
> time tesseract --tessdata-dir /mnt/c/Users/User/shree/tessdata_best/
> ${img_file} ${img_file%.*}-Devanagari-best --oem 1 --psm 6 -l Devanagari
> time tesseract --tessdata-dir /mnt/c/Users/User/shree/tessdata_fast/
> ${img_file} ${img_file%.*}-Devanagari-fast --oem 1 --psm 6 -l Devanagari
> time tesseract --tessdata-dir /mnt/c/Users/User/shree/tessdata/
> ${img_file} ${img_file%.*}-nep --oem 1 --psm 6 -l nep
> done
>
>
>
> ShreeDevi
> ____________________________________________________________
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>
> On Thu, Jan 11, 2018 at 9:44 PM, Nirajan Pant <[email protected]
> <javascript:>> wrote:
>
>> Tesseract 4.0 is not working with the image provided here. This is a page
>> from Nepali novel. The resolution is slightly low but not too much. The OCR
>> result only few word or in other pages it returns empty result.
>>
>>
>> <https://lh3.googleusercontent.com/-deZEGcNi7-M/WleM_DV1s7I/AAAAAAAAFgs/AvR3lGALhu0hrOcsK7ZQ4pRIySBDhur9ACLcBGAs/s1600/tmp_20180111201447661_page-6.png>
>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to [email protected] <javascript:>.
>> To post to this group, send email to [email protected]
>> <javascript:>.
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/tesseract-ocr/534d9a5c-342e-447f-b4cd-7792f7bd7718%40googlegroups.com
>>
>> <https://groups.google.com/d/msgid/tesseract-ocr/534d9a5c-342e-447f-b4cd-7792f7bd7718%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>
--
You received this message because you are subscribed to the Google Groups
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit
https://groups.google.com/d/msgid/tesseract-ocr/78da54a3-cb38-4c99-a422-8c6423c560b7%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.