Re: [tesseract-ocr] Traineed non unicode font with tesseract

2018-04-05 Thread ShreeDevi Kumar
Are you trying to recognize text from a PDF or image that uses a non-Unicode
font?

That is possible.

Training with a non-Unicode font, however, is not possible.

On Fri 6 Apr, 2018, 12:03 AM gopal bhalala wrote:

> Hi Shree,
>
> Thanks for the quick response. Is there any way to train on a PDF and image
> that use a non-Unicode font?
> I have a non-Unicode PDF file and an image to OCR. Should I box them and
> assign the Unicode characters? Is that the right way to OCR a non-Unicode
> PDF or image?
>
> On 05-Apr-2018 7:25 AM, "ShreeDevi Kumar" wrote:
>
>> Training Tesseract is only supported using Unicode fonts.
>>
>> On Thu 5 Apr, 2018, 12:25 AM gopal bhalala wrote:
>>
>>> Hi, I am new to tesseract-ocr. I want to train a non-Unicode font using
>>> Tesseract. I tried to train it with jTessBoxEditor, but did not have any
>>> success.
>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to tesseract-ocr+unsubscr...@googlegroups.com.
>>> To post to this group, send email to tesseract-ocr@googlegroups.com.
>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/tesseract-ocr/dc1825db-ef94-4bfd-bb3e-9e98d11faf07%40googlegroups.com.
>>> For more options, visit https://groups.google.com/d/optout.
>>>


[tesseract-ocr] How to setup tesseract OCR in Suse os

2018-04-05 Thread rahul singh
How do I set up Tesseract OCR on SUSE? Please help.


Re: [tesseract-ocr] Traineed non unicode font with tesseract

2018-04-05 Thread gopal bhalala
Hi Shree,

Thanks for the quick response. Is there any way to train on a PDF and image
that use a non-Unicode font?
I have a non-Unicode PDF file and an image to OCR. Should I box them and
assign the Unicode characters? Is that the right way to OCR a non-Unicode
PDF or image?

On 05-Apr-2018 7:25 AM, "ShreeDevi Kumar" wrote:

> Training Tesseract is only supported using Unicode fonts.
>
> On Thu 5 Apr, 2018, 12:25 AM gopal bhalala wrote:
>
>> Hi, I am new to tesseract-ocr. I want to train a non-Unicode font using
>> Tesseract. I tried to train it with jTessBoxEditor, but did not have any
>> success.
>>


Re: [tesseract-ocr] Error at training 4.0

2018-04-05 Thread Fanatico
Thanks for the quick response. I did not see this part in the documentation.

My problem is that Tesseract recognizes nothing in the image
"kor.AppleMyungjo.exp0.tif", so the box file is empty. In the image
"kor.AppleMyungjo.exp1.tif" it does not recognize the final quotation mark
(") and period (.). Can I fix this by running some tests with fonts?


[Attached image: kor.AppleMyungjo.exp1.tif]

[Attached image: kor.AppleMyungjo.exp0.tif]

