Re: [tesseract-ocr] Traineed non unicode font with tesseract

2018-04-06 Thread ShreeDevi Kumar
Please see
https://github.com/tesseract-ocr/tesseract/wiki/Command-Line-Usage

For Indian languages, use Tesseract 4.0.0-beta.1
with the traineddata files from
https://github.com/tesseract-ocr/tessdata_fast

ShreeDevi

भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
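
As a rough illustration of the advice above, the sketch below builds a command line for the `tesseract` executable using a language model from a local copy of tessdata_fast. The language code `hin`, the file names, and the `./tessdata_fast` directory are assumptions for the example, not values from this thread; `-l` and `--tessdata-dir` are the options described on the Command-Line-Usage page. Tesseract reads image files, so PDF pages would need to be converted to images first.

```python
import subprocess
from pathlib import Path


def build_tesseract_command(image, output_base, lang="hin",
                            tessdata_dir="./tessdata_fast"):
    """Return the argument list for one tesseract run.

    lang and tessdata_dir are illustrative defaults (assumptions),
    not values taken from the thread.
    """
    return [
        "tesseract",
        "--tessdata-dir", str(tessdata_dir),  # folder holding <lang>.traineddata
        str(image),                           # input image, e.g. a scanned page
        str(output_base),                     # output name without extension
        "-l", lang,                           # language model to load
    ]


def run_ocr(image, output_base, **kwargs):
    """Run tesseract and return the recognized text.

    Requires the tesseract executable on PATH and the traineddata file
    for the chosen language inside tessdata_dir.
    """
    cmd = build_tesseract_command(image, output_base, **kwargs)
    subprocess.run(cmd, check=True)
    # By default tesseract writes plain text to <output_base>.txt
    return Path(f"{output_base}.txt").read_text(encoding="utf-8")
```

With a hypothetical scan saved as `page1.png`, calling `run_ocr("page1.png", "page1")` would run the command and read back `page1.txt`. The recognized text comes out as Unicode regardless of which font the page was printed in.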


Re: [tesseract-ocr] Traineed non unicode font with tesseract

2018-04-06 Thread gopal bhalala
Yes, Shree. I am trying to recognize text from a PDF or image that uses a
non-Unicode font. I tried making box files for this but did not succeed.
Can you please give me some guidance on how to do that?

Best Regards & Thanking you,
Gopal Dhanjibhai Bhalala


-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CA%2BnTJPD0M1aMtw8gsJb7KKPnOvtGUC9SNQTxy36LF%3DACeXLvAg%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Re: [tesseract-ocr] Traineed non unicode font with tesseract

2018-04-05 Thread ShreeDevi Kumar
Are you trying to recognize the text from a PDF or image with a non-Unicode
font?

That is possible to do.

If you want to train using a non-Unicode font, that is not possible.
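
To make the distinction concrete: recognition only looks at the pixels of the page, while training text has to be stored as Unicode code points for the target script. The following is a minimal sketch, using only Python's standard `unicodedata` module, that tallies which scripts a piece of text is actually encoded in. The helper name `script_counts` and the sample strings are hypothetical. Text typed with a legacy non-Unicode font is usually stored as Latin code points that merely display as another script.

```python
import unicodedata
from collections import Counter


def script_counts(text):
    """Tally non-space characters by the first word of their Unicode name.

    This is a rough stand-in for the script: 'DEVANAGARI LETTER BHA'
    counts as DEVANAGARI, 'LATIN SMALL LETTER K' counts as LATIN.
    """
    counts = Counter()
    for ch in text:
        if ch.isspace():
            continue
        # Characters without a name (e.g. control codes) are grouped as UNKNOWN
        name = unicodedata.name(ch, "UNKNOWN")
        counts[name.split()[0]] += 1
    return counts


unicode_text = "भजन"   # real Devanagari code points
legacy_text = "Hktu"   # made-up sample of legacy-font text stored as Latin letters

print(script_counts(unicode_text))
print(script_counts(legacy_text))
```

The first string is counted as three Devanagari characters and the second as four Latin ones, even though a legacy font might draw both with similar glyphs. Only the first kind of text is usable as Unicode ground truth.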



Re: [tesseract-ocr] Traineed non unicode font with tesseract

2018-04-05 Thread gopal bhalala
Hi Shree,

Thanks for the quick response. Is there any way to train on a non-Unicode
font from a PDF and an image?
I have a non-Unicode PDF file and an image to OCR. Should I box them and
assign the Unicode characters? Is that the right way to OCR a non-Unicode
PDF or image?



Re: [tesseract-ocr] Traineed non unicode font with tesseract

2018-04-04 Thread ShreeDevi Kumar
Training Tesseract is only supported using Unicode fonts.

On Thu 5 Apr, 2018, 12:25 AM gopal bhalala wrote:

> Hi, I am new to tesseract-ocr. I want to train a non-Unicode font using
> Tesseract. I tried to train it with jTessBoxEditor but did not have any
> success.