Re: [tesseract-ocr] Re: Tesseract ignores tessedit_char_whitelist parameter

2017-10-19 Thread Quan Nguyen
https://github.com/tesseract-ocr/tesseract/issues/751

Use current version 3.05.x, if possible.


On Thursday, October 19, 2017 at 9:19:08 AM UTC-5, Ľuboš Katrinec wrote:
>
> I used --print-parameters with this version and I could see the parameter 
> in the list included. Do you think it is not used even if listed? It's the 
> same with tessedit_char_blacklist? Is there an alternative?
>
> Thanks and regards,
> Lubos
>
> On Saturday, October 14, 2017 at 5:43:16 PM UTC+2, shree wrote:
>>
>> whitelist parameter does not work with tesseract 4.0x
>>
>> ShreeDevi
>> 
>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>
>> On Sat, Oct 14, 2017 at 8:25 PM, Dan9er  wrote:
>>
>>> -c goes at the very end of the command, and you can combine those two 
>>> arguments. Try this:
>>>
>>> > tesseract threshold_problem1.jpeg stdout -c tessedit_char_whitelist=
>>> ABCDEFGHIJKLMNOPQRSTUVWXYZ tessedit_char_blacklist=abcdef
>>> ghijklmnopqrstuvwxyz
>>>
>>> On Friday, October 13, 2017 at 5:43:46 AM UTC-4, Ľuboš Katrinec wrote:

 Hello,

 I'm trying to solve captcha images just for fun (or rather a challenge 
 ;-) ). I'm passing tessedit_char_whitelist and tessedit_char_blacklist 
 parameters but somehow they seem to be ignored. Perhaps I just miss 
 something.

 > tesseract -c tessedit_char_whitelist=ABCDEFGHIJKLMNOPQRSTUVWXYZ -c 
 tessedit_char_blacklist=abcdefghijklmnopqrstuvwxyz  
 threshold_problem1.jpeg 
 stdout
 Warning. Invalid resolution 0 dpi. Using 70 instead.
 R x C Eo e

 I'm using a windows version:

 > tesseract -v
 tesseract 4.00.00alpha
  leptonica-1.74.1
   libgif 4.1.6(?) : libjpeg 8d (libjpeg-turbo 1.5.0) : libpng 1.6.20 : 
 libtiff 4.0.6 : zlib 1.2.8 : libwebp 0.4.3 : libopenjp2 2.1.0


 I'm doing it over a JPEG, could that be a problem?

 Thanks and regards,
 Lubos

>>> -- 
>>> You received this message because you are subscribed to the Google 
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send 
>>> an email to tesseract-oc...@googlegroups.com.
>>> To post to this group, send email to tesser...@googlegroups.com.
>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>> To view this discussion on the web visit 
>>> https://groups.google.com/d/msgid/tesseract-ocr/7036c184-2d91-43f1-874f-44f2c29f3d61%40googlegroups.com
>>>  
>>> 
>>> .
>>> For more options, visit https://groups.google.com/d/optout.
>>>
>>
>>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/b091515d-b04b-46bb-93c0-5e908c52d326%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] Re: Tesseract ignores tessedit_char_whitelist parameter

2017-10-19 Thread Ľuboš Katrinec
I used --print-parameters with this version and I could see the parameter 
in the list included. Do you think it is not used even if listed? It's the 
same with tessedit_char_blacklist? Is there an alternative?

Thanks and regards,
Lubos

On Saturday, October 14, 2017 at 5:43:16 PM UTC+2, shree wrote:
>
> whitelist parameter does not work with tesseract 4.0x
>
> ShreeDevi
> 
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>
> On Sat, Oct 14, 2017 at 8:25 PM, Dan9er  > wrote:
>
>> -c goes at the very end of the command, and you can combine those two 
>> arguments. Try this:
>>
>> > tesseract threshold_problem1.jpeg stdout -c tessedit_char_whitelist=
>> ABCDEFGHIJKLMNOPQRSTUVWXYZ tessedit_char_blacklist=abcdef
>> ghijklmnopqrstuvwxyz
>>
>> On Friday, October 13, 2017 at 5:43:46 AM UTC-4, Ľuboš Katrinec wrote:
>>>
>>> Hello,
>>>
>>> I'm trying to solve captcha images just for fun (or rather a challenge 
>>> ;-) ). I'm passing tessedit_char_whitelist and tessedit_char_blacklist 
>>> parameters but somehow they seem to be ignored. Perhaps I just miss 
>>> something.
>>>
>>> > tesseract -c tessedit_char_whitelist=ABCDEFGHIJKLMNOPQRSTUVWXYZ -c 
>>> tessedit_char_blacklist=abcdefghijklmnopqrstuvwxyz  threshold_problem1.jpeg 
>>> stdout
>>> Warning. Invalid resolution 0 dpi. Using 70 instead.
>>> R x C Eo e
>>>
>>> I'm using a windows version:
>>>
>>> > tesseract -v
>>> tesseract 4.00.00alpha
>>>  leptonica-1.74.1
>>>   libgif 4.1.6(?) : libjpeg 8d (libjpeg-turbo 1.5.0) : libpng 1.6.20 : 
>>> libtiff 4.0.6 : zlib 1.2.8 : libwebp 0.4.3 : libopenjp2 2.1.0
>>>
>>>
>>> I'm doing it over a JPEG, could that be a problem?
>>>
>>> Thanks and regards,
>>> Lubos
>>>
>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to tesseract-oc...@googlegroups.com .
>> To post to this group, send email to tesser...@googlegroups.com 
>> .
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/7036c184-2d91-43f1-874f-44f2c29f3d61%40googlegroups.com
>>  
>> 
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/6a79aa66-3e42-42d0-9b90-6513bea58c1d%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Re: Tesseract ignores tessedit_char_whitelist parameter

2017-10-19 Thread Ľuboš Katrinec
I already tried this, didn't help at all.

On Saturday, October 14, 2017 at 4:55:42 PM UTC+2, Dan9er wrote:
>
> -c goes at the very end of the command, and you can combine those two 
> arguments. Try this:
>
> > tesseract threshold_problem1.jpeg stdout -c tessedit_char_whitelist=
> ABCDEFGHIJKLMNOPQRSTUVWXYZ tessedit_char_blacklist=abcdef
> ghijklmnopqrstuvwxyz
>
> On Friday, October 13, 2017 at 5:43:46 AM UTC-4, Ľuboš Katrinec wrote:
>>
>> Hello,
>>
>> I'm trying to solve captcha images just for fun (or rather a challenge 
>> ;-) ). I'm passing tessedit_char_whitelist and tessedit_char_blacklist 
>> parameters but somehow they seem to be ignored. Perhaps I just miss 
>> something.
>>
>> > tesseract -c tessedit_char_whitelist=ABCDEFGHIJKLMNOPQRSTUVWXYZ -c 
>> tessedit_char_blacklist=abcdefghijklmnopqrstuvwxyz  threshold_problem1.jpeg 
>> stdout
>> Warning. Invalid resolution 0 dpi. Using 70 instead.
>> R x C Eo e
>>
>> I'm using a windows version:
>>
>> > tesseract -v
>> tesseract 4.00.00alpha
>>  leptonica-1.74.1
>>   libgif 4.1.6(?) : libjpeg 8d (libjpeg-turbo 1.5.0) : libpng 1.6.20 : 
>> libtiff 4.0.6 : zlib 1.2.8 : libwebp 0.4.3 : libopenjp2 2.1.0
>>
>>
>> I'm doing it over a JPEG, could that be a problem?
>>
>> Thanks and regards,
>> Lubos
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/ba46f0fb-a23d-45d7-a51a-ad9ef84cec42%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] Re: Tesseract ignores tessedit_char_whitelist parameter

2017-10-14 Thread ShreeDevi Kumar
whitelist parameter does not work with tesseract 4.0x

ShreeDevi

भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Sat, Oct 14, 2017 at 8:25 PM, Dan9er  wrote:

> -c goes at the very end of the command, and you can combine those two
> arguments. Try this:
>
> > tesseract threshold_problem1.jpeg stdout -c tessedit_char_whitelist=
> ABCDEFGHIJKLMNOPQRSTUVWXYZ tessedit_char_blacklist=abcdef
> ghijklmnopqrstuvwxyz
>
> On Friday, October 13, 2017 at 5:43:46 AM UTC-4, Ľuboš Katrinec wrote:
>>
>> Hello,
>>
>> I'm trying to solve captcha images just for fun (or rather a challenge
>> ;-) ). I'm passing tessedit_char_whitelist and tessedit_char_blacklist
>> parameters but somehow they seem to be ignored. Perhaps I just miss
>> something.
>>
>> > tesseract -c tessedit_char_whitelist=ABCDEFGHIJKLMNOPQRSTUVWXYZ -c
>> tessedit_char_blacklist=abcdefghijklmnopqrstuvwxyz  threshold_problem1.jpeg
>> stdout
>> Warning. Invalid resolution 0 dpi. Using 70 instead.
>> R x C Eo e
>>
>> I'm using a windows version:
>>
>> > tesseract -v
>> tesseract 4.00.00alpha
>>  leptonica-1.74.1
>>   libgif 4.1.6(?) : libjpeg 8d (libjpeg-turbo 1.5.0) : libpng 1.6.20 :
>> libtiff 4.0.6 : zlib 1.2.8 : libwebp 0.4.3 : libopenjp2 2.1.0
>>
>>
>> I'm doing it over a JPEG, could that be a problem?
>>
>> Thanks and regards,
>> Lubos
>>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit https://groups.google.com/d/
> msgid/tesseract-ocr/7036c184-2d91-43f1-874f-44f2c29f3d61%
> 40googlegroups.com
> 
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduW%3DEJdz4%3DhD9LEhj4Mc-k7EpvqVXrNCMitfdREYRVb-8w%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Re: Tesseract ignores tessedit_char_whitelist parameter

2017-10-14 Thread Dan9er
-c goes at the very end of the command, and you can combine those two 
arguments. Try this:

> tesseract threshold_problem1.jpeg stdout -c 
> tessedit_char_whitelist=ABCDEFGHIJKLMNOPQRSTUVWXYZ 
tessedit_char_blacklist=abcdefghijklmnopqrstuvwxyz

On Friday, October 13, 2017 at 5:43:46 AM UTC-4, Ľuboš Katrinec wrote:
>
> Hello,
>
> I'm trying to solve captcha images just for fun (or rather a challenge ;-) 
> ). I'm passing tessedit_char_whitelist and tessedit_char_blacklist 
> parameters but somehow they seem to be ignored. Perhaps I just miss 
> something.
>
> > tesseract -c tessedit_char_whitelist=ABCDEFGHIJKLMNOPQRSTUVWXYZ -c 
> tessedit_char_blacklist=abcdefghijklmnopqrstuvwxyz  threshold_problem1.jpeg 
> stdout
> Warning. Invalid resolution 0 dpi. Using 70 instead.
> R x C Eo e
>
> I'm using a windows version:
>
> > tesseract -v
> tesseract 4.00.00alpha
>  leptonica-1.74.1
>   libgif 4.1.6(?) : libjpeg 8d (libjpeg-turbo 1.5.0) : libpng 1.6.20 : 
> libtiff 4.0.6 : zlib 1.2.8 : libwebp 0.4.3 : libopenjp2 2.1.0
>
>
> I'm doing it over a JPEG, could that be a problem?
>
> Thanks and regards,
> Lubos
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/7036c184-2d91-43f1-874f-44f2c29f3d61%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.