[tesseract-ocr] Re: Install and run tesseract 4.0 on MAC OSX step by step

2018-09-26 Thread Mark Phillips
Not sure if this was right or not, but given that the earlier errors were only
"warnings", I continued onward and got these errors with ScrollView.jar:

[Wed Sep 26-19:17:29][MEPMBP2017][()markphillips](~/Documents/Development/Tesseract/tesseract/java)
=>>SCROLLVIEW_PATH=~/Documents/Development/Tesseract/tesseract/java make ScrollView.jar

javac -encoding UTF8 -sourcepath . -classpath 
piccolo2d-core-3.0.jar:piccolo2d-extras-3.0.jar 
./com/google/scrollview/ui/SVAbstractMenuItem.java 
./com/google/scrollview/ui/SVCheckboxMenuItem.java 
./com/google/scrollview/ui/SVEmptyMenuItem.java 
./com/google/scrollview/events/SVEvent.java 
./com/google/scrollview/events/SVEventHandler.java 
./com/google/scrollview/events/SVEventType.java 
./com/google/scrollview/ui/SVImageHandler.java 
./com/google/scrollview/ui/SVMenuBar.java 
./com/google/scrollview/ui/SVMenuItem.java 
./com/google/scrollview/ui/SVPopupMenu.java 
./com/google/scrollview/ui/SVSubMenuItem.java 
./com/google/scrollview/ui/SVWindow.java 
./com/google/scrollview/ScrollView.java -d .

./com/google/scrollview/ui/SVImageHandler.java:19: error: package javax.xml.bind is not visible
import javax.xml.bind.DatatypeConverter;
^
  (package javax.xml.bind is declared in module java.xml.bind, which is not in the module graph)
1 error
make: *** [com/google/scrollview/ui/SVAbstractMenuItem.class] Error 1
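
For what it's worth, this error usually points at the JDK version rather than at the Tesseract sources: javax.xml.bind lives in the java.xml.bind module, which JDK 9 and 10 still ship but do not resolve by default, and which JDK 11 removed altogether. Below is a minimal sketch of how one might pick the extra javac flag, assuming javac is run by hand rather than through the Makefile. The helper name extra_javac_flags is made up for illustration and is not part of the Tesseract build.

```shell
# Hypothetical helper (not part of the Tesseract build): print the extra
# javac flag needed for a given JDK major version.
# JDK 9 and 10 ship java.xml.bind but do not resolve it by default;
# JDK 8 and older need nothing; JDK 11+ no longer ships the module.
extra_javac_flags() {
  case "$1" in
    9|10) echo "--add-modules java.xml.bind" ;;
    *)    echo "" ;;
  esac
}

# Major version of the javac on PATH, e.g. "10" from "javac 10.0.2".
# Old JDKs report "1.8.0_181", which yields "1" and therefore no extra flag.
if command -v javac >/dev/null 2>&1; then
  jdk_major=$(javac -version 2>&1 | awk '/^javac /{print $2}' | cut -d. -f1)
  echo "javac major version: ${jdk_major}; extra flags: $(extra_javac_flags "${jdk_major}")"
fi
```

The printed flag would then go in front of the existing options, e.g. `javac --add-modules java.xml.bind -encoding UTF8 ...`. On JDK 11 or newer the flag cannot help, so the options there are building with an older JDK or supplying the JAXB API jar on the classpath.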

On Sunday, April 8, 2018 at 2:50:20 AM UTC-7, Fanatico wrote:
>
> I just posted a step-by-step guide in the repo issues covering what I had to
> do to use Tesseract 4.0 on my Mac, so I'm sharing the link in case
> someone runs into the same problems I did.
> Note: it can save you a few days of your life.
>
> https://github.com/tesseract-ocr/tesseract/issues/1453
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/392bf20b-6982-4b19-9fb8-9a66c193257e%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Re: Install and run tesseract 4.0 on MAC OSX step by step

2018-09-26 Thread Mark Phillips
I guess the build failed from before...

[Wed Sep 26-19:27:21][MEPMBP2017][()markphillips](~/Documents/Development/Tesseract/tesseract)
=>>text2image --list_available_fonts --fonts_dir=/Library/Fonts
dyld: Library not loaded: /usr/local/opt/icu4c/lib/libicui18n.62.dylib
  Referenced from: /usr/local/bin/text2image
  Reason: image not found
Abort trap: 6
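
A note on reading that dyld message: it says the text2image binary exists but was linked against ICU 62, and that exact dylib is no longer at the recorded path. One common cause (an assumption here, not confirmed in this thread) is that Homebrew upgraded icu4c after the binary was built. A small sketch to check this, using only the two paths from the error above; check_dylib is a made-up helper name.

```shell
# Paths taken from the dyld error above.
bin=/usr/local/bin/text2image
want=/usr/local/opt/icu4c/lib/libicui18n.62.dylib

# Hypothetical helper: report whether a given dylib path exists.
check_dylib() {
  if [ -e "$1" ]; then echo "present: $1"; else echo "missing: $1"; fi
}
check_dylib "$want"

# On macOS, otool -L lists every shared library the binary expects.
# The "|| true" keeps the script going when no ICU line is found.
if command -v otool >/dev/null 2>&1 && [ -x "$bin" ]; then
  otool -L "$bin" | grep -i icu || true
fi
```

If the library is reported missing while a newer libicui18n sits in the same directory, rebuilding the training tools against the installed ICU is the likely fix.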




Re: [tesseract-ocr] Tesseract to detect numbers with opencv (c++) and cmake on a raspberry pi

2018-09-26 Thread Zdenko Podobny
Just google for help. There are plenty of examples, e.g.:
https://stackoverflow.com/questions/24570916/add-external-libraries-to-cmakelist-txt-c

Zdenko
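
To make "set them manually" a bit more concrete, here is a rough sketch, not a tested recipe for the Pi. It assumes the project's CMakeLists.txt only uses ${Tesseract_INCLUDE_DIRS} and ${Tesseract_LIBRARIES} and that its find_package(Tesseract) call has been removed or made optional. The two paths are guesses for a Raspbian package install and should be replaced with whatever pkg-config or a file search reports on your system.

```shell
# Assumed locations of an autotools-built tesseract on Raspbian (guesses).
TESS_INC=/usr/include/tesseract
TESS_LIB=/usr/lib/arm-linux-gnueabihf/libtesseract.so

# Autotools builds normally install tesseract.pc, so pkg-config can print
# the real compiler and linker flags if it knows about the package.
if command -v pkg-config >/dev/null 2>&1 && pkg-config --exists tesseract; then
  pkg-config --cflags --libs tesseract
fi

# Assemble the cmake invocation that sets both variables by hand.
cmake_cmd="cmake -DTesseract_INCLUDE_DIRS=${TESS_INC} -DTesseract_LIBRARIES=${TESS_LIB} .."
echo "$cmake_cmd"
```

The echoed line is meant to be run from an empty build directory inside the project. Leptonica may need the same treatment, since Tesseract depends on it.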


On Wed, 26 Sep 2018 at 14:38, Adam Richards wrote:

> Hi Zdenko,
>
> Sorry, I'm just still unsure how exactly to get this working? What should
> I be doing to set my Tesseract_INCLUDE_DIRS and Tesseract_LIBRARIES
> manually...
>
> Any further help you could provide would be appreciated,
>
> thanks,
> Adam
>
> On Monday, September 24, 2018 at 4:46:05 PM UTC+10, zdenop wrote:
>>
>> This means that your tesseract packages were built with autotools and
>> do not include CMake support.
>> So you need to set your Tesseract_INCLUDE_DIRS and Tesseract_LIBRARIES
>> manually...
>>
>> Zdenko
>>
>>
>> On Mon, 24 Sep 2018 at 0:24, Adam Richards wrote:
>>
>>> I did a search on the Pi and it couldn't find a TesseractConfig.cmake
>>> file anywhere.
>>>
>>> Where should it be exactly?
>>>
>>> Should it automatically come with Tesseract or do I need to download or
>>> add it?
>>>



Re: [tesseract-ocr] Tesseract to detect numbers with opencv (c++) and cmake on a raspberry pi

2018-09-26 Thread Adam Richards
Hi Zdenko,

Sorry, I'm just still unsure how exactly to get this working? What should I 
be doing to set my Tesseract_INCLUDE_DIRS and Tesseract_LIBRARIES 
manually...

Any further help you could provide would be appreciated,

thanks,
Adam

On Monday, September 24, 2018 at 4:46:05 PM UTC+10, zdenop wrote:
>
> This means that your tesseract packages were built with autotools and
> do not include CMake support.
> So you need to set your Tesseract_INCLUDE_DIRS and Tesseract_LIBRARIES
> manually...
>
> Zdenko
>
>
> On Mon, 24 Sep 2018 at 0:24, Adam Richards wrote:
>
>> I did a search on the Pi and it couldn't find a TesseractConfig.cmake
>> file anywhere.
>>
>> Where should it be exactly?
>>
>> Should it automatically come with Tesseract or do I need to download or
>> add it?
>>
>



[tesseract-ocr] Re: I Need help getting Tesseract 4.0 C# .Net Wrapper working please!

2018-09-26 Thread THintz
I assume you mean the charlesw/tesseract wrapper on Github.  Questions are 
more directly answered there.  What steps did you perform and what is the 
symptom?



Re: [tesseract-ocr] Compute CTC targets failed while training

2018-09-26 Thread Khosrobeigy.zohreh
No, I always train from scratch.
The best and fast traineddata do not recognize mixed eng and Persian well, and
the accuracy is too low for some fonts.
I want to solve this problem.
Can fine-tuning use a different unicharset? As I read in the Tesseract wiki,
the unicharset size is the number of output classes of the LSTM. So if
Mr. Smith trained with, for example, 120 unichars, can I have 160 unichars
when fine-tuning?
As far as I know, the number of classes in an LSTM cannot change.
All characters in eng and fas plus punctuation come to around 164.
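
On the unicharset question: the Tesseract 4 training wiki describes fine-tuning after adding a few characters, where lstmtraining is given both the new traineddata and the old one via --old_traineddata so that it can remap the output layer. Below is a sketch of those two steps as I understand them from the wiki. The working directory, the base model path and the iteration count are placeholders, and the function only prints the commands so the paths can be reviewed before anything is run. Whether this copes with roughly 40 extra classes is not something I have verified.

```shell
# Hypothetical working directory and base model; adjust to your setup.
WORK=$HOME/fas_finetune
BASE=tessdata_best/fas.traineddata

# Print (do not run) the two fine-tuning steps.
print_finetune_steps() {
  # Step 1: extract the LSTM network from the existing model.
  echo "combine_tessdata -e $BASE $WORK/fas.lstm"
  # Step 2: continue training from that network; --old_traineddata supplies
  # the old unicharset so it can be mapped onto the new one.
  echo "lstmtraining --continue_from $WORK/fas.lstm --old_traineddata $BASE --traineddata $WORK/fas/fas.traineddata --train_listfile $WORK/fas.training_files.txt --model_output $WORK/plus --max_iterations 3600"
}
print_finetune_steps
```

Here $WORK/fas/fas.traineddata stands for the starter traineddata that tesstrain.sh produces with the enlarged character set, as in the --traineddata argument of the from-scratch command in this thread.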


Re: [tesseract-ocr] Compute CTC targets failed while training

2018-09-26 Thread Shree Devi Kumar
>By version alpha, I trained about 1000 line and it is not so bad

You must have only done fine tuning of model then and now you are trying to
train from scratch.


Re: [tesseract-ocr] Compute CTC targets failed while training

2018-09-26 Thread Khosrobeigy.zohreh
I know, actually I am a master in LSTM. I want to resolve all the errors first
and then train on a large text.
With the alpha version, I trained on about 1000 lines and the result was not
bad. But with beta 4 I get many errors.
In alpha,
# Use LSTM
tessedit_ocr_engine_mode 1
tessedit_pageseg_mode 6

# Arabic page layout variables
segment_nonalphabetic_script 1

# Avoid dropping rows
textord_noise_rowratio 20.0
textord_noise_syfract 0.6

textord_min_linesize 2.5

# Avoid over-estimating intra-word spacing at both row and
# block levels when using old to method
tosp_old_to_method T
tosp_old_to_constrain_sp_kn T
tosp_old_sp_kn_th_factor 4.0

tosp_only_small_gaps_for_kern T
tosp_use_pre_chopping T
I used all of these, but now my model doesn't learn.
Has anything changed in beta 4, for example in text2image?

On Wed, Sep 26, 2018 at 12:53 AM Shree Devi Kumar 
wrote:

>   --fontlist "Arial"
>
> Does that have good coverage for Farsi?
>
>
> --max_iterations 5000
>
> You are trying to train from scratch with 18000 lines of text and only
> 5000 iterations. That will not work.
>
> Ray has trained on hundreds of thousands of lines of text and millions of
> iterations.
>
> On Tue, 25 Sep 2018, 16:20 Zohreh Khosrobeygi, 
> wrote:
>
>> Hi, I use this:
>> tesseract 4.0.0-beta.4
>>  leptonica-1.74.4
>>   libjpeg 8d (libjpeg-turbo 1.4.2) : libpng 1.2.54 : libtiff 4.0.6 : zlib 1.2.8
>>
>>  Found AVX2
>>  Found AVX
>>  Found SSE
>> I've trained on about 18000 lines of Persian text. I use this command:
>>
>> bash -x tesstrain.sh --fonts_dir /usr/share/fonts --lang fas \
>>   --training_text /home/zohreh/Desktop/tesseract-master/src/training/langdata/fas/fas.training_text.txt \
>>   --wordlist /home/zohreh/Desktop/tesseract-master/src/training/langdata/fas/fas.wordlist.txt \
>>   --linedata_only \
>>   --noextract_font_properties \
>>   --langdata_dir /home/zohreh/Desktop/tesseract-master/src/training/langdata \
>>   --tessdata_dir /home/zohreh/Desktop/tesseract-master/tessdata \
>>   --fontlist "Arial" \
>>   --output_dir /home/zohreh/Desktop/tesseract-master/src/training/langdata/fas/Phase2
>> and then run this:
>> sudo /home/zohreh/Desktop/tesseract-master/src/training/lstmtraining \
>>   --traineddata /home/zohreh/Desktop/tesseract-master/src/training/langdata/fas/Phase2/fas/fas.traineddata \
>>   --net_spec '[1,48,0,1Ct3,3,16Mp3,3Lfys64Lfx96Lrx96Lfx192O1c1]' \
>>   --model_output /home/zohreh/Desktop/tesseract-master/src/training/langdata/fas/Out/base \
>>   --learning_rate 0.001 \
>>   --train_listfile /home/zohreh/Desktop/tesseract-master/src/training/langdata/fas/Phase2/fas.training_files.txt \
>>   --eval_listfile /home/zohreh/Desktop/tesseract-master/src/training/langdata/fas/v/fas.training_files.txt \
>>   --max_iterations 5000 &>/home/zohreh/Desktop/tesseract-master/src/training/langdata/fas/Out/basetrain.log
>> but it always shows "Compute CTC targets failed" and the model is not good
>> at all.
>> I normalized my text, and each line of the text has at most 20 tokens.
>> Could you please help me?
>>
>>


-- 
Zohreh Khosrobeygi
University of Tehran, 2016
Tel: +989196042887
khosrobeygi.zo...@ut.ac.ir 


[tesseract-ocr] Re: I Need help getting Tesseract 4.0 C# .Net Wrapper working please!

2018-09-26 Thread James Q
Hi Vipin
I didn't get much further with that wrapper I'm afraid. In the end, I went 
for building tesseract from the C++ source code.

On Tuesday, September 25, 2018 at 6:03:19 PM UTC+1, Vipin Tom Varghese 
wrote:
>
> Hi James, my apologies for contacting you so randomly, but I had no other
> options left. I've been trying to get Tesseract 4 working using the
> tesseract.net wrapper following the wiki here, but I'm unable to build from
> source. Could you share how you got it working?
>
> Thanks
> Vipin
>
> On Monday, 8 January 2018 15:33:50 UTC+5:30, James Q wrote:
>>
>> By the way I do have the Tesseract.net nuget package working ( 
>> https://www.nuget.org/packages/tesseract.net/ ), but have 2 issues with 
>> this:
>> 1.) I need to write a separate Bitmap -> Pix converter in C#
>> 2.) I haven't yet got whitelists/blacklists working
>>
>> Neither of these were issues with the tesseract 3 Charles Weld wrapper, 
>> hence my reason for trying to get the tdhintz one working (as this is based 
>> on Charles Weld's 3 wrapper).
>> Thanks
>> James
>>
>> On Monday, January 8, 2018 at 7:49:43 AM UTC, Mohammad Mahdizadeh wrote:
>>>
>>> I have the same problem 
>>>
>>>
>>> On Friday, January 5, 2018 at 8:38:08 PM UTC+3:30, James Q wrote:

>>>> I'm trying to use this wrapper:
>>>> https://github.com/tdhintz/tesseract4win64
>>>>
>>>> It's an x64 .Net assembly with one main DLL (Tesseract.dll) and two
>>>> dependency DLLs (liblept1741.dll and libtesseract400.dll). To start with
>>>> I'm just trying to get a Visual Studio console app running. I've added
>>>> Tesseract.dll in as a reference but it fails to recognize the dependency
>>>> DLLs, throwing a runtime DllNotFoundException: "Failed to find library
>>>> "liblept1741.dll" for platform x64.".
>>>>
>>>> I've tried placing the DLLs in the .\bin\x64\Debug folder and elsewhere
>>>> along the project structure but no luck! I've tried manually adding them
>>>> to an ItemGroup in the csproj file with 'CopyToOutputDirectory Always'. I've
>>>> also tried setting TesseractEnviornment.CustomSearchPath in my Main class,
>>>> but although the runtime searches in the correct folders, it still doesn't
>>>> find the DLLs. My app is for x64 so the image type should match. I can't
>>>> think of what else to try.
>>>>
>>>> If anyone has this working I would greatly appreciate any advice.
>>>>
>>>> Thanks in advance
>>>> James



