Re: [tesseract-ocr] [SOLVED] Re: tess4j: NullPointerException while reading text in rectangle of image.

2018-06-09 Thread ShreeDevi Kumar
For tess4j see

https://github.com/nguyenq/tess4j/blob/master/src/test/java/net/sourceforge/tess4j/TessAPI1Test.java



On Sun 10 Jun, 2018, 12:51 AM Dattatraya Tembare, 
wrote:

> I have used another method, and it worked perfectly.
>
> public static void main(String[] args) {
>  String fileStr = "C:/EA/mp-out/im/1/3/1-0.png";
>  File file = new File(fileStr);
>  //380x45+220+170
>  int xsize = 0;
>  int ysize = 0;
>  BufferedImage bufImage=null;
>  ByteBuffer buf = null;
>  try {
>  bufImage = ImageIO.read(file);
>  IIOImage image=new IIOImage(bufImage,null,null);
>  buf = ImageIOHelper.getImageByteBuffer(image);
>  } catch (IOException e2) {
>  e2.printStackTrace();
>  }
>  Rectangle rect = new Rectangle(220, 170, 380, 45); // define an equal or
> smaller region of interest on the image
>  int bpp = 8; //Gray=8, RGB=24
>
>  Tesseract in = new ReadImageText().getTesseractInstance("C:/Program
> Files (x86)/Tesseract-OCR/tessdata/", "hin");
>  try {
>  String resultText = in.doOCR(bufImage, rect);
>  //in.doOCR(xsize, ysize, buf, rect, bpp);
>  log.info("resultText: {}", resultText);
>  } catch (TesseractException e) {
>  e.printStackTrace();
>  }
>  }
>
>
> On Saturday, June 9, 2018 at 3:07:02 PM UTC-4, Dattatraya Tembare wrote:
>>
>> I'm trying to read the text from an image at some particular location in
>> an image. I have image dimensions and desired rectangle dometions.
>> Here is the code implementation:
>>
>> public static void main(String[] args) {
>>  String fileStr = "C:/EA/mp-out/im/1/3/1-0.png";
>>  File file = new File(fileStr);
>>  //380x45+220+170
>>  int xsize = 0;
>>  int ysize = 0;
>>  BufferedImage bufImage;
>>  ByteBuffer buf = null;
>>  try {
>>  bufImage = ImageIO.read(file);
>>  IIOImage image=new IIOImage(bufImage,null,null);
>>  buf = ImageIOHelper.getImageByteBuffer(image);
>>  } catch (IOException e2) {
>>  e2.printStackTrace();
>>  }
>>  Rectangle rect = new Rectangle(0, 0, 600, 265); // define an equal or
>> smaller region of interest on the image
>>  int bpp = 8; //Gray=8, RGB=24
>>
>>  Tesseract in = new ReadImageText().getTesseractInstance("C:/Program
>> Files (x86)/Tesseract-OCR/tessdata/", "hin");
>>  try {
>>  String resultText = in.doOCR(xsize, ysize, buf, rect, bpp);
>>  log.info("resultText: {}", resultText);
>>  } catch (TesseractException e) {
>>  e.printStackTrace();
>>  }
>>  }
>>
>> When I have executed the code, go below error:
>>
>> java.lang.NullPointerException: null
>>  at net.sourceforge.tess4j.Tesseract.getOCRText(Tesseract.java:434)
>>  at net.sourceforge.tess4j.Tesseract.doOCR(Tesseract.java:351)
>>  at net.sourceforge.tess4j.Tesseract.doOCR(Tesseract.java:323)
>>  at com.ea.ocr.tesseract.ReadImageText.main(ReadImageText.java:74)
>>
>> Please look into it and let me know if anyone has any idea.
>>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/243234f1-405a-45bb-a249-dd8eebd0e9f9%40googlegroups.com
> 
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUyDSJ%2BvER64TySsg82jaP-UAH1T9BX%3Db_yzJDWnRjjZg%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] [SOLVED] Re: tess4j: NullPointerException while reading text in rectangle of image.

2018-06-09 Thread Dattatraya Tembare
I have used another method, and it worked perfectly.

public static void main(String[] args) {
 String fileStr = "C:/EA/mp-out/im/1/3/1-0.png";
 File file = new File(fileStr);
 //380x45+220+170
 int xsize = 0;
 int ysize = 0;
 BufferedImage bufImage=null;
 ByteBuffer buf = null;
 try {
 bufImage = ImageIO.read(file);
 IIOImage image=new IIOImage(bufImage,null,null);
 buf = ImageIOHelper.getImageByteBuffer(image);
 } catch (IOException e2) {
 e2.printStackTrace();
 }
 Rectangle rect = new Rectangle(220, 170, 380, 45); // define an equal or 
smaller region of interest on the image
 int bpp = 8; //Gray=8, RGB=24
 
 Tesseract in = new ReadImageText().getTesseractInstance("C:/Program Files 
(x86)/Tesseract-OCR/tessdata/", "hin");
 try {
 String resultText = in.doOCR(bufImage, rect);
 //in.doOCR(xsize, ysize, buf, rect, bpp);
 log.info("resultText: {}", resultText);
 } catch (TesseractException e) {
 e.printStackTrace();
 }
 }


On Saturday, June 9, 2018 at 3:07:02 PM UTC-4, Dattatraya Tembare wrote:
>
> I'm trying to read the text from an image at some particular location in 
> an image. I have image dimensions and desired rectangle dometions.
> Here is the code implementation:
>
> public static void main(String[] args) {
>  String fileStr = "C:/EA/mp-out/im/1/3/1-0.png";
>  File file = new File(fileStr);
>  //380x45+220+170
>  int xsize = 0;
>  int ysize = 0;
>  BufferedImage bufImage;
>  ByteBuffer buf = null;
>  try {
>  bufImage = ImageIO.read(file);
>  IIOImage image=new IIOImage(bufImage,null,null);
>  buf = ImageIOHelper.getImageByteBuffer(image);
>  } catch (IOException e2) {
>  e2.printStackTrace();
>  }
>  Rectangle rect = new Rectangle(0, 0, 600, 265); // define an equal or 
> smaller region of interest on the image
>  int bpp = 8; //Gray=8, RGB=24
>  
>  Tesseract in = new ReadImageText().getTesseractInstance("C:/Program 
> Files (x86)/Tesseract-OCR/tessdata/", "hin");
>  try {
>  String resultText = in.doOCR(xsize, ysize, buf, rect, bpp);
>  log.info("resultText: {}", resultText);
>  } catch (TesseractException e) {
>  e.printStackTrace();
>  }
>  }
>
> When I have executed the code, go below error: 
>
> java.lang.NullPointerException: null
>  at net.sourceforge.tess4j.Tesseract.getOCRText(Tesseract.java:434)
>  at net.sourceforge.tess4j.Tesseract.doOCR(Tesseract.java:351)
>  at net.sourceforge.tess4j.Tesseract.doOCR(Tesseract.java:323)
>  at com.ea.ocr.tesseract.ReadImageText.main(ReadImageText.java:74)
>
> Please look into it and let me know if anyone has any idea.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/243234f1-405a-45bb-a249-dd8eebd0e9f9%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] tess4j: NullPointerException while reading text in rectangle of image.

2018-06-09 Thread Dattatraya Tembare
I'm trying to read the text from an image at some particular location in an 
image. I have image dimensions and desired rectangle dometions.
Here is the code implementation:

public static void main(String[] args) {
 String fileStr = "C:/EA/mp-out/im/1/3/1-0.png";
 File file = new File(fileStr);
 //380x45+220+170
 int xsize = 0;
 int ysize = 0;
 BufferedImage bufImage;
 ByteBuffer buf = null;
 try {
 bufImage = ImageIO.read(file);
 IIOImage image=new IIOImage(bufImage,null,null);
 buf = ImageIOHelper.getImageByteBuffer(image);
 } catch (IOException e2) {
 e2.printStackTrace();
 }
 Rectangle rect = new Rectangle(0, 0, 600, 265); // define an equal or 
smaller region of interest on the image
 int bpp = 8; //Gray=8, RGB=24
 
 Tesseract in = new ReadImageText().getTesseractInstance("C:/Program Files 
(x86)/Tesseract-OCR/tessdata/", "hin");
 try {
 String resultText = in.doOCR(xsize, ysize, buf, rect, bpp);
 log.info("resultText: {}", resultText);
 } catch (TesseractException e) {
 e.printStackTrace();
 }
 }

When I have executed the code, go below error: 

java.lang.NullPointerException: null
 at net.sourceforge.tess4j.Tesseract.getOCRText(Tesseract.java:434)
 at net.sourceforge.tess4j.Tesseract.doOCR(Tesseract.java:351)
 at net.sourceforge.tess4j.Tesseract.doOCR(Tesseract.java:323)
 at com.ea.ocr.tesseract.ReadImageText.main(ReadImageText.java:74)

Please look into it and let me know if anyone has any idea.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/b947f127-d173-46a8-aa94-5c4b480ab405%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Re: error while running tesseract

2018-06-09 Thread Shinehah-Gnolaum
How do I go into tessdatamanager.cpp and fix the problem?

On Wednesday, February 8, 2017 at 7:59:37 AM UTC-6, Quan Nguyen wrote:
>
> Wrong version of traineddata?
>
>
> https://github.com/tesseract-ocr/tesseract/wiki/FAQ#actual_tessdata_num_entries_-tessdata_num_entrieserrorassert-failedin-file-ccutiltessdatamanagercpp-line-55_
>
> On Wednesday, February 8, 2017 at 1:47:29 AM UTC-6, vibha@prueba.io 
> wrote:
>>
>> I got this error while running a test case which involves verifying a 
>> toast message using tesseract on intellij3.4( java version 1.8.0_121)
>>
>>
>> A fatal error has been detected by the Java Runtime Environment:
>> SIGSEGV (0xb) at pc=0x7f10e046333a, pid=4980, tid=0x7f111d577700
>> JRE version: Java(TM) SE Runtime Environment (8.0_121-b13) (build 
>> 1.8.0_121-b13)
>> Java VM: Java HotSpot(TM) 64-Bit Server VM (25.121-b13 mixed mode 
>> linux-amd64 compressed oops)
>> Problematic frame:
>> C [libtesseract.so.3.0.3+0x22533a] ERRCODE::error(char const*, 
>> TessErrorLogCode, char const*, ...) const+0x16a
>>
>> Failed to write core dump. Core dumps have been disabled. To enable core 
>> dumping, try "ulimit -c unlimited" before starting Java again
>>
>> actual_tessdata_num_entries_ <= TESSDATA_NUM_ENTRIES:Error:Assert 
>> failed:in file tessdatamanager.cpp, line 53
>>
>>
>> tried executing "ulimit -c unlimited" still doesn't set it right.
>>
>>
>> any suggestion will be of great help.
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/365c02b8-2179-4f56-bcc0-fb1ca4db4ed3%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] error running configure; how do I start over?

2018-06-09 Thread Shinehah-Gnolaum
I made an error running configure. The first time I ran the line given in 
the instructions at 
https://github.com/tesseract-ocr/tesseract/wiki/Compiling#macos It didn't 
work because the c++ compiler couldn't make executables, it said. The 
second time I didn't set the environment variables.

How do I start over? Do I have to undo what I've done before I start over, 
or can I just start over and what was done before gets overwritten?

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/b6d93813-18eb-4ddd-9895-7517b3350092%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] error

2018-06-09 Thread ShreeDevi Kumar
You are probably using a wrong traineddata file i.e. 3.0x version file with
latest 4.0x code from master branch.


ShreeDevi

भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com


On Sat, Jun 9, 2018 at 3:33 PM Vishal Jha  wrote:

> 1, 'read_params_file: parameter not found: enable_new_segsearch')
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/c76d5113-2583-4e72-8c7b-59eee8b39f02%40googlegroups.com
> 
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUFqW8U_2T7Cfc9xCAY7XO6uF1JeFCdy3yicNWG%3D2%2B%2B-w%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] error

2018-06-09 Thread Vishal Jha


1, 'read_params_file: parameter not found: enable_new_segsearch')

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/c76d5113-2583-4e72-8c7b-59eee8b39f02%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] Unrecognized argument --linedata_only

2018-06-09 Thread ShreeDevi Kumar
Try without   --eval_listfile /home/kddlab/Desktop/tesseract-master/1MyData/
testfas1/fas.training_files.txt \

ShreeDevi

भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com


On Sat, Jun 9, 2018 at 1:58 PM Khosrobeigy.zohreh 
wrote:

> Thank. by your command fixed.
>  but next i used this:
>
> lstmtraining   \
>   --traineddata
> /home/kddlab/Desktop/tesseract-master/1MyData/testfas/fas/fas.traineddata
>  --net_spec '[1,48,0,1Ct3,3,16Mp3,3Lfys64Lfx96Lrx96Lfx192O1c1]' \
>   --model_output
> /home/kddlab/Desktop/tesseract-master/1MyData/testfasout/base
> --learning_rate 20e-4 \
>   --train_listfile
> /home/kddlab/Desktop/tesseract-master/1MyData/testfas/fas.training_files.txt
> \
>   --eval_listfile
> /home/kddlab/Desktop/tesseract-master/1MyData/testfas1/fas.training_files.txt
> \
>   --max_iterations 5000
> &>/home/kddlab/Desktop/tesseract-master/1MyData/testfasout/basetrain.log
>  and i have this *error now*
>
> *Segmentation fault (core dumped)*
>
>
> Could you please help me again?
>
> On Sat, Jun 9, 2018 at 11:33 AM, ShreeDevi Kumar 
> wrote:
>
>> --linedata_only should work.
>>
>> > tesseract 4.0.0-beta.1
>>
>> Do you know which commit? Please try with latest code.
>>
>> >   i am using   src/training/tesstrain.sh
>>
>> The command you used was:
>>
>> >  sudo tesstrain.sh
>>
>> Why do you need sudo?
>>
>> Please run the script with
>>
>> bash -x   src/training/tesstrain.sh etc ... and report with the console
>> log.
>>
>> ShreeDevi
>> 
>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>
>>
>> On Sat, Jun 9, 2018 at 11:57 AM Zohreh Khosrobeygi <
>> beigy.zoh...@gmail.com> wrote:
>>
>>> Yes, i am using   src/training/tesstrain.sh
>>>
>>>
>>> On Friday, June 8, 2018 at 6:44:27 PM UTC+4:30, shree wrote:

 Are you using the correct version of tesstrain.sh?

 It should be in src/training/tesstrain.sh


 ShreeDevi
 
 भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com


 On Fri, Jun 8, 2018 at 6:49 PM Zohreh Khosrobeygi 
 wrote:

> Hi,
> I have been training tesseract but i have this errore"
>
> Unrecognized argument --linedata_only
>
> And it's my version of tesseract"
> tesseract 4.0.0-beta.1
>  leptonica-1.74.4
>   libjpeg 8d (libjpeg-turbo 1.4.2) : libpng 1.2.54 : libtiff 4.0.6 :
> zlib 1.2.8
>
>  Found AVX2
>  Found AVX
>  Found SSE
>
> Besides it's my command:
> sudo tesstrain.sh --fonts_dir /usr/share/fonts --lang fas
> --training_text
> /home/kddlab/Desktop/tesseract-master/1MyData/fas/fas.training_text
>  --linedata_only \
>   --noextract_font_properties --langdata_dir
> /home/kddlab/Desktop/tesseract-master/langdata \
>   --tessdata_dir /home/kddlab/Desktop/tesseract-master/tessdata \
>   --fontlist "B Mitra" --output_dir
> /home/kddlab/Desktop/tesseract-master/1MyData/testfas
>
> And i have config file:
> # Use LSTM
> tessedit_ocr_engine_mode 1
> tessedit_pageseg_mode 6
>
> How can i solve this?
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send
> an email to tesseract-oc...@googlegroups.com.
> To post to this group, send email to tesser...@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/a692d903-34be-4a51-99c5-11ed34bb6cef%40googlegroups.com
> 
> .
> For more options, visit https://groups.google.com/d/optout.
>
 --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to tesseract-ocr+unsubscr...@googlegroups.com.
>>> To post to this group, send email to tesseract-ocr@googlegroups.com.
>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/tesseract-ocr/caf0b092-1a2c-4e73-9171-16678495af51%40googlegroups.com
>>> 
>>> .
>>> For more options, visit https://groups.google.com/d/optout.
>>>
>> --
>> You received this message because you are subscribed to a topic in the
>> Google Groups "tesseract-ocr" group.
>> To unsubscribe from this topic, visit
>> 

Re: [tesseract-ocr] Unrecognized argument --linedata_only

2018-06-09 Thread Khosrobeigy.zohreh
Thank. by your command fixed.
 but next i used this:

lstmtraining   \
  --traineddata
/home/kddlab/Desktop/tesseract-master/1MyData/testfas/fas/fas.traineddata
 --net_spec '[1,48,0,1Ct3,3,16Mp3,3Lfys64Lfx96Lrx96Lfx192O1c1]' \
  --model_output
/home/kddlab/Desktop/tesseract-master/1MyData/testfasout/base
--learning_rate 20e-4 \
  --train_listfile
/home/kddlab/Desktop/tesseract-master/1MyData/testfas/fas.training_files.txt
\
  --eval_listfile
/home/kddlab/Desktop/tesseract-master/1MyData/testfas1/fas.training_files.txt
\
  --max_iterations 5000
&>/home/kddlab/Desktop/tesseract-master/1MyData/testfasout/basetrain.log
 and i have this *error now*

*Segmentation fault (core dumped)*


Could you please help me again?

On Sat, Jun 9, 2018 at 11:33 AM, ShreeDevi Kumar 
wrote:

> --linedata_only should work.
>
> > tesseract 4.0.0-beta.1
>
> Do you know which commit? Please try with latest code.
>
> >   i am using   src/training/tesstrain.sh
>
> The command you used was:
>
> >  sudo tesstrain.sh
>
> Why do you need sudo?
>
> Please run the script with
>
> bash -x   src/training/tesstrain.sh etc ... and report with the console
> log.
>
> ShreeDevi
> 
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>
>
> On Sat, Jun 9, 2018 at 11:57 AM Zohreh Khosrobeygi 
> wrote:
>
>> Yes, i am using   src/training/tesstrain.sh
>>
>>
>> On Friday, June 8, 2018 at 6:44:27 PM UTC+4:30, shree wrote:
>>>
>>> Are you using the correct version of tesstrain.sh?
>>>
>>> It should be in src/training/tesstrain.sh
>>>
>>>
>>> ShreeDevi
>>> 
>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>>
>>>
>>> On Fri, Jun 8, 2018 at 6:49 PM Zohreh Khosrobeygi 
>>> wrote:
>>>
 Hi,
 I have been training tesseract but i have this errore"

 Unrecognized argument --linedata_only

 And it's my version of tesseract"
 tesseract 4.0.0-beta.1
  leptonica-1.74.4
   libjpeg 8d (libjpeg-turbo 1.4.2) : libpng 1.2.54 : libtiff 4.0.6 :
 zlib 1.2.8

  Found AVX2
  Found AVX
  Found SSE

 Besides it's my command:
 sudo tesstrain.sh --fonts_dir /usr/share/fonts --lang fas
 --training_text 
 /home/kddlab/Desktop/tesseract-master/1MyData/fas/fas.training_text
--linedata_only \
   --noextract_font_properties --langdata_dir 
 /home/kddlab/Desktop/tesseract-master/langdata
 \
   --tessdata_dir /home/kddlab/Desktop/tesseract-master/tessdata \
   --fontlist "B Mitra" --output_dir /home/kddlab/Desktop/
 tesseract-master/1MyData/testfas

 And i have config file:
 # Use LSTM
 tessedit_ocr_engine_mode 1
 tessedit_pageseg_mode 6

 How can i solve this?

 --
 You received this message because you are subscribed to the Google
 Groups "tesseract-ocr" group.
 To unsubscribe from this group and stop receiving emails from it, send
 an email to tesseract-oc...@googlegroups.com.
 To post to this group, send email to tesser...@googlegroups.com.
 Visit this group at https://groups.google.com/group/tesseract-ocr.
 To view this discussion on the web visit https://groups.google.com/d/
 msgid/tesseract-ocr/a692d903-34be-4a51-99c5-11ed34bb6cef%
 40googlegroups.com
 
 .
 For more options, visit https://groups.google.com/d/optout.

>>> --
>> You received this message because you are subscribed to the Google Groups
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to tesseract-ocr+unsubscr...@googlegroups.com.
>> To post to this group, send email to tesseract-ocr@googlegroups.com.
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit https://groups.google.com/d/
>> msgid/tesseract-ocr/caf0b092-1a2c-4e73-9171-16678495af51%
>> 40googlegroups.com
>> 
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
> --
> You received this message because you are subscribed to a topic in the
> Google Groups "tesseract-ocr" group.
> To unsubscribe from this topic, visit https://groups.google.com/d/
> topic/tesseract-ocr/GLlgILi5xOA/unsubscribe.
> To unsubscribe from this group and all its topics, send an email to
> tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit https://groups.google.com/d/
> msgid/tesseract-ocr/CAG2NduWUNUQGwuRfmQ5EsdewAcvBj
> xOEzKcTrBqYmrSynHuoWg%40mail.gmail.com
> 

[tesseract-ocr] How to recognize text in images with blue background and boxed

2018-06-09 Thread eng . ahmed . osama . 1190
Hi All,


I have this image and I want to recognize the letters inside it. How I can 
to that.


I have tried to change the background to white and text to black but failed 
also.



-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/88cc477d-c69b-42e5-98b2-0c4ef8998196%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] Unrecognized argument --linedata_only

2018-06-09 Thread ShreeDevi Kumar
--linedata_only should work.

> tesseract 4.0.0-beta.1

Do you know which commit? Please try with latest code.

>   i am using   src/training/tesstrain.sh

The command you used was:

>  sudo tesstrain.sh

Why do you need sudo?

Please run the script with

bash -x   src/training/tesstrain.sh etc ... and report with the console log.

ShreeDevi

भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com


On Sat, Jun 9, 2018 at 11:57 AM Zohreh Khosrobeygi 
wrote:

> Yes, i am using   src/training/tesstrain.sh
>
>
> On Friday, June 8, 2018 at 6:44:27 PM UTC+4:30, shree wrote:
>>
>> Are you using the correct version of tesstrain.sh?
>>
>> It should be in src/training/tesstrain.sh
>>
>>
>> ShreeDevi
>> 
>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>
>>
>> On Fri, Jun 8, 2018 at 6:49 PM Zohreh Khosrobeygi 
>> wrote:
>>
>>> Hi,
>>> I have been training tesseract but i have this errore"
>>>
>>> Unrecognized argument --linedata_only
>>>
>>> And it's my version of tesseract"
>>> tesseract 4.0.0-beta.1
>>>  leptonica-1.74.4
>>>   libjpeg 8d (libjpeg-turbo 1.4.2) : libpng 1.2.54 : libtiff 4.0.6 :
>>> zlib 1.2.8
>>>
>>>  Found AVX2
>>>  Found AVX
>>>  Found SSE
>>>
>>> Besides it's my command:
>>> sudo tesstrain.sh --fonts_dir /usr/share/fonts --lang fas
>>> --training_text
>>> /home/kddlab/Desktop/tesseract-master/1MyData/fas/fas.training_text
>>>  --linedata_only \
>>>   --noextract_font_properties --langdata_dir
>>> /home/kddlab/Desktop/tesseract-master/langdata \
>>>   --tessdata_dir /home/kddlab/Desktop/tesseract-master/tessdata \
>>>   --fontlist "B Mitra" --output_dir
>>> /home/kddlab/Desktop/tesseract-master/1MyData/testfas
>>>
>>> And i have config file:
>>> # Use LSTM
>>> tessedit_ocr_engine_mode 1
>>> tessedit_pageseg_mode 6
>>>
>>> How can i solve this?
>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to tesseract-oc...@googlegroups.com.
>>> To post to this group, send email to tesser...@googlegroups.com.
>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/tesseract-ocr/a692d903-34be-4a51-99c5-11ed34bb6cef%40googlegroups.com
>>> 
>>> .
>>> For more options, visit https://groups.google.com/d/optout.
>>>
>> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/caf0b092-1a2c-4e73-9171-16678495af51%40googlegroups.com
> 
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduWUNUQGwuRfmQ5EsdewAcvBjxOEzKcTrBqYmrSynHuoWg%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] Unrecognized argument --linedata_only

2018-06-09 Thread Zohreh Khosrobeygi
Yes, i am using   src/training/tesstrain.sh


On Friday, June 8, 2018 at 6:44:27 PM UTC+4:30, shree wrote:
>
> Are you using the correct version of tesstrain.sh?
>
> It should be in src/training/tesstrain.sh
>
>
> ShreeDevi
> 
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>
>
> On Fri, Jun 8, 2018 at 6:49 PM Zohreh Khosrobeygi  > wrote:
>
>> Hi,
>> I have been training tesseract but i have this errore"
>>
>> Unrecognized argument --linedata_only
>>  
>> And it's my version of tesseract"
>> tesseract 4.0.0-beta.1
>>  leptonica-1.74.4
>>   libjpeg 8d (libjpeg-turbo 1.4.2) : libpng 1.2.54 : libtiff 4.0.6 : zlib 
>> 1.2.8
>>
>>  Found AVX2
>>  Found AVX
>>  Found SSE
>>
>> Besides it's my command:
>> sudo tesstrain.sh --fonts_dir /usr/share/fonts --lang fas
>> --training_text 
>> /home/kddlab/Desktop/tesseract-master/1MyData/fas/fas.training_text
>>  --linedata_only \
>>   --noextract_font_properties --langdata_dir 
>> /home/kddlab/Desktop/tesseract-master/langdata \
>>   --tessdata_dir /home/kddlab/Desktop/tesseract-master/tessdata \
>>   --fontlist "B Mitra" --output_dir 
>> /home/kddlab/Desktop/tesseract-master/1MyData/testfas
>>
>> And i have config file:
>> # Use LSTM
>> tessedit_ocr_engine_mode 1
>> tessedit_pageseg_mode 6
>>
>> How can i solve this?
>>
>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to tesseract-oc...@googlegroups.com .
>> To post to this group, send email to tesser...@googlegroups.com 
>> .
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/a692d903-34be-4a51-99c5-11ed34bb6cef%40googlegroups.com
>>  
>> 
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/caf0b092-1a2c-4e73-9171-16678495af51%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Re: Unrecognized argument --linedata_only

2018-06-09 Thread Zohreh Khosrobeygi


On Friday, June 8, 2018 at 5:49:43 PM UTC+4:30, Zohreh Khosrobeygi wrote:
>
> Hi,
> I have been training tesseract but i have this errore"
>
> Unrecognized argument --linedata_only
>  
> And it's my version of tesseract"
> tesseract 4.0.0-beta.1
>  leptonica-1.74.4
>   libjpeg 8d (libjpeg-turbo 1.4.2) : libpng 1.2.54 : libtiff 4.0.6 : zlib 
> 1.2.8
>
>  Found AVX2
>  Found AVX
>  Found SSE
>
> Besides it's my command:
> sudo tesstrain.sh --fonts_dir /usr/share/fonts --lang fas
> --training_text 
> /home/kddlab/Desktop/tesseract-master/1MyData/fas/fas.training_text
>  --linedata_only \
>   --noextract_font_properties --langdata_dir 
> /home/kddlab/Desktop/tesseract-master/langdata \
>   --tessdata_dir /home/kddlab/Desktop/tesseract-master/tessdata \
>   --fontlist "B Mitra" --output_dir 
> /home/kddlab/Desktop/tesseract-master/1MyData/testfas
>
> And i have config file:
> # Use LSTM
> tessedit_ocr_engine_mode 1
> tessedit_pageseg_mode 6
>
> How can i solve this?
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/2fb39a22-f0e9-4bb0-96b3-8c6624694bc9%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.