[tesseract-ocr] Re: text close to lines

2017-07-10 Thread Quan Nguyen
You can call Lept4J's LeptUtils.removeLines(Pix pixs).

http://tess4j.sourceforge.net/docs/index.html
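
To illustrate the general idea of removing table rules before OCR, here is a minimal sketch that uses only the Java standard library. It is not Leptonica's algorithm and not what LeptUtils.removeLines does internally. It simply whitens any straight horizontal or vertical run of dark pixels that is longer than a typical character stroke. The names LineRemovalSketch, removeLongRuns, minRun and threshold are hypothetical, and the values for minRun and threshold have to be tuned per image.

```java
import java.awt.image.BufferedImage;

/** Stdlib-only sketch: whiten long straight runs of dark pixels (table rules). */
class LineRemovalSketch {

    static final int WHITE = 0xFFFFFF;

    /** True if the pixel's average RGB value is below the assumed darkness threshold. */
    static boolean isDark(BufferedImage img, int x, int y, int threshold) {
        int rgb = img.getRGB(x, y);
        int r = (rgb >> 16) & 0xFF, g = (rgb >> 8) & 0xFF, b = rgb & 0xFF;
        return (r + g + b) / 3 < threshold;
    }

    /**
     * Returns a copy of src in which every horizontal or vertical run of
     * at least minRun dark pixels is painted white. The source is not modified.
     */
    static BufferedImage removeLongRuns(BufferedImage src, int minRun, int threshold) {
        int w = src.getWidth(), h = src.getHeight();
        BufferedImage out = new BufferedImage(w, h, BufferedImage.TYPE_INT_RGB);
        for (int y = 0; y < h; y++) {
            for (int x = 0; x < w; x++) {
                out.setRGB(x, y, src.getRGB(x, y));
            }
        }

        // Horizontal pass: scan each row; x == w acts as a sentinel that closes the last run.
        for (int y = 0; y < h; y++) {
            int start = -1;
            for (int x = 0; x <= w; x++) {
                boolean dark = x < w && isDark(src, x, y, threshold);
                if (dark && start < 0) {
                    start = x;
                } else if (!dark && start >= 0) {
                    if (x - start >= minRun) {
                        for (int i = start; i < x; i++) out.setRGB(i, y, WHITE);
                    }
                    start = -1;
                }
            }
        }

        // Vertical pass: same logic per column, still reading from the untouched source.
        for (int x = 0; x < w; x++) {
            int start = -1;
            for (int y = 0; y <= h; y++) {
                boolean dark = y < h && isDark(src, x, y, threshold);
                if (dark && start < 0) {
                    start = y;
                } else if (!dark && start >= 0) {
                    if (y - start >= minRun) {
                        for (int i = start; i < y; i++) out.setRGB(x, i, WHITE);
                    }
                    start = -1;
                }
            }
        }
        return out;
    }
}
```

The cleaned BufferedImage could then be passed to Tess4J's doOCR(BufferedImage) overload instead of the original file. Note that this simple approach also erases any part of a glyph that lies directly on a rule, so strokes touching the lines may get clipped.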


On Monday, July 10, 2017 at 3:24:12 AM UTC-5, GuillaumeQ wrote:

> I have a document with some text written in a table. The lines of the table 
> are pretty close to the text. When I call doOCR, I don't get the text between 
> the lines. Is there any way to improve the recognition and read text that is 
> close to lines? The image is attached.
>
> My code:
>
> import net.sourceforge.tess4j.*
>
> def ocrToStream() {
>     def imageFile = new File("path\\to.PNG")
>     ITesseract instance = new Tesseract1() // JNA Direct Mapping
>     // replace "" with the path to the parent directory of tessdata
>     instance.setDatapath("")
>     instance.setLanguage("fra")
>
>     try {
>         def result = instance.doOCR(imageFile)
>         System.out.println(result)
>     } catch (TesseractException e) {
>         System.err.println(e.getMessage())
>     } catch (IOException e) {
>         System.err.println(e.getMessage())
>     }
> }
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/d040f7a8-9cd6-4830-b29c-7175e3be58e7%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Re: text close to lines

2017-07-10 Thread THintz
Charles Weld's Tesseract .NET wrapper implements Leptonica's RemoveLines for 
grayscale images in Pix.cs.
