Hi,

Is there any config. parameter or setting to detect the beginning of a 
paragraph while using tesseract?

I am using Tess4J to convert a pdf to text. I am facing problems while 
recognising same paragraph after page break, i.e, when the page changes in 
the pdf, the first line of the next page is in continuation to the last 
paragraph of the last page , or if the line is a new paragraph? 

Thanks,
Ashish

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/6e3f2124-fd1a-4374-bcb6-5771bd87bab7%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to