@Quan OK, I understand that with psm 0 OCR isn't performed, but I expected that running "tesseract input_image output_text -l eng -psm 0" would write the analysis result to the output_text file. With Tesseract 3.02 it doesn't :(
@zdenop So Tesseract v. 3.02 doesn't support this feature... I'll try version 3.03! Many thanks!

On Sunday, May 11, 2014 13:53:45 UTC+2, Quan Nguyen wrote:
> With psm 0, Tesseract does not perform its normal OCR function but analyzes the layout; it produces such characteristics as Orientation, Writing Direction, and Textline Order. Check the Tess4J unit tests for usage of OSD.
>
> On Sunday, May 11, 2014 5:48:39 AM UTC-5, Joe Aspara wrote:
>> I'm struggling with the OSD function of Tesseract 3.02.
>> I tried the standalone version via the command line and the Tess4J version too, but I always get an error with different input types.
>>
>> I downloaded the osd.traineddata for version 3.01 (I guess no such file exists yet for v3.02) from here
>> https://code.google.com/p/tesseract-ocr/downloads/detail?name=tesseract-ocr-3.01.osd.tar.gz&can=2&q=
>> and copied it into the TESSDATA folder.
>>
>> Below are my experiments:
>>
>> COMMAND LINE
>> tesseract input_image output_text -l eng -psm 0
>> response: Error during processing.
>>
>> With psm = 1 it reads text with very bad quality; with psm = 2 or 3 it gives me empty output.
>>
>> As far as I know, only the values 0 and 1 perform OSD! From the reference:
>> 0 = Orientation and script detection (OSD) only.
>> 1 = Automatic page segmentation with OSD.
>>
>> TESS4J
>> Tesseract instance = Tesseract.getInstance();
>> instance.setLanguage("ita");
>> instance.setPageSegMode(TessPageSegMode.PSM_AUTO_OSD);
>> String result = instance.doOCR(myImage);
>>
>> The result is always empty at the end.
>>
>> Knowing the input orientation is critical for my project, but so far I haven't found a way to accomplish this.
>>
>> I hope somebody can help me! Thanks in advance
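For anyone repeating the psm 0 experiment from Java without going through Tess4J, here is a minimal sketch that builds the same command line quoted in this thread and captures whatever the tool prints to the console. The class name `OsdCommand` and its methods are hypothetical helpers, not part of Tesseract or Tess4J, and it assumes a `tesseract` executable is on the PATH. The thread does not say where a given version writes the OSD report, so the sketch only collects console output and the exit code rather than parsing anything.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.List;

// Hypothetical helper, not part of Tesseract or Tess4J.
final class OsdCommand {
    // Mode numbers as quoted from the reference in this thread.
    static final int PSM_OSD_ONLY = 0;  // orientation and script detection only
    static final int PSM_AUTO_OSD = 1;  // automatic page segmentation with OSD

    // Builds: tesseract <image> <outputBase> -l <lang> -psm <psm>
    // The single-dash "-psm" spelling is the one used in this thread (3.0x era).
    static List<String> buildCommand(String image, String outputBase, String lang, int psm) {
        if (psm < 0) {
            throw new IllegalArgumentException("psm must be non-negative");
        }
        return List.of("tesseract", image, outputBase, "-l", lang, "-psm", Integer.toString(psm));
    }

    // Runs the command and returns the exit code plus everything printed to
    // stdout and stderr, so a message such as "Error during processing."
    // is visible to the caller. Assumes the executable is on the PATH.
    static String run(List<String> command) throws IOException, InterruptedException {
        Process process = new ProcessBuilder(command)
                .redirectErrorStream(true) // merge stderr into stdout
                .start();
        String output = new String(process.getInputStream().readAllBytes(), StandardCharsets.UTF_8);
        int exitCode = process.waitFor();
        return "exit=" + exitCode + "\n" + output;
    }
}
```

Calling `OsdCommand.run(OsdCommand.buildCommand("input_image", "output_text", "eng", OsdCommand.PSM_OSD_ONLY))` reproduces the command-line experiment above. Comparing the returned text between 3.02 and 3.03 should show whether the newer version reports the analysis on the console instead of in output_text.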

