Here are the brief instructions on how to set up the Tesseract interactive debug environment (ScrollView) on Windows:
1. Make sure you have Java Runtime Environment installed 2. Download my home-brewed single archived installation suite from http://www.4shared.com/get/Z4gnbJdP/tess_debug.html 3. Unpack the installation suit 4. Run cmd.exe 5. Change working directory to where you've unpacked the installation suit 6. Follow the instructions in http://code.google.com/p/tesseract-ocr/wiki/ViewerDebugging to run Tesseract+ScrollView from the command line To keep the reasonable forum post size here in Google Groups, I placed the more verbose and overall nicer looking instructions in my blog at http://rdaemons.blogspot.com/2011/02/tesseract-ocr-setting-up-interactive.html Warm regards, Dmitry Silaev 2011/2/6 Sriranga(78yrsold) <withblessi...@gmail.com> > Dear dmitry, > Though it may or may not help me much atleast it will be benefited for > users of tesseract-ocr - > for which users of the forum/newbies shall be thankful to you. > With Warmest Regards, > -sriranga(78yrs) > > On Sun, Feb 6, 2011 at 1:47 AM, Dmitry Silaev <daemons2...@gmail.com>wrote: > >> Dear Sriranga, >> >> I've just managed to start the interactive Tess's visualizer. I don't >> really know if it might help you much, but I can publish the step-by-step >> instructions on how to make it work. At least these instructions may help >> some of Tess community newbies. Most likely, I'll be able to publish this >> within the next 24 hours. >> >> However it's not a workable solution for me. I still in desperate need to >> know if I can provide Tess with my own baseline info using some high-level >> structures and methods. Or whatever information you may have on this >> subject. >> >> Warm regards, >> Dmitry Silaev >> >> >> >> >> 2011/2/5 Sriranga(78yrsold) <withblessi...@gmail.com> >> >> Tried to install in WinXP but failed. extract of cmd is reproduced below >>> for further guidance please. >>> C:\>set JAVA_HOME=C:\jdk1.4 >>> >>> C:\>.\build.bat all (win32) >>> '.\build.bat' is not recognized as an internal or external command, >>> operable program or batch file. >>> >>> C:\> >>> C:\>j: >>> >>> J:\tesseract-ocr-3.01alpha-r527\java>.\build.bat all (win32) >>> Piccolo Build System >>> ------------------- >>> Building with classpath >>> C:\jdk1.4\lib\tools.jar;.\lib\ant.jar;.\lib\junit.jar; >>> Starting Ant... >>> The system cannot find the path specified. >>> >>> J:\tesseract-ocr-3.01alpha-r527\java> >>> J:\tesseract-ocr-3.01alpha-r527\java> >>> >>> I may kindly be intimated where I made a mistake? >>> with warmest regards, >>> -sriranga(78yrs) >>> >>> >>> On Sat, Feb 5, 2011 at 7:28 PM, Sriranga(78yrsold) < >>> withblessi...@gmail.com> wrote: >>> >>>> As per wiki instruction on debug mode , On Windows: The build process >>>> for building ScrollView.jar is not defined. Instead copy piccolo-1.2.jar >>>> and >>>> piccolox-1.2.jar to tesseract/java - which appears prescribed >>>> for*tesseract 2.04 >>>> * >>>> . >>>> It is presumed whether by coping piccolo-1.2jar and piccolox-1.2 to >>>> tesseract/java folder of tesserac-3.01Alpha >>>> ( r527) will work? For this purpose whether picolo.java1.2( compiled >>>> source 4.3MB)have to be downloaded for WinXP? Kindly confirm - since I am >>>> not programmer/developer. >>>> With Regards, >>>> -Sriranga(78yrs) >>>> >>>> >>>> 2011/2/5 Zdenko Podobný <zde...@gmail.com> >>>> >>>> I am not sure what you if it helps you, but did you try debug mode ( >>>>> http://code.google.com/p/tesseract-ocr/wiki/ViewerDebugging)? >>>>> >>>>> Zd. >>>>> >>>>> >>>>> Dňa 05.02.2011 01:33, daemon-s wrote / napísal(a): >>>>> >>>>> Hi! >>>>> >>>>> I train Tess using separate images for every text line. Recognition is >>>>> also ran over single text line images. Recognition performs pretty >>>>> well, however there are many errors that, I believe, related to >>>>> misdetected baselines, during training or recognition - I don't know. >>>>> These include: >>>>> >>>>> " (double quote) detected as n >>>>> S detected as s (and vice versa) >>>>> V detected as v (and vice versa) >>>>> etc. >>>>> >>>>> Is there any (preferably high-level) way to provide Tess with baseline >>>>> info? Or at least obtain baseline info from Tess in order to visualize >>>>> it further for debugging? >>>>> >>>>> Thanks, >>>>> Dmitry >>>>> >>>>> >>>>> -- >>>>> You received this message because you are subscribed to the Google >>>>> Groups "tesseract-ocr" group. >>>>> To post to this group, send email to tesseract-ocr@googlegroups.com. >>>>> To unsubscribe from this group, send email to >>>>> tesseract-ocr+unsubscr...@googlegroups.com. >>>>> For more options, visit this group at >>>>> http://groups.google.com/group/tesseract-ocr?hl=en. >>>>> >>>> >>>> >>> -- >>> You received this message because you are subscribed to the Google Groups >>> "tesseract-ocr" group. >>> To post to this group, send email to tesseract-ocr@googlegroups.com. >>> To unsubscribe from this group, send email to >>> tesseract-ocr+unsubscr...@googlegroups.com. >>> For more options, visit this group at >>> http://groups.google.com/group/tesseract-ocr?hl=en. >>> >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To post to this group, send email to tesseract-ocr@googlegroups.com. >> To unsubscribe from this group, send email to >> tesseract-ocr+unsubscr...@googlegroups.com. >> For more options, visit this group at >> http://groups.google.com/group/tesseract-ocr?hl=en. >> > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To post to this group, send email to tesseract-ocr@googlegroups.com. > To unsubscribe from this group, send email to > tesseract-ocr+unsubscr...@googlegroups.com. > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to tesseract-ocr@googlegroups.com. To unsubscribe from this group, send email to tesseract-ocr+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.