dear Mr. Sriranga, I am interested in Telugu OCR and I am following your posts in all TEssaract forums with interest . I myself have not been able to run the tesseract program on my PC windows Xp every time I try the black windows flashes once and disappears . how did you prepare your TIFF training files and how did you determine which conjunct letters to include and which to ignore . as theoretically both for telugu and kannada there can be double and triple conjunct in an infinite varieties in to a few thousands
On Apr 25, 8:16 am, "Sriranga(77yrsold)" <[email protected]> wrote: > Today, successfully trained and generated kan.trainedata file - using > tesseract.exe, mftraining, cntraining, unicharset extractor.exe of * > tesseract(r319svn)*. > I did NOT use wordlist2dawg.exe. Thus able to generated six files viz. > intemp, Microfeat, normproto, uncharset, mfunicharset, pffmtable. lastly > generated kan.traineddata file. Tested using tif file output was fine ( with > a few misspelling) > Attached extract of cmd.exe of WinXP and tesseractlog file > reports(incomplete) for your information. > With regards, > -sriranga(77yrsold) > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To post to this group, send email to [email protected]. > To unsubscribe from this group, send email to > [email protected]. > For more options, visit this group > athttp://groups.google.com/group/tesseract-ocr?hl=en. > > tesseract-log reports.txt > 21KViewDownload > > Extract of CMD(winXP).odt > 16KViewDownload > > kan.unicharset > 38KViewDownload -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.

