Dear HARIDALLAS,

I am happy to note that you are interested in Telugu OCR. You must have heard the name of Rakesh, who is doing research on Telugu. I think it is better to approach [email protected] for further details. I hope your wishes will be fulfilled.

With best wishes and good luck,
-sriranga(77yrsold)

On Sun, May 9, 2010 at 5:47 AM, haridallas <[email protected]> wrote:
> Dear Mr. Sriranga,
> I am interested in Telugu OCR and I am following your posts in all
> Tesseract forums with interest. I myself have not been able to run
> the Tesseract program on my Windows XP PC: every time I try, the
> black window flashes once and disappears.
> How did you prepare your TIFF training files, and how did you
> determine which conjunct letters to include and which to ignore?
> Theoretically, for both Telugu and Kannada, there can be double
> and triple conjuncts in a seemingly infinite variety, running into
> a few thousand.
>
> On Apr 25, 8:16 am, "Sriranga(77yrsold)" <[email protected]> wrote:
> > Today, I successfully trained and generated a kan.traineddata file using
> > tesseract.exe, mftraining, cntraining, and unicharset_extractor.exe of
> > tesseract (r319 svn).
> > I did NOT use wordlist2dawg.exe. I was thus able to generate six files, viz.
> > inttemp, Microfeat, normproto, unicharset, mfunicharset, and pffmtable, and
> > lastly generated the kan.traineddata file. I tested it using a tif file; the
> > output was fine (with a few misspellings).
> > Attached are an extract of cmd.exe from WinXP and the tesseract log file
> > reports (incomplete) for your information.
> > With regards,
> > -sriranga(77yrsold)
> >
> > Attachments: tesseract-log reports.txt (21K), Extract of CMD(winXP).odt (16K), kan.unicharset (38K)

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to [email protected].
For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.

