Hello, this sounds interesting. I don't know much, almost nothing about programing. But, would it be possible to create a font of someone's handwriting, (my mother's) then use that font with your script to train the machine to recognise her handwriting? I have 50 years worth of letters to be OCR'ed. The letters are all in Norwegian, so we need the teach the machine norwegian as well? Thanks alleykat
On Feb 27, 1:26 pm, Debayan Banerjee <[email protected]> wrote: > Hi list, > I have been working on a tool to automatically generate the files required > by tesseract-ocr for adding support to a new script. This tool takes as > input a file containing all characters of the alphabet, and a directory of > all different fonts. It then generates several tif images and corresponding > box files, and then proceeds to generate the 5 training files: > > - inttemp > - normproto > - unicharset > - Microfeat > - pffmtable > > Here are the links: > > 1. > http://tesseractindic.googlecode.com/files/tesseract_trainer.beta.tar.gz > - The tar ball itself > 2. > http://code.google.com/p/tesseractindic/source/browse/trunk/tesseract... > - The readme file > 3.http://www.youtube.com/watch?v=vuuVwm5ZjkI- YouTube video of the tool > working for Bengali > > I request feedback. > > Thank You, > Debayan Banerjee > NIT Durgapur, India > > -- > Be Intelligent, Use > GNU/Linux.http://debayan.wordpress.comhttp://lug.nitdgp.ac.inhttp://planet-india.randomink.org --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en -~----------~----~----~----~------~----~------~--~---

