Re: [tesseract-ocr] Re: Ground Truth from Box Files

2020-04-24 Thread Shree Devi Kumar
On Sat, Apr 25, 2020 at 2:13 AM Peyi Oyelo wrote: > @shree hello sir/maam? > Maam :-) > > On Wednesday, April 22, 2020 at 7:23:28 AM UTC-7, Peyi Oyelo wrote: >> >> I created the akan.traineddata using the typical tesseract 3 legacy >> workflow. >> > OK. The box/tiff pairs work for creating

Re: [tesseract-ocr] Re: Ground Truth from Box Files

2020-04-24 Thread Peyi Oyelo
@shree hello sir/maam? On Wednesday, April 22, 2020 at 7:23:28 AM UTC-7, Peyi Oyelo wrote: > > I created the akan.traineddata using the typical tesseract 3 legacy > workflow. I do not have word/freq/punc lists. As of now I would like to > train using lstm to support as many fonts i.e. 45000