Is that an actual input file to tesseract? Is this what you claim 99% accuracy with commercial OCR engines? I am surprised if that's the case, the quality is very bad.
On 19 August 2016 at 21:06, Lucas Alexandre <[email protected]> wrote: > > Hello, > > See the attached file. > > Regards, > Lucas Alexandre > > -----Mensagem original----- > De: Allistair C <[email protected]> > Para: [email protected] > Data: Sexta, 19 de Agosto de 2016 20:45 > Assunto: Re: [tesseract-ocr] Help in read Blue and White image. > > Do you have a sample image? > > Sent from my iPhone > > > On 19 Aug 2016, at 20:33, Lucas Alexandre <[email protected]> > wrote: > > > > > > Hello, > > > > I am a new member of this mailing list. I am creating a small project to > read electronic screens through OCR. In other words, we set up some > equipment that capture > > the VGA output of computers and other devices, and converts the signal > to RCA composite, so I can take pictures and videos of the machine. My idea > is to capture > > BIOS images (Setup) and convert them to text that can be read by > visually impaired users, like me. The fact is that Tesseract does not seem > to understand my images, > > but other commercial OCRs can read almost 99% of the text, with amazing > accuracy. Before purchasing any license these OCRs, I wonder if there is > anything I can > > do > > to make the tesseract is able to read my screens with some precision. > I've tried to make the tesseract tessinput.tif return the file, and the > result is a 1KB file > > with bad picture quality, with completely blurred and distorted letters. > I believe this happens because the tesseract tries to improve the image > internally, but > > ends up destroying it. If there was any option for tesseract not modify > the image, certainly I could best results. Even in commercial OCRs, I can > choose whether > > you want the image to be converted to black and white. > > > > Does anyone have any idea how I can do this by tesseract? I thought > even in trying to recompile the tesseract in a way that it does not alter > the original > image. > > Or is there some binary for Windows or Linux you already have this > capability? > > > > Very grateful. > > > > Sincerely, > > Lucas Alexandre > > > > -- > > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > > To unsubscribe from this group and stop receiving emails from it, send > an email to [email protected]. > > To post to this group, send email to [email protected]. > > Visit this group at https://groups.google.com/group/tesseract-ocr. > > To view this discussion on the web visit https://groups.google.com/d/ms > gid/tesseract-ocr/57b75ef6.9a35ed0a.27a7a.1bb5%40mx.google.com. > > For more options, visit https://groups.google.com/d/optout. > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit https://groups.google.com/d/ms > gid/tesseract-ocr/6FCAE2B5-FF13-4643-9EFA-823A4931A647%40gmail.com. > For more options, visit https://groups.google.com/d/optout. > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit https://groups.google.com/d/ms > gid/tesseract-ocr/6FCAE2B5-FF13-4643-9EFA-823A4931A647%40gmail.com. > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAORW5vib%3DFEmRig6ezTz--Od2K0F2wsRHHBym%2B2A5itJjF-QGQ%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

