I am running the command "Tesseract image.jpg output -l eng -psm 6", which generates the file output.txt. The generated file is reported as ANSI encoded. However, UTF-8 encoding is desired.
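
One thing worth checking, offered as an assumption rather than something confirmed in this thread: many Windows editors label a file "ANSI" when it has no byte order mark (BOM) and contains only ASCII bytes, and such a file is also valid UTF-8. The Python sketch below inspects the raw bytes of output.txt to see which case applies, and can optionally prepend a UTF-8 BOM so editors detect the encoding. The helper names describe_encoding and add_utf8_bom are hypothetical, not part of Tesseract.

```python
import codecs
from pathlib import Path


def describe_encoding(data: bytes) -> str:
    """Classify raw bytes as BOM-marked UTF-8, plain ASCII, BOM-less UTF-8, or unknown."""
    if data.startswith(codecs.BOM_UTF8):
        return "utf-8 with BOM"
    try:
        data.decode("ascii")
        # Pure ASCII is byte-for-byte identical in "ANSI" code pages and UTF-8.
        return "ascii"
    except UnicodeDecodeError:
        pass
    try:
        data.decode("utf-8")
        return "utf-8"
    except UnicodeDecodeError:
        return "unknown"


def add_utf8_bom(path: str) -> None:
    """Prepend a UTF-8 BOM to the file if it is valid UTF-8 and has no BOM yet."""
    p = Path(path)
    data = p.read_bytes()
    if data.startswith(codecs.BOM_UTF8):
        return
    data.decode("utf-8")  # raises UnicodeDecodeError if the file is not UTF-8
    p.write_bytes(codecs.BOM_UTF8 + data)


if __name__ == "__main__":
    # Assumes output.txt (from the command above) is in the current directory.
    target = Path("output.txt")
    if target.exists():
        print(describe_encoding(target.read_bytes()))
```

If this reports "ascii" or "utf-8", the bytes are already valid UTF-8 and the "ANSI" label most likely comes from the editor's guess; calling add_utf8_bom("output.txt") is one way to make that guess unambiguous, at the cost of a BOM that some other tools may not expect.
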
On Thursday, 9 January 2020 14:07:51 UTC+5:30, Manankumar Bhatt wrote:
> Hello there,
>
> I have been using Tesseract 4. When running it from the command line, I am
> getting the output text file as "ANSI" encoded instead of "UTF-8".
>
> I have tried creating a new file and saving it with UTF-8 encoding. But when
> I run Tesseract from the command line, it generates an ANSI-encoded file by default.
>
> Can you please help with this?
>
> Thanks,
> Manan Bhatt

