this is not tesseract problem: https://ask.libreoffice.org/en/question/97993/why-doesnt-lo-writer-open-and-save-text-documents-encoded-in-utf-8-without-bom-any-plans-to-fix-this-soon/
tesseract output is UTF-8 encoded. Zdenko pi 29. 6. 2018 o 19:37 Martin Jenniges <[email protected]> napísal(a): > Hello, > > when I use the TXT-File, which was created from Tesseract in > Windows-Cmd, with Libre Office Writer: the German Spezial Character üöä > ect are wrong. > > I help me, with open the txt-foöe with Notepad++ and copy and paste the > text in Writer. > > Can I do anything, that Libre Office Writer open the txt-file with the > correct Characters ? > > Thank You for your Answers! > > See regard > > Martin Jenniges > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/27334e1f-beae-3a97-4bde-02bc45d18c0e%40skynet.be > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8xrafbB-WeQOAQaNWYjQ-1SKnEaLjrqopKbWBOrMVfDYw%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

