Public bug reported: Example using non-ASCII apostrophe:
----- $ echo 'This won’t work well' | txt2html <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"> <html xmlns="http://www.w3.org/1999/xhtml"> <head> <title></title> <meta name="generator" content="HTML::TextToHTML v2.53"/> </head> <body> <p>This wonâ<sup>TM</sup>t work well</p> </body> </html> ------ Which displays in web browser as "This wonâ�TMt work well" ProblemType: Bug DistroRelease: Ubuntu 20.04 Package: txt2html 1:2.53-2 ProcVersionSignature: Microsoft 4.4.0-18362.1049-Microsoft 4.4.35 Uname: Linux 4.4.0-18362-Microsoft x86_64 ApportVersion: 2.20.11-0ubuntu27.17 Architecture: amd64 CasperMD5CheckResult: skip Date: Wed Jun 2 19:32:47 2021 PackageArchitecture: all ProcEnviron: SHELL=/bin/bash LANG=C.UTF-8 TERM=xterm-256color PATH=(custom, user) SourcePackage: txt2html UpgradeStatus: Upgraded to focal on 2021-04-17 (46 days ago) ** Affects: txt2html (Ubuntu) Importance: Undecided Status: New ** Tags: amd64 apport-bug focal third-party-packages uec-images -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1930643 Title: program mangles output when input contains Unicode characters To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/txt2html/+bug/1930643/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs