----- Original Message -----
| From: "Tim Chase" <lynx....@tim.thechases.com>
| To: "Thorsten Glaser" <t...@mirbsd.de>
| Cc: "Chime Hart" <ch...@hubert-humphrey.com>, "lynx-dev" <lynx-dev@nongnu.org>
| Sent: Wednesday, March 31, 2021 3:06:12 PM
| Subject: Re: [Lynx-dev] Missing First Letter?
| On 2021-03-31 18:36, Thorsten Glaser wrote:
|> This helps nothing without a way to reproduce this locally,
|> for example a URL in question.
|
| The source seems to have been the text/html component of an email.
| However, here's a reproduction case:
|
| $ xxd chime.html
| 00000000: 3c68 746d 6c3e 3c62 6f64 793e e280 9c48  <html><body>...H
| 00000010: 656c 6c6f 3c2f 626f 6479 3e3c 2f68 746d  ello</body></htm
| 00000020: 6c3e 0a
|
| (if you want to reverse that, you can
|
| $ xxd -r > chime.html << EOF
| 00000000: 3c68 746d 6c3e 3c62 6f64 793e e280 9c48  <html><body>...H
| 00000010: 656c 6c6f 3c2f 626f 6479 3e3c 2f68 746d  ello</body></htm
| 00000020: 6c3e 0a
| EOF
|
| to create "chime.html" exactly)
|
| When viewed in GUI browsers,
|
| $ firefox chime.html
|
| there's a UTF-8 double-quote before the "Hello" as marked by the
| bytes 0xE2, 0x80, 0x9C. However, when I open that document with
| lynx
|
| $ lynx chime.html
|
| it (1) doesn't display the double-quote, and also (2) eats the
| "H" in "Hello". Trying to view-source ("\") on that document also
| manifests the same symptoms.
|
| Hopefully that gives you an adequate test case to break things, too. :-)
|
| -tim

You'll get that behavior if your locale is set to non-UTF-8, e.g., "C"
(using "en_US" rather than "en_US.UTF-8" may also look like this,
depending on the terminal).

-- 
Thomas E. Dickey <dic...@invisible-island.net>
http://invisible-island.net
ftp://ftp.invisible-island.net
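
A quick way to check which situation applies, sketched under the assumption of a POSIX shell with the `locale` utility installed: the snippet rebuilds the three quote bytes from the hex dump followed by "Hello", counts the bytes, and prints the character encoding that the current locale and the "C" locale report. The variable names `sample` and `bytes` are illustrative only.

```shell
# The three bytes from the hex dump (0xE2 0x80 0x9C) as octal escapes, then "Hello".
sample=$(printf '\342\200\234Hello')

# Byte count: three bytes for the quote plus five for "Hello".
bytes=$(printf '%s' "$sample" | LC_ALL=C wc -c | tr -d ' ')
echo "bytes: $bytes"

# Character encoding reported by the current locale settings.
echo "current charmap: $(locale charmap 2>/dev/null || echo unavailable)"

# Character encoding reported under the "C" locale, for comparison.
echo "C locale charmap: $(LC_ALL=C locale charmap 2>/dev/null || echo unavailable)"
```

If the current charmap is not UTF-8, starting lynx under a UTF-8 locale, for example `LC_ALL=en_US.UTF-8 lynx chime.html`, may be worth trying. That locale name is an assumption; it has to appear in the output of `locale -a` on the system in question.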