On Mon, Jun 21, 2021 at 05:09:53PM +0200, Cédric Hannotier via Lynx-dev wrote: > Hi all, > > I got an HTML email with gb3212 charset. s/3232/2312/
> It seems that lynx is unable to print some characters. yes... I see that the problem is that lynx's conversion of euc-cn to utf-8 is incomplete. This is not a new issue as you can see here: https://lists.nongnu.org/archive/cgi-bin/namazu.cgi?query=gb2312&submit=Search%21&idxname=lynx-dev&max=20&result=normal&sort=date%3Alate > Changing the declared charset to euc-cn gives the same result. gb2312 is equated to euc-cn internally. > Using another browser works (qutebrowser). > First converting that file to utf-8 (using iconv) also works. yes, lynx uses iconv after organizing the characters :-) > Lynx build is from Debian testing: > > Lynx Version 2.9.0dev.6 (05 Sep 2020) > libwww-FM 2.14, SSL-MM 1.4.1, GNUTLS 3.7.0, ncurses 6.2.20201114(wide) > Built on linux-gnu. > > Someone else tested it with both 2.8.9rel.1 debian 3 (from Debian 10) > and 2.9.0dev.6 debian 2, but none of them worked. > > The HTML can be found there: https://ttm.sh/FJP.html > > Regards > -- > > Cédric Hannotier > > _______________________________________________ > Lynx-dev mailing list > Lynx-dev@nongnu.org > https://lists.nongnu.org/mailman/listinfo/lynx-dev -- Thomas E. Dickey <dic...@invisible-island.net> https://invisible-island.net ftp://ftp.invisible-island.net
signature.asc
Description: PGP signature
_______________________________________________ Lynx-dev mailing list Lynx-dev@nongnu.org https://lists.nongnu.org/mailman/listinfo/lynx-dev