Package: lynx-cur Version: 2.8.8dev.8-1 Severity: normal Using unicode utf-8 is broken. Looking a a Russian site which is utf-8 results in about half of the Russian characters missing (and replaced by escaped codes like ~B, ~D, etc. I use a utf-8 terminal (using the Putty program on a laptop) connected to the serial port of my Linux PC. utf-8 works fine with this setup (but not with lynx). I can read a utf-8 file using "more" or "cat". The text browser elinks works fine and displays all Russian characters.
I observed what was going on by using lynx on a Russian utf-8 site and saving the page. The page display while using lynx was very corrupt but reading the saved page (a file) using "more" is fine. Using vim to read this saved file gives more clues. If I just use vim (which expects an ascii file) I get the same corruption. But if I configure vim for utf-8, it works fine. Thus lynx would display fine if it just sent the raw file to my utf-8 terminal. But instead, it behaves as if it was a 7-bit-byte ascii file and replaces any 8-bit-byte (with the high order bit 1, value >= 80-hex) with an escape code like ~B. Seeing ~B means that the code was control-B with the high order bit set (82-hex). This in utf-8 is the 2nd byte of the pair for Russian character "??". I just typed the Russian letter "t" and you see it doesn't show "t" but .~B and I'm using the Debian reportbug program which is giving the same error as what I'm reporting (probably since it's likely using vim which thinks I'm typing an ascii file). And I was going to also mention that lynx replaces the first byte of the utf-8 pair with the error character as you see, provided the 2nd byte of the pair is an above-ascii character. Here's part of the options menu: Display and Character Set Use locale-based character set(!): [OFF] Use HTML5 charset replacements(!): [OFF] Display character set : [UNICODE (UTF-8)________________] Assumed document character set(!): [iso-8859-1______] Raw 8-bit (!) : [OFF] The problem exists with the above options. If I set Raw to on, it reloads the webpage with the same corruption, and then when I view the options menu Raw is shown as off. Is Raw actually being turned on? I don't know. -- System Information: Debian Release: squeeze/sid APT prefers testing APT policy: (500, 'testing') Architecture: i386 (i586) Kernel: Linux 2.6.32-5-486 Locale: LANG=en_US, LC_CTYPE=en_US (charmap=ISO-8859-15) Shell: /bin/sh linked to /bin/bash Versions of packages lynx-cur depends on: ii debconf [debconf-2.0] 1.5.38 Debian configuration management sy ii libbsd0 0.2.0-1 utility functions from BSD systems ii libc6 2.11.2-11 Embedded GNU C Library: Shared lib ii libgcrypt11 1.4.6-5 LGPL Crypto library - runtime libr ii libgnutls26 2.10.4-2 the GNU TLS library - runtime libr ii libidn11 1.18-1 GNU Libidn library, implementation ii libncursesw5 5.7+20100313-5 shared libraries for terminal hand ii zlib1g 1:1.2.3.4.dfsg-3 compression library - runtime Versions of packages lynx-cur recommends: ii mime-support 3.51-1 MIME files 'mime.types' & 'mailcap Versions of packages lynx-cur suggests: pn lynx-cur-wrapper <none> (no description available) -- Configuration Files: /etc/lynx-cur/lynx.cfg changed: .h1 Auxiliary Facilities .h2 INCLUDE .ex .ex .h2 STARTFILE .ex .h2 HELPFILE .ex HELPFILE:file://localhost/usr/share/doc/lynx-cur/lynx_help/lynx_help_main.html.gz .h2 DEFAULT_INDEX_FILE DEFAULT_INDEX_FILE:http://lynx.isc.org/ .h1 Interaction .h2 GOTOBUFFER .h2 JUMP_PROMPT .h1 Auxiliary Facilities .h2 JUMPFILE .ex .h2 JUMPBUFFER .h1 Internal Behavior .h2 SAVE_SPACE .h2 REUSE_TEMPFILES .h2 LYNX_HOST_NAME .h2 LOCALHOST_ALIAS .ex 2 .h2 LOCAL_DOMAIN .h1 Session support .h2 AUTO_SESSION .h2 SESSION_FILE .h2 SESSION_LIMIT .h1 Character sets .h2 CHARACTER_SET .nf .fi CHARACTER_SET:iso-8859-1 .h2 LOCALE_CHARSET LOCALE_CHARSET:TRUE .h2 ASSUME_CHARSET .h2 ASSUMED_DOC_CHARSET_CHOICE .h2 DISPLAY_CHARSET_CHOICE .ex .ex 4 .h2 ASSUME_LOCAL_CHARSET .h2 PREPEND_CHARSET_TO_SOURCE PREPEND_CHARSET_TO_SOURCE:FALSE .h2 NCR_IN_BOOKMARKS .h2 FORCE_8BIT_TOUPPER .h2 OUTGOING_MAIL_CHARSET .h2 ASSUME_UNREC_CHARSET .h2 PREFERRED_LANGUAGE PREFERRED_LANGUAGE:en .h2 PREFERRED_CHARSET .h2 CHARSETS_DIRECTORY .h2 CHARSET_SWITCH_RULES .h1 Interaction .h2 URL_DOMAIN_PREFIXES .h2 URL_DOMAIN_SUFFIXES .h2 FORMS_OPTIONS .h2 PARTIAL .h2 PARTIAL_THRES .h2 SHOW_KB_RATE .h2 SHOW_KB_NAME .h1 Timeouts .h2 INFOSECS .h2 MESSAGESECS .h2 ALERTSECS .h2 NO_PAUSE .h2 DEBUGSECS .h2 REPLAYSECS .h1 Appearance .h2 USE_SELECT_POPUPS .h2 SHOW_CURSOR SHOW_CURSOR:TRUE .h2 UNDERLINE_LINKS .h2 BOLD_HEADERS .h2 BOLD_H1 .h2 BOLD_NAME_ANCHORS .h1 Internal Behavior .h2 DEFAULT_CACHE_SIZE .h2 DEFAULT_VIRTUAL_MEMORY_SIZE .h2 SOURCE_CACHE .h2 SOURCE_CACHE_FOR_ABORTED .h2 ALWAYS_RESUBMIT_POSTS .h2 TRIM_INPUT_FIELDS .h1 HTML Parsing .h2 NO_ISMAP_IF_USEMAP .h2 SEEK_FRAG_MAP_IN_CUR .h2 SEEK_FRAG_AREA_IN_CUR .h1 CGI scripts .h2 LOCAL_EXECUTION_LINKS_ALWAYS_ON .h2 LOCAL_EXECUTION_LINKS_ON_BUT_NOT_REMOTE LOCAL_EXECUTION_LINKS_ALWAYS_ON:FALSE LOCAL_EXECUTION_LINKS_ON_BUT_NOT_REMOTE:FALSE .h2 TRUSTED_EXEC TRUSTED_EXEC:none .h2 ALWAYS_TRUSTED_EXEC ALWAYS_TRUSTED_EXEC:none .h2 TRUSTED_LYNXCGI TRUSTED_LYNXCGI:none .h2 LYNXCGI_ENVIRONMENT .h2 LYNXCGI_DOCUMENT_ROOT .h1 Cookies .h2 FORCE_SSL_COOKIES_SECURE .h1 Internal Behavior .h2 MAIL_SYSTEM_ERROR_LOGGING .h2 CHECKMAIL .h1 News-groups .h2 NNTPSERVER .h2 LIST_NEWS_NUMBERS .h2 LIST_NEWS_DATES .h2 NEWS_CHUNK_SIZE .h2 NEWS_MAX_CHUNK .h2 NEWS_POSTING .h2 LYNX_SIG_FILE .h1 Bibliographic Protocol (bibp scheme) .h2 BIBP_GLOBAL_SERVER .h2 BIBP_BIBHOST .h1 Interaction .h2 SCROLLBAR .h2 SCROLLBAR_ARROW .h2 USE_MOUSE .h1 HTML Parsing .h2 COLLAPSE_BR_TAGS .h2 TAGSOUP .h1 Cookies .h2 SET_COOKIES .h2 ACCEPT_ALL_COOKIES .h2 COOKIE_ACCEPT_DOMAINS .h2 COOKIE_REJECT_DOMAINS .h2 COOKIE_LOOSE_INVALID_DOMAINS .h2 COOKIE_STRICT_INVALID_DOMAINS .h2 COOKIE_QUERY_INVALID_DOMAINS .h2 MAX_COOKIES_DOMAIN .h2 MAX_COOKIES_GLOBAL .h2 MAX_COOKIES_BUFFER .h2 PERSISTENT_COOKIES PERSISTENT_COOKIES:FALSE .h2 COOKIE_FILE .h2 COOKIE_SAVE_FILE .h1 Mail-related .h2 SYSTEM_MAIL .h2 SYSTEM_MAIL_FLAGS .ex 2 .ex 2 .ex 2 .ex 2 .ex 2 .h2 MAIL_ADRS .h2 USE_FIXED_RECORDS .h1 Keyboard Input .h2 VI_KEYS_ALWAYS_ON .h2 EMACS_KEYS_ALWAYS_ON .h2 DEFAULT_KEYPAD_MODE .h2 NUMBER_LINKS_ON_LEFT .h2 NUMBER_FIELDS_ON_LEFT .h2 DEFAULT_KEYPAD_MODE_IS_NUMBERS_AS_ARROWS .h2 CASE_SENSITIVE_ALWAYS_ON .h1 Auxiliary Facilities .h2 DEFAULT_BOOKMARK_FILE .h2 MULTI_BOOKMARK_SUPPORT .h2 BLOCK_MULTI_BOOKMARKS .h1 Interaction .h2 DEFAULT_USER_MODE .h1 External Programs .h2 DEFAULT_EDITOR .h2 SYSTEM_EDITOR .h3 POSITIONABLE_EDITOR .h1 Proxy .h2 HTTP_PROXY .h2 HTTPS_PROXY .h2 FTP_PROXY .h2 GOPHER_PROXY .h2 NEWSPOST_PROXY .h2 NEWSREPLY_PROXY .h2 NEWS_PROXY .h2 NNTP_PROXY .h2 SNEWSPOST_PROXY .h2 SNEWSREPLY_PROXY .h2 SNEWS_PROXY .h2 WAIS_PROXY .h2 FINGER_PROXY .h2 CSO_PROXY .ex 15 .h2 NO_PROXY .ex .ex .h1 External Programs .h2 PRINTER .h2 DOWNLOADER .h2 UPLOADER .ex 3 .ex .ex .ex .ex .ex .ex .ex .ex 2 .ex .h1 Interaction .h2 NO_DOT_FILES NO_DOT_FILES:FALSE .h1 Internal Behavior .h2 NO_FROM_HEADER .h2 NO_REFERER_HEADER .h1 Internal Behavior .h2 NO_FILE_REFERER .h2 REFERER_WITH_QUERY .h1 Appearance .h2 VERBOSE_IMAGES .h2 MAKE_LINKS_FOR_ALL_IMAGES .h2 MAKE_PSEUDO_ALTS_FOR_INLINES .h2 SUBSTITUTE_UNDERSCORES .h1 Interaction .h2 QUIT_DEFAULT_YES .h1 HTML Parsing .h2 HISTORICAL_COMMENTS .h2 MINIMAL_COMMENTS MINIMAL_COMMENTS:TRUE .h2 SOFT_DQUOTES .h2 STRIP_DOTDOT_URLS .h1 Appearance .h2 ENABLE_SCROLLBACK .h2 SCAN_FOR_BURIED_NEWS_REFS .h2 PREPEND_BASE_TO_SOURCE .h1 External Programs .h2 GLOBAL_EXTENSION_MAP .h2 PERSONAL_EXTENSION_MAP GLOBAL_EXTENSION_MAP:/etc/mime.types PERSONAL_EXTENSION_MAP:.mime.types .h2 SUFFIX_ORDER .h2 SUFFIX .ex .ex .ex .ex .ex 29 SUFFIX:.tgz:application/octet-stream SUFFIX:.deb:application/octet-stream .h2 XLOADIMAGE_COMMAND .h2 VIEWER .ex 7 .h2 GLOBAL_MAILCAP .h2 PERSONAL_MAILCAP GLOBAL_MAILCAP:/etc/mailcap PERSONAL_MAILCAP:.mailcap .h2 PREFERRED_MEDIA_TYPES .h2 PREFERRED_ENCODING .h1 Keyboard Input .h2 KEYBOARD_LAYOUT .h2 KEYMAP .nf .fi .nf .fi .nf .fi .nf .fi .nf .fi .nf .fi .nf .fi .h1 External Programs .h2 CSWING_PATH .h1 Internal Behavior .h2 AUTO_UNCACHE_DIRLISTS .h1 Appearance .h2 LIST_FORMAT .nf .fi .ex .ex .ex .h1 External Programs .h2 DIRED_MENU .nf .fi .h1 Internal Behavior .h2 NONRESTARTING_SIGWINCH .h2 NO_FORCED_CORE_DUMP .h1 Appearance .h2 COLOR .nf .fi COLOR:6:brightred:black .h2 COLOR_STYLE .h2 NESTED_TABLES .h2 ASSUMED_COLOR .h2 DEFAULT_COLORS .h1 External Programs .h2 EXTERNAL .h1 Internal Behavior .h2 RULE .h2 RULESFILE .ex 5 .h1 Appearance .h2 PRETTYSRC .h2 PRETTYSRC_SPEC .nf .fi .ex .ex .ex .h2 HTMLSRC_ATTRNAME_XFORM .h2 HTMLSRC_TAGNAME_XFORM .h2 PRETTYSRC_VIEW_NO_ANCHOR_NUMBERING .h1 HTML Parsing .h2 FORCE_EMPTY_HREFLESS_A .h2 HIDDEN_LINK_MARKER .h2 XHTML_PARSING .h1 Appearance .h2 JUSTIFY .h2 JUSTIFY_MAX_VOID_PERCENT .h1 Interaction .h2 TEXTFIELDS_NEED_ACTIVATION .h2 LEFTARROW_IN_TEXTFIELD_PROMPT .h1 Timeouts .h2 CONNECT_TIMEOUT .h2 READ_TIMEOUT .h1 Internal Behavior .h2 FTP_PASSIVE .h2 ENABLE_LYNXRC .nf .fi .h1 External Programs .h2 BZIP2_PATH .h2 CHMOD_PATH .h2 COMPRESS_PATH .h2 COPY_PATH .h2 GZIP_PATH .h2 INFLATE_PATH .h2 INSTALL_PATH .h2 MKDIR_PATH .h2 MV_PATH .h2 RLOGIN_PATH .h2 RMDIR_PATH .h2 RM_PATH .h2 SETFONT_PATH .h2 TAR_PATH .h2 TELNET_PATH .h2 TN3270_PATH .h2 TOUCH_PATH .h2 UNCOMPRESS_PATH .h2 UNZIP_PATH .h2 UUDECODE_PATH .h2 ZCAT_PATH .h2 ZIP_PATH .h1 Interaction .h2 FORCE_SSL_PROMPT .h2 FORCE_COOKIE_PROMPT .h2 SSL_CERT_FILE SSL_CERT_FILE:/etc/ssl/certs/ca-certificates.crt .h1 Appearance .h2 SCREEN_SIZE .h2 NO_MARGINS .h2 NO_TITLE .h1 External Programs .h2 SYSLOG_REQUESTED_URLS SYSLOG_REQUESTED_URLS:FALSE .h2 SYSLOG_TEXT .h1 Internal Behavior .h2 BROKEN_FTP_RETR .h2 BROKEN_FTP_EPSV .h1 Appearance .h2 FTP_FORMAT .h1 Internal Behavior .h2 STATUS_BUFFER_SIZE EXTERNAL:ftp:w3m %s:TRUE EXTERNAL:file:w3m %s:TRUE EXTERNAL:http:w3m %s:TRUE EXTERNAL:http:wget %s:TRUE EXTERNAL:http:wget -r %s:TRUE EXTERNAL:ftp:netscape %s:TRUE EXTERNAL:file:netscape %s:TRUE EXTERNAL:http:netscape %s:TRUE INCLUDE:/etc/lynx-cur/local.cfg INCLUDE:~/.lynx/colors:COLOR INCLUDE:~/.lynx/keymap:KEYMAP INCLUDE:~/.lynx/viewers:VIEWER INCLUDE:~/.lynx/external:EXTERNAL /etc/lynx-cur/lynx.lss changed: em: bold: brightblue strong: bold: brightred b: bold: red i: bold: brightblue a: bold: black img: dim: brown fig: normal: gray caption: reverse: brown hr: normal: yellow blockquote: normal: brightblue ul: normal: brown address: normal: magenta title: normal: brightmagenta tt: dim: brightmagenta: black h1: bold: yellow: blue label: normal: magenta q: normal: yellow: magenta small: dim: default big: bold: yellow sup: bold: yellow sub: dim: gray li: normal: brightmagenta code: normal: cyan cite: normal: cyan table: normal: brightcyan tr: bold: brown td: normal: default br: normal: default alink: reverse: yellow: black status: reverse: yellow: blue alert: bold: yellow: red whereis: reverse+underline: magenta: cyan menu.bg: normal: black: lightgray menu.frame: normal: black: lightgray menu.entry: normal: lightgray: black menu.n: normal: red: gray menu.active: normal: yellow: black menu.sb: normal: brightred: lightgray forwbackw.arrow:reverse hot.paste: normal: brightred: gray ul.red: underline: brightred ul.blue: bold: brightblue li.red: reverse: red: yellow li.blue: bold: blue strong.a: bold: black: red em.a: reverse: black: blue strong.b: bold: white: red em.b: reverse: white: blue strong.debug: reverse: green font.letter: normal: white: blue input.submit: normal: cyan tr.baone: bold: yellow tr.batwo: bold: green tr.bathree: bold: red link: normal: white link.green: bold: brightgreen link.red: bold: black: red link.blue: bold: white: blue link.toc: bold: black: white link.red.next: bold: red link.red.prev: bold: yellow: red link.blue.prev: bold: yellow: blue link.blue.next: bold: blue link.green.toc: bold: white: green span.htmlsrc_comment:normal: white span.htmlsrc_tag:normal: white span.htmlsrc_attrib:normal: cyan span.htmlsrc_attrval:normal: magenta span.htmlsrc_abracket:normal: white span.htmlsrc_entity:normal: white span.htmlsrc_badseq:normal: red span.htmlsrc_badtag:normal: red span.htmlsrc_badattr:normal: red span.htmlsrc_sgmlspecial:normal: yellow -- debconf information excluded -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org