Bug#354284: lynx vs. uxterm

2006-02-28 Thread Thomas Dickey
On Tue, Feb 28, 2006 at 07:40:06AM +0100, Dan Jacobson wrote:
 T uxterm should be setting the locale (if your shell unsets them, that
 T would produce the sort of effect you are describing, but is not a bug in 
 lynx).
 $ HOME=/ uxterm #instead of su -, but same effect. OK, will await 
 dist-upgrade.
 Tftp://invisible-island.net/temp/db354284-w3m.png
 Tftp://invisible-island.net/temp/db354284-lynx.png

sorry (my typo):

Feb 26 14:18  image/pngdb354384-lynx.png  9Kb
Feb 26 14:18  image/pngdb354384-w3m.png  8Kb

 == PASV ... done.== RETR db354284-w3m.png ...
 No such file `db354284-w3m.png'. Same with the other. Anyways all you
 need to do is confirm that it looks the same as -dump. No photos
 needed. As I'm sure it does, further testing will have to await my
 next http://jidanni.org/comp/debian/apt-offline/index_en.html run. OK thanks.

-- 
Thomas E. Dickey
http://invisible-island.net
ftp://invisible-island.net


pgp15IZyIjsMl.pgp
Description: PGP signature


Bug#354284: lynx vs. uxterm

2006-02-27 Thread Dan Jacobson
T uxterm should be setting the locale (if your shell unsets them, that
T would produce the sort of effect you are describing, but is not a bug in 
lynx).
$ HOME=/ uxterm #instead of su -, but same effect. OK, will await dist-upgrade.
T  ftp://invisible-island.net/temp/db354284-w3m.png
T  ftp://invisible-island.net/temp/db354284-lynx.png
== PASV ... done.== RETR db354284-w3m.png ...
No such file `db354284-w3m.png'. Same with the other. Anyways all you
need to do is confirm that it looks the same as -dump. No photos
needed. As I'm sure it does, further testing will have to await my
next http://jidanni.org/comp/debian/apt-offline/index_en.html run. OK thanks.


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Bug#354284: lynx vs. uxterm

2006-02-26 Thread Thomas Dickey
On Sun, Feb 26, 2006 at 05:50:09AM +0100, Dan Jacobson wrote:
 Starting over.
 $ uxterm
 $ su -
 # su - nobody
 $ env
 TERM=xterm
 HOME=/ #... etc. No LC_ or LANG stuff.

uxterm should be setting the locale (if your shell unsets them, that
would produce the sort of effect you are describing, but is not a bug
in lynx).

 $ export http_proxy=http://localhost:8080/; #needed for me, WWWOFFLE
 $ wget -O - http://zh.wikipedia.org/wiki/%E8%9A%8A%E5%AD%90 #Chinese looks 
 great!
 $ lynx http://zh.wikipedia.org/wiki/%E8%9A%8A%E5%AD%90
 $ export LYNX_CFG=/etc/lynx-cur/lynx-utf.cfg
 $ lynx http://zh.wikipedia.org/wiki/%E8%9A%8A%E5%AD%90
 Nope. No matter what I do, only 1% of the Chinese characters come thru
 OK, all the rest turn into a tilde mess. wget, w3m work fine.

I don't read Chinese, so it would be understandable if I overlook relatively
minor things, but I don't see any tildes.  Comparing with w3m, the glyphs
look the same.  I made screenshots to show the two, put them in

ftp://invisible-island.net/temp/db354284-w3m.png
ftp://invisible-island.net/temp/db354284-lynx.png

-- 
Thomas E. Dickey
http://invisible-island.net
ftp://invisible-island.net


pgp81pMzG2Qzm.pgp
Description: PGP signature


Bug#354284: lynx vs. uxterm

2006-02-25 Thread Thomas Dickey
On Sat, Feb 25, 2006 at 12:20:14AM +0100, Dan Jacobson wrote:
 Package: lynx-cur
 Version: 2.8.6-17
 Severity: normal
 
 There is something wrong in what lynx sends e.g., uxterm.
 $ lynx -dump http://seba.ulyssis.org/thesis/howto-pinyin.php is fine.
 Nothing to do with raw utf-8 vs. #stuff.

if the dump is fine, then it sounds as if you're talking about the interactive
display - I don't see any tildes in that (after setting the display
character set to UTF-8, of course).

 And why does = show
   Charset: utf-8
 when o shows
   Assumed document character set(!): [iso-8859-1__]

lynx.cfg says

# ASSUME_CHARSET changes the handling of documents which do not 
# explicitly specify a charset.  Normally Lynx assumes that 8-bit 
# characters in those documents are encoded according to iso-8859-1 
# (the official default for the HTTP protocol).  When ASSUME_CHARSET 
# is defined here or by an -assume_charset command line flag is in effect, 
# Lynx will treat documents as if they were encoded accordingly. 
# See above on how this interacts with raw mode and the Display 
# Character Set. 
# ASSUME_CHARSET can also be changed via the 'o'ptions menu but will 
# not be saved as permanent value in user's .lynxrc file to avoid more chaos. 
# 
#ASSUME_CHARSET:iso-8859-1 

 
 -assume_local_charset=UTF-8 no help, -assume_charset=UTF-8 same.
 file:///usr/share/doc/HOWTO/en-html/Unicode-HOWTO-4.html no help.
 
 w3m works fine.
 
 
 -- 
 To UNSUBSCRIBE, email to [EMAIL PROTECTED]
 with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]
-- 
Thomas E. Dickey
http://invisible-island.net
ftp://invisible-island.net


pgpCcOGH52CUO.pgp
Description: PGP signature


Bug#354284: lynx vs. uxterm

2006-02-25 Thread Dan Jacobson
Starting over.
$ uxterm
$ su -
# su - nobody
$ env
TERM=xterm
HOME=/ #... etc. No LC_ or LANG stuff.
$ export http_proxy=http://localhost:8080/; #needed for me, WWWOFFLE
$ wget -O - http://zh.wikipedia.org/wiki/%E8%9A%8A%E5%AD%90 #Chinese looks 
great!
$ lynx http://zh.wikipedia.org/wiki/%E8%9A%8A%E5%AD%90
$ export LYNX_CFG=/etc/lynx-cur/lynx-utf.cfg
$ lynx http://zh.wikipedia.org/wiki/%E8%9A%8A%E5%AD%90
Nope. No matter what I do, only 1% of the Chinese characters come thru
OK, all the rest turn into a tilde mess. wget, w3m work fine.
pstree -Al shows
`-xdm---sh-+-icewm
   |-2*[ssh-agent]
   `-xterm---luit---bash-+-pstree
 `-xterm---luit---bash---su---bash---su---sh
$ xterm -v
XTerm(202)
Well, OK I'll try again with 209 or whatever next time I go to town to
do my dist-upgrade downloads.


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Bug#354284: lynx vs. uxterm

2006-02-24 Thread Dan Jacobson
Package: lynx-cur
Version: 2.8.6-17
Severity: normal

There is something wrong in what lynx sends e.g., uxterm.
$ lynx -dump http://seba.ulyssis.org/thesis/howto-pinyin.php is fine.
Nothing to do with raw utf-8 vs. #stuff.

My investigation concludes that lynx is sending e.g., M-G ~ P when it should be
sending M-G M-^P, where M- means high bit set. Instead of setting the
high bit of ^P and sending that, it sends two characters, ~ and P.

Looking at their bits,
^P   10 0001
P50 0101
~7e 0110
M-^P 90 1001

we see lynx is sending
11000111 0110 010abcde
instead of
11000111  100abcde
where abcde varies with the particular char being sent.
What a screw up?! Looks like some mis-estimate in your to-utf-8
converter for screen presentation.

(pstree shows a xterm--luit connection.)

None of
character_set=UNICODE (UTF-8)
character_set=Transparent
character_set=Chinese
helped.

 Display and Character Set
  Use locale-based character set(!): [ON_]
  Display character set: [UNICODE (UTF-8)]
  Assumed document character set(!): [utf-8___]
  Raw 8-bit (!): [ON_]
also didn't help

And why does = show
  Charset: utf-8
when o shows
  Assumed document character set(!): [iso-8859-1__]

-assume_local_charset=UTF-8 no help, -assume_charset=UTF-8 same.
file:///usr/share/doc/HOWTO/en-html/Unicode-HOWTO-4.html no help.

w3m works fine.


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]