reopen 291735 !
thanks

also sprach Samuel Thibault <[EMAIL PROTECTED]> [2006.08.21.1455 +0200]:
> These got fixed with upload of 0.5.1-5

I don't see how #291735 got fixed:

piper:~> cat test.latin1.html
<b>äöü</b>
piper:~> LC_ALL=C w3m -dump -T 'text/html;charset=iso8859-1' test.latin1.html
<b>???</b>
piper:~> LC_ALL=C w3m -dump -T 'text/html;charset=iso8859-1' test.latin1.html | 
file -
/dev/stdin: ASCII text
piper:~> LC_ALL=de_CH w3m -dump -T 'text/html;charset=iso8859-1' 
test.latin1.html | file -
/dev/stdin: ISO-8859 text

So the output format is determined by the locale, and this makes
somewhat sense: in a C environment, latin1 characters possibly
cannot be printed, so w3m converts them to ?.

However, in an UTF-8 environment, they can definitely be printed,
but:

piper:~> LC_ALL=de_CH.UTF-8 w3m -dump -T 'text/html;charset=iso8859-1' 
test.latin1.html
<b>???</b>
piper:~> LC_ALL=de_CH.UTF-8 w3m -dump -T 'text/html;charset=iso8859-1' 
test.latin1.html | file -
/dev/stdin: ASCII text

I suggest to just use the input charset as output charset. People
can use iconv if they need to translate.

-- 
 .''`.   martin f. krafft <[EMAIL PROTECTED]>
: :'  :  proud Debian developer, author, administrator, and user
`. `'`   http://people.debian.org/~madduck - http://debiansystem.info
  `-  Debian - when you have better things to do than fixing systems

Attachment: signature.asc
Description: Digital signature (GPG/PGP)

Reply via email to