Package: lynx
Version: 2.8.9dev1-2
Severity: normal

When I do "lynx -dump http://raphaelhertzog.com/ >/tmp/dump" with a UTF-8
locale I get a file that is not valid UTF-8:

$ isutf8 /tmp/dump 
/tmp/dump: line 7, char 1, byte offset 23: invalid UTF-8 code

$ head -n 7 /tmp/dump | tail -n 1
   Search this website� Search

This comes from an input field with « value="Search this website…" »

As far as I know "…" is valid and should be converted to the UTF-8
character "…".

Note that the same operation with the C locale will convert that character
to a single dot. So the problem is really only when you use an UTF-8
locale.

Cheers,

-- System Information:
Debian Release: jessie/sid
  APT prefers squeeze-lts
  APT policy: (500, 'squeeze-lts'), (500, 'unstable'), (500, 'testing'), (500, 
'stable'), (500, 'oldstable'), (1, 'experimental')
Architecture: amd64 (x86_64)
Foreign Architectures: i386

Kernel: Linux 3.16-3-amd64 (SMP w/4 CPU cores)
Locale: LANG=fr_FR.utf8, LC_CTYPE=fr_FR.utf8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages lynx depends on:
ii  lynx-cur  2.8.9dev1-2+b1

lynx recommends no packages.

lynx suggests no packages.

-- no debconf information


-- 
To UNSUBSCRIBE, email to [email protected]
with a subject of "unsubscribe". Trouble? Contact [email protected]

Reply via email to