On Thursday 12 September 2013 21:34:01 Björn Mattsson wrote: > On 2013-09-12 21:21, Tim Rühsen wrote: > > Am Donnerstag, 12. September 2013, 12:59:00 schrieb Björn Mattsson: > >> Run into a bug in wget last week. > >> Done some digging but can't solve it by my self. > >> > >> If i tries to wget a file containing capital ÅÄÖ they gets coverted > >> wrongly, and åäö works fine. > >> > >> I uses wget -m to backup one of my webb-sites to another machine. Have > >> worked like a cahrm for the last 4-5 years but a couple of week ago one > >> of teh files came down wrong. Thought it was a college that had uploaded > >> something wrong but after some digging it's wget that converts wrongly. > >> > >> I have UTF-8 as charset on my machine. > >> > >> If you want to test/see the problem > >> > >> wget -m http://bmit.se/wget > > > > Just use > > wget --restrict-file-names=nocontrol -m http://bmit.se/wget > > Still the same problem. åäö OK but ÅÄÖ gets wrong.
I switched my environment to UTF-8 now and it seems to work: $ wget --restrict-file-names=nocontrol -m http://bmit.se/wget ... --2013-09-13 09:37:29-- http://bmit.se/wget/test%C3%A5%C3%A4%C3%B6%C3%85%C3%84%C3%96 Reusing existing connection to bmit.se:80. HTTP request sent, awaiting response... 200 OK Length: 0 Saving to: ‘bmit.se/wget/teståäöÅÄÖ’ 2013-09-13 09:37:29 (0.00 B/s) - ‘bmit.se/wget/teståäöÅÄÖ’ saved [0/0] $ ls -la bmit.se/wget/ total 12 drwxr-xr-x 2 oms users 4096 13-09-13 09:37:29 . drwxr-xr-x 3 oms users 4096 13-09-13 09:37:29 .. -rw-r--r-- 1 oms users 120 11-09-13 17:24:38 index.html -rw-r--r-- 1 oms users 0 11-09-13 17:20:53 test -rw-r--r-- 1 oms users 0 11-09-13 17:21:01 teståäöÅÄÖ $ wget --version oms@blitz-lx:~/src/wget/tmp$ wget --version GNU Wget 1.14 built on linux-gnu. +digest +https +ipv6 +iri +large-file +nls -ntlm +opie +ssl/gnutls Please check (and maybe post) wget --version. And check your environment: $ set|egrep 'LANG|LC_' LANG=en_US.UTF-8 LANGUAGE=en_US.UTF-8 LC_ALL=en_US.UTF-8 Regards, Tim
