Follow-up Comment #3, bug #60287 (project wget):

Thank you. I had not tried those options.

Curiously, the only option that I needed was *--local-encoding=utf8*. The
remote-encoding option did not change the detected URI encoding of CP1252.

*Without --local-encoding=utf8*


Loaded example.com/wget-test.html (size 71).
URI encoding = 'CP1252'
example.com/wget-test.html: merge('http://example.com/wget-test.html',
'space-ok%20cyrillic-not%D0%B3.txt') ->
http://example.com/space-ok%20cyrillic-not%D0%B3.txt
converted 'http://example.com/space-ok%20cyrillic-not%D0%B3.txt' (CP1252) ->
'http://example.com/space-ok cyrillic-notг.txt' (UTF-8)
appending 'http://example.com/space-ok%20cyrillic-not%C3%90%C2%B3.txt' to
urlpos.


*With --local-encoding=utf8*


Loaded example.com/wget-test.html (size 71).
URI encoding = 'utf8'
example.com/wget-test.html: merge('http://example.com/wget-test.html',
'space-ok%20cyrillic-not%D0%B3.txt') ->
http://example.com/space-ok%20cyrillic-not%D0%B3.txt
converted 'http://example.com/space-ok%20cyrillic-not%D0%B3.txt' (utf8) ->
'http://example.com/space-ok cyrillic-notг.txt' (UTF-8)
appending 'http://example.com/space-ok%20cyrillic-not%D0%B3.txt' to urlpos.


Regardless, this still feels like a bug to me. But maybe the issue is just how
wget implements the recursive download and isn't really fixable?

    _______________________________________________________

Reply to this item at:

  <https://savannah.gnu.org/bugs/?60287>

_______________________________________________
  Message sent via Savannah
  https://savannah.gnu.org/


Reply via email to