David Coon wrote:
> Hello everyone,
> I thought I'd introduce myself to you all, as I intend to start helping
> out with wget.  This will be my first time contributing to any kind of
> free or open source software, so I may have some basic questions down
> the line about best practices and such, though I'll try to keep that to
> a minimum.
> Anyway, I've been researching unicode and utf-8 recently, so I'm gonna
> try to tackle bug #21793 <https://savannah.gnu.org/bugs/?21793>. 

Hi David, and welcome!

If you haven't already, please see

I'd encourage you to get a Savannah account, so I can assign that bug to
you. Also, I tend to hang out quite a bit on IRC (#wget @
irc.freenode.net), so you might want to sign on there.

Since you mentioned an interest in Unicode and UTF-8, you might want to
check out Saint Xavier's recent work on IRI and IDN support in Wget,
which is available at http://hg.addictivecode.org/wget/sxav/.

Among other things, sxav's additions make Wget more aware of the user's
locale, so it might be useful for providing a feature to automatically
transcode filenames to the user's locale, rather than supporting only
UTF-8 (which should probably still remain an explicit option). If
that sounds like the direction you'd like to take it, you should
probably base your work on sxav's repository, rather than mainline.
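
To make the transcoding idea a bit more concrete, here is a rough sketch
in Python. It is not Wget code (Wget is written in C, where iconv(3) and
nl_langinfo(CODESET) would play the same roles), and the name
transcode_filename and its parameters are made up for illustration. It
assumes the remote filename arrives as UTF-8 bytes and falls back to the
original bytes when the name cannot be represented in the target charset:

```python
import locale
from typing import Optional


def transcode_filename(raw_name: bytes,
                       source_encoding: str = "utf-8",
                       target_encoding: Optional[str] = None) -> bytes:
    """Re-encode a filename from source_encoding to the locale's charset.

    Hypothetical helper: returns raw_name unchanged if it is not valid in
    source_encoding or cannot be represented in target_encoding.
    """
    if target_encoding is None:
        # Ask the C library which charset the user's locale uses.
        target_encoding = locale.getpreferredencoding(False)
    try:
        text = raw_name.decode(source_encoding)
        return text.encode(target_encoding)
    except (UnicodeDecodeError, UnicodeEncodeError, LookupError):
        # Undecodable input, unrepresentable characters, or an unknown
        # charset name: keep the bytes exactly as they were received.
        return raw_name
```

Passing target_encoding="utf-8" explicitly would correspond to the
UTF-8-only behaviour, while leaving it unset picks up whatever the
user's locale says.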

