On Thu, Feb 02, 2012 at 10:26:58PM +0100, Branko Čibej wrote:
> On 02.02.2012 21:46, Daniel Shahaf wrote:
> > Stefan Sperling wrote on Thu, Feb 02, 2012 at 21:37:53 +0100:
> >> But what do authors gain? Apart from assurance that they may write
> >> UTF-8 characters in error messages I don't see any advantage.
> > If repos_path, or an fspath argument, isn't ASCII, can hook scripts
> > access/use it even if the envvar isn't set?  (Or svnlook, when called
> > from such a hook script --- which I think you mentioned earlier.)
> 
> Yes. File names are just a string of bytes. As long as the script takes
> care to always quote the arguments ("$foo" instead of just $foo), and
> not interpret them, then what it passes to other tools will be what it
> gets from the server. That's either exactly correct, or totally wrong
> already.

Exactly.

The only problem we have is that 'svn' might try to internally convert
the filename from the "native" encoding. Because of the empty environment
in a hook script it will assume the native encoding is ASCII (LC_CTYPE
isn't set).

So the input conversion will fail if a UTF-8 path containing non-ASCII
characters is passed to 'svn' by the script. There's nothing the script
can do about that without knowing what the locale should be.

So we'll need to provide a way to set LC_CTYPE in the environment
of hooks.

Reply via email to