Ludovic Courtès <[email protected]> writes:
> Using the “lower” regexp class instead of “[a-z]” works:
>
> --8<---------------cut here---------------start------------->8---
> scheme@(guile-user)> (string-match "[[:lower:]]" "w")
> $12 = #("w" (0 . 1))
> --8<---------------cut here---------------end--------------->8---
>
> However, it’s not clear to me whether the “lower” class is supposed to
> be the same for all locales or if we’re just lucky:
>
> http://pubs.opengroup.org/onlinepubs/9699919799/basedefs/V1_chap09.html
>
> Thoughts?
The lower class is much larger than [a-z]. If we only wanted to work
around this particular problem we could explicitly spell out the range,
which would be the same in all locales. (Obviously, that wouldn’t be
pretty.)
But can’t URI parts contain more than those characters? To circumvent
the question whether the lower class is locale dependent we could
generate an explicit range from a charset.
--
Ricardo