On Fri, Apr 15, 2005 at 01:01:58PM -0000, Rafael Garcia-Suarez wrote: > Aaron Sherman wrote in perl.perl6.language : > > > > A silly question: is there a canonical character set from which we > > extract these ranges? Are we hard-coding Unicode here, or is there some > > way for the user to specify the character set for ranges? > > Perl 5 forces [a-z] (or [i-j] for that matter) to be a range of > lowercase alphabetic characters, even on EBCDIC platforms (where it's > not).
At the moment, PGE (the part that implements the rule engine) is deferring such questions to Parrot, and otherwise assuming Unicode. Plus, S02 explicitly indicates that Perl is written in Unicode and has consistent Unicode semantics, so I think that's what we should go with. It's certainly the way the compiler will go, at least initially. Pm