Re: \W and [\W]

2004-01-02 Thread Jarkko Hietaniemi
Do negated classes work at all ? What does /[^\w]/ do ? (I looked at this stuff ages ago and I thought unicode classes (including negated ones worked, if that is true then fix may just be the magical \W expander expanding to wrong thing...) I think it's the evil characters in the 0x80..0xFF rang

Re: \W and [\W]

2004-01-02 Thread Nick Ing-Simmons
Eric Cholet <[EMAIL PROTECTED]> writes: >Le 1 janv. 04, Ã 17:50, Rafael Garcia-Suarez a Ãcrit : > >> +(However, and as a limitation of the current implementation, using >> +C<\w> or C<\W> I a C<[...]> character class will still match >> +with byte semantics.) > >I don't think it applies to \w, only

Re: \W and [\W]

2004-01-01 Thread Eric Cholet
Le 1 janv. 04, à 17:50, Rafael Garcia-Suarez a écrit : +(However, and as a limitation of the current implementation, using +C<\w> or C<\W> I a C<[...]> character class will still match +with byte semantics.) I don't think it applies to \w, only \W. \x{df} matches [\w] just fine, as shown in Andrea

Re: \W and [\W]

2004-01-01 Thread Rafael Garcia-Suarez
Andreas J Koenig wrote in perl.unicode : >> On Wed, 31 Dec 2003 16:21:36 +0100, Eric Cholet <[EMAIL PROTECTED]> said: > > > Can anyone enlighten me as to why \W behaves differently depending > > on wether it's inside or outside of a character class, for certain > > characters: > > I have r

Re: \W and [\W]

2004-01-01 Thread Andreas J Koenig
> On Wed, 31 Dec 2003 16:21:36 +0100, Eric Cholet <[EMAIL PROTECTED]> said: > Can anyone enlighten me as to why \W behaves differently depending > on wether it's inside or outside of a character class, for certain > characters: I have reported this as bug 18281 http://guest:[EMAIL PROT

Re: \W and [\W]

2003-12-31 Thread Eric Cholet
Le 31 dÃc. 03, Ã 16:28, [EMAIL PROTECTED] a Ãcrit : Why are you using: use encoding 'utf8'; ? So that, for the sake of keeping the snippet short, Perl would know that my character constant was in utf-8, and that the "print" statements would output utf-8 as well. I typed the source code in an utf

\W and [\W]

2003-12-31 Thread Eric Cholet
Hi list, Can anyone enlighten me as to why \W behaves differently depending on wether it's inside or outside of a character class, for certain characters: This sample program: use encoding 'utf8'; $x = 'GroÃbritannien'; $\ = "\n"; print '1 ', $x =~ /(\W+)/; print '2 ', $x =~ /([\W]+)/; print '3 '