Change 20661 by [EMAIL PROTECTED] on 2003/08/12 19:41:50
perlreref tweaks from SADAHIRO Tomoyuki and Ian Truskett.
Affected files ...
... //depot/perl/pod/perlreref.pod#10 edit
Differences ...
==== //depot/perl/pod/perlreref.pod#10 (text) ====
Index: perl/pod/perlreref.pod
--- perl/pod/perlreref.pod#9~20629~ Mon Aug 11 23:24:32 2003
+++ perl/pod/perlreref.pod Tue Aug 12 12:41:50 2003
@@ -104,15 +104,19 @@
[f-j-] Dash escaped or at start or end means 'dash'
[^f-j] Caret indicates "match any character _except_ these"
-The following work within or without a character class:
+The following sequences work within or without a character class.
+The first six are locale aware, all are Unicode aware. The default
+character class equivalent are given. See L<perllocale> and
+L<perlunicode> for details.
- \d A digit, same as [0-9]
- \D A nondigit, same as [^0-9]
- \w A word character (alphanumeric), same as [a-zA-Z0-9_]
- \W A non-word character, [^a-zA-Z0-9_]
- \s A whitespace character, same as [ \t\n\r\f]
- \S A non-whitespace character, [^ \t\n\r\f]
- \C Match a byte (with Unicode, '.' matches char)
+ \d A digit [0-9]
+ \D A nondigit [^0-9]
+ \w A word character [a-zA-Z0-9_]
+ \W A non-word character [^a-zA-Z0-9_]
+ \s A whitespace character [ \t\n\r\f]
+ \S A non-whitespace character [^ \t\n\r\f]
+
+ \C Match a byte (with Unicode, '.' matches a character)
\pP Match P-named (Unicode) property
\p{...} Match Unicode property with long name
\PP Match non-P
@@ -121,21 +125,21 @@
POSIX character classes and their Unicode and Perl equivalents:
- alnum IsAlnum Alphanumeric
- alpha IsAlpha Alphabetic
- ascii IsASCII Any ASCII char
- blank IsSpace [ \t] Horizontal whitespace (GNU)
- cntrl IsCntrl Control characters
- digit IsDigit \d Digits
- graph IsGraph Alphanumeric and punctuation
- lower IsLower Lowercase chars (locale aware)
- print IsPrint Alphanumeric, punct, and space
- punct IsPunct Punctuation
- space IsSpace [\s\ck] Whitespace
- IsSpacePerl \s Perl's whitespace definition
- upper IsUpper Uppercase chars (locale aware)
- word IsWord \w Alphanumeric plus _ (Perl)
- xdigit IsXDigit [\dA-Fa-f] Hexadecimal digit
+ alnum IsAlnum Alphanumeric
+ alpha IsAlpha Alphabetic
+ ascii IsASCII Any ASCII char
+ blank IsSpace [ \t] Horizontal whitespace (GNU extension)
+ cntrl IsCntrl Control characters
+ digit IsDigit \d Digits
+ graph IsGraph Alphanumeric and punctuation
+ lower IsLower Lowercase chars (locale and Unicode aware)
+ print IsPrint Alphanumeric, punct, and space
+ punct IsPunct Punctuation
+ space IsSpace [\s\ck] Whitespace
+ IsSpacePerl \s Perl's whitespace definition
+ upper IsUpper Uppercase chars (locale and Unicode aware)
+ word IsWord \w Alphanumeric plus _ (Perl extension)
+ xdigit IsXDigit [0-9A-Fa-f] Hexadecimal digit
Within a character class:
End of Patch.