In perl.git, the branch blead has been updated <http://perl5.git.perl.org/perl.git/commitdiff/eb9e3b14fd77e43b36846b86f8780d23fda50258?hp=8b41fafa92716097162f11c7da4c190dfd29fba5>
- Log ----------------------------------------------------------------- commit eb9e3b14fd77e43b36846b86f8780d23fda50258 Author: Karl Williamson <[email protected]> Date: Fri Oct 24 10:30:59 2014 -0600 recharclass: Corrections and nits M pod/perlrecharclass.pod commit 9ee757bebde5a6d88a438f52147fcd06db283b57 Author: Karl Williamson <[email protected]> Date: Wed Oct 22 12:45:59 2014 -0600 perldelta for Unicode erratum fix cb4ea3e667c17d04aceb8aa99c7314653561f238 M pod/perldelta.pod ----------------------------------------------------------------------- Summary of changes: pod/perldelta.pod | 6 ++++++ pod/perlrecharclass.pod | 12 +++++------- 2 files changed, 11 insertions(+), 7 deletions(-) diff --git a/pod/perldelta.pod b/pod/perldelta.pod index e1330cb..261fc21 100644 --- a/pod/perldelta.pod +++ b/pod/perldelta.pod @@ -363,6 +363,12 @@ files in F<ext/> and F<lib/> are best summarized in L</Modules and Pragmata>. fchmod() and futimes() now set C<$!> when they fail due to being passed a closed file handle. [perl #122703] +=item * + +Perl now comes with a corrected Unicode 7.0 for the erratum issued on +October 21, 2014 (see L<http://www.unicode.org/errata/#current_errata>), +dealing with glyph shaping in Arabic. + =back =head1 Known Problems diff --git a/pod/perlrecharclass.pod b/pod/perlrecharclass.pod index 4ab99ac..c79c9a0 100644 --- a/pod/perlrecharclass.pod +++ b/pod/perlrecharclass.pod @@ -480,9 +480,9 @@ and the character must be explicitly specified, and not be part of a multi-character range (not even as one of its endpoints). (L</Character Ranges> will be explained shortly.) Therefore, - 'ss' =~ /\A[\0-\x{ff}]\z/i # Doesn't match - 'ss' =~ /\A[\0-\N{LATIN SMALL LETTER SHARP S}]\z/i # No match - 'ss' =~ /\A[\xDF-\xDF]\z/i # Matches on ASCII platforms, since + 'ss' =~ /\A[\0-\x{ff}]\z/ui # Doesn't match + 'ss' =~ /\A[\0-\N{LATIN SMALL LETTER SHARP S}]\z/ui # No match + 'ss' =~ /\A[\xDF-\xDF]\z/ui # Matches on ASCII platforms, since # \XDF is LATIN SMALL LETTER SHARP S, # and the range is just a single # element @@ -500,7 +500,7 @@ the class, the entire sequence is matched. For example, matches, because C<\N{TAMIL SYLLABLE KAU}> is a named sequence consisting of the two characters matched against. Like the other -instance where a bracketed class can match multi characters, and for +instance where a bracketed class can match multiple characters, and for similar reasons, the class must not be inverted, and the named sequence may not appear in a range, even one where it is both endpoints. If these happen, it is a fatal error if the character class is within an @@ -543,9 +543,7 @@ C<\t>, and C<\x> are also special and have the same meanings as they do outside a -bracketed character class. (However, inside a bracketed character -class, if C<\N{I<NAME>}> expands to a sequence of characters, only the first -one in the sequence is used, with a warning.) +bracketed character class. Also, a backslash followed by two or three octal digits is considered an octal number. -- Perl5 Master Repository
