In perl.git, the branch smoke-me/khw-regex has been created
<http://perl5.git.perl.org/perl.git/commitdiff/694147b6bdbd790bcd468b0ae2ace94b0e672ad1?hp=0000000000000000000000000000000000000000>
at 694147b6bdbd790bcd468b0ae2ace94b0e672ad1 (commit)
- Log -----------------------------------------------------------------
commit 694147b6bdbd790bcd468b0ae2ace94b0e672ad1
Author: Karl Williamson <[email protected]>
Date: Sat Oct 13 10:00:18 2012 -0600
regexp.t: Add 'no warnings "utf8";'
This .t works fine unless it has failures to output and the output
handle hasn't been opened using utf8. Because we aren't sure if that
operation works, just turn off warnings.
M t/re/regexp.t
commit aa3658065b5f9436ed7e67ae7b363231b64b80bb
Author: Karl Williamson <[email protected]>
Date: Sat Oct 13 09:52:42 2012 -0600
utf8.h: Correct some values for EBCDIC
It occurred to me that EBCDIC has different maximums for the number of
bytes a character can occupy. This moves the definition in utf8.h to
within an #ifndef EBCDIC, and adds the correct values to utfebcdic.h.
M utf8.h
M utfebcdic.h
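
The header arrangement described in the utf8.h commit above can be sketched
roughly as follows. This is only an illustration: DEMO_MAXBYTES,
demo_buffer_size() and both numeric values are hypothetical placeholders,
not the real macro names or values in utf8.h or utfebcdic.h.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical sketch. In a real tree the EBCDIC branch would sit in its
 * own platform header; both numbers below are placeholders only. */
#ifdef EBCDIC
#  define DEMO_MAXBYTES 5   /* placeholder for the EBCDIC maximum */
#endif

#ifndef EBCDIC
#  define DEMO_MAXBYTES 4   /* placeholder for the non-EBCDIC maximum */
#endif

/* Bytes needed to hold nchars encoded characters plus a trailing NUL,
 * using whichever maximum was selected at compile time. */
static size_t demo_buffer_size(size_t nchars)
{
    /* Guard against overflow in the multiplication below. */
    assert(nchars <= ((size_t)-1 - 1) / DEMO_MAXBYTES);
    return nchars * DEMO_MAXBYTES + 1;
}
```

Callers size buffers through the macro, so they pick up the value for their
platform without any change to their own code.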
commit 38f7ecb4e30c646b34fb09863c93359ede4584db
Author: Karl Williamson <[email protected]>
Date: Sat Oct 13 09:20:11 2012 -0600
regex: White-space, comment only; no code changes
This outdents code that just had its containing block removed, and
reflows its comments to fill 79 columns; and does some other white space
adjustments, plus a typo in a comment.
M regcomp.c
M regexec.c
M sv.c
commit 768546135f96488d49361042b132b569c6a4194a
Author: Karl Williamson <[email protected]>
Date: Sat Oct 13 09:15:37 2012 -0600
regex: Rename macro to reflect its narrowed use
This macro is now only used under locale; its other use has now been
removed. Change the name to reflect its only use.
M regcomp.c
M regcomp.h
M regexec.c
commit 1d3c425d49fc9227091dcf3a4df0ac86891f7d0b
Author: Karl Williamson <[email protected]>
Date: Sat Oct 13 09:07:05 2012 -0600
regex: Splice out no longer used array element
A recent commit removed all uses of an array element in the middle of an
array. This moves up the elements that followed it.
M regcomp.c
M regexec.c
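
As a rough runtime analogue of splicing an unused element out of the middle
of an array, consider the sketch below. The commit itself presumably edits
the array and its indices in the source; demo_splice_out() is a hypothetical
helper used only to show how the following elements move up by one.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: remove arr[gone] from arr[0 .. *len-1] and move
 * every later element up one slot. Out-of-range indices are ignored. */
static void demo_splice_out(int *arr, size_t *len, size_t gone)
{
    assert(arr != NULL && len != NULL);
    if (gone >= *len)
        return;
    /* memmove is used because source and destination overlap. */
    memmove(arr + gone, arr + gone + 1,
            (*len - gone - 1) * sizeof arr[0]);
    (*len)--;
}
```

Any index that referred to an element after the removed one has to be
reduced by one, which is why such a change touches every file that uses
those indices.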
commit f5e70bfbfc9b33f086ad87cd4d26e94bc5d98c25
Author: Karl Williamson <[email protected]>
Date: Sat Oct 13 08:49:26 2012 -0600
regex: Remove old code that tried to handle multi-char folds
A recent commit has changed the algorithm used to handle multi-character
folding in bracketed character classes. The old code is no longer
needed.
M embed.fnc
M embed.h
M proto.h
M regcomp.c
M regcomp.sym
M regexec.c
M regnodes.h
commit 4821ef3702f3d2e0357ce893dde821028cc3422c
Author: Karl Williamson <[email protected]>
Date: Fri Oct 12 11:42:38 2012 -0600
regcomp.c: Fix-up indentation; no code changes
Indent a newly-formed block.
M regcomp.c
commit e24b978fe25c4868610ea643a89105697bfcf4ca
Author: Karl Williamson <[email protected]>
Date: Thu Oct 11 21:49:31 2012 -0600
PATCH: [perl #89774] multi-char fold + its fold in char class
The design for handling characters that fold to multiple characters when
the former are encountered in a bracketed character class is defective.
The ticket reads, "If a bracketed character class includes a character
that has a multi-char fold, and it also includes the first character of
that fold, the multi-char fold will never be matched; just the first
character of the fold." Thus, in the class /[\0-\xff]/i, \xDF will
never be matched, because its fold is 'ss', the first character of
which, 's', is also in the class.
The reason the design is defective is that it doesn't allow for
backtracking and trying the other options.
This commit solves this by effectively rewriting the above to be
/ (?: \xdf | [\0-\xde\xe0-\xff] ) /xi, so the backtracking gets
handled automatically by the regex engine.
M embedvar.h
M intrpvar.h
M pod/perldelta.pod
M pod/perlre.pod
M pod/perlrecharclass.pod
M regcomp.c
M sv.c
M t/re/re_tests
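
The difference between the two designs in the [perl #89774] commit above can
be illustrated with a toy model. This is not the regex engine's code: both
functions are hypothetical, work on ASCII only, and model an anchored match
in which the class contains 's' plus a character whose fold is "ss", with the
target string already folded.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Stand-in for "the rest of the pattern": here, just end of string. */
static int demo_at_end(const char *s)
{
    return *s == '\0';
}

/* Old design (toy): the class consumes exactly one character and there
 * is no other option to fall back on. */
static int demo_match_class_only(const char *s)
{
    assert(s != NULL);
    return s[0] == 's' && demo_at_end(s + 1);
}

/* New design (toy): behaves like (?: ss | [s] ) followed by the rest of
 * the pattern. If the first alternative fails, the second is tried. */
static int demo_match_alternation(const char *s)
{
    assert(s != NULL);
    /* Alternative 1: the multi-character fold "ss". */
    if (strncmp(s, "ss", 2) == 0 && demo_at_end(s + 2))
        return 1;
    /* Alternative 2, reached by backtracking: the single-char class. */
    return s[0] == 's' && demo_at_end(s + 1);
}
```

With the folded target "ss", the class-only version consumes one 's' and then
fails at the anchor, while the alternation version succeeds through its first
branch and still accepts a lone "s" through its second.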
commit 7f62429d27ea2645a9d3f340a322da39bf200309
Author: Karl Williamson <[email protected]>
Date: Fri Oct 12 11:24:34 2012 -0600
regen/mk_invlists.pl: Make list for multi-fold chars
This causes charclass_invlists.h to have a new list of all the
characters whose fold is a sequence of more than one character.
M charclass_invlists.h
M regen/mk_invlists.pl
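
For readers unfamiliar with inversion lists, here is a minimal lookup
sketch. It assumes the plain textbook layout (a sorted array in which
even-indexed entries start a range inside the set and odd-indexed entries
start a range outside it); the generated header may carry extra fields. The
names and the tiny sample list are hypothetical, not the generated data.

```c
#include <assert.h>
#include <stddef.h>

/* Tiny illustrative subset: U+00DF, and the ligatures U+FB00..U+FB06. */
static const unsigned long demo_multi_fold[] = {
    0x00DF, 0x00E0,   /* range [U+00DF, U+00E0) */
    0xFB00, 0xFB07    /* range [U+FB00, U+FB07) */
};

/* Return 1 if cp is in the set described by the inversion list. */
static int demo_invlist_contains(const unsigned long *list, size_t n,
                                 unsigned long cp)
{
    size_t lo = 0, hi = n;

    assert(list != NULL);
    /* Binary search for the number of entries that are <= cp. */
    while (lo < hi) {
        size_t mid = lo + (hi - lo) / 2;
        if (list[mid] <= cp)
            lo = mid + 1;
        else
            hi = mid;
    }
    /* An odd count means cp lies at or after a range start and before
     * the matching range end, so it is in the set. */
    return (lo % 2) == 1;
}
```

The representation keeps membership tests logarithmic in the number of
ranges and stores long runs of code points in two entries.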
commit b6546165754863fd8eb3bd2363c69047fd24e059
Author: Karl Williamson <[email protected]>
Date: Fri Oct 12 09:10:10 2012 -0600
mktables: Add table for chars with multi-char fold
This will be used in a later commit.
M lib/unicore/mktables
commit ebefcf635ddccaae8224cd44688daefae51165a0
Author: Karl Williamson <[email protected]>
Date: Sat Oct 13 08:31:29 2012 -0600
regcomp.c: Rename a macro, fix-up comments
The name of this very recently introduced macro could be clearer, the
macro can be used in another place, and the comment concerning that is
slightly inaccurate.
M regcomp.c
-----------------------------------------------------------------------
--
Perl5 Master Repository