In perl.git, the branch smoke-me/khw-regex has been created

<http://perl5.git.perl.org/perl.git/commitdiff/694147b6bdbd790bcd468b0ae2ace94b0e672ad1?hp=0000000000000000000000000000000000000000>

        at  694147b6bdbd790bcd468b0ae2ace94b0e672ad1 (commit)

- Log -----------------------------------------------------------------
commit 694147b6bdbd790bcd468b0ae2ace94b0e672ad1
Author: Karl Williamson <[email protected]>
Date:   Sat Oct 13 10:00:18 2012 -0600

    regexp.t: Add 'no warnings "utf8";
    
    This .t works fine unless there are failures that it tries to output,
    and the handle hasn't been opened using utf8.  Because we aren't sure if
    that operation works, just turn off warnings.

M       t/re/regexp.t

commit aa3658065b5f9436ed7e67ae7b363231b64b80bb
Author: Karl Williamson <[email protected]>
Date:   Sat Oct 13 09:52:42 2012 -0600

    utf8.h: Correct some values for EBCDIC
    
    It occurred to me that EBCDIC has different maximums for the number of
    bytes a character can occupy.  This moves the definition in utf8.h to
    within an #ifndef EBCDIC, and adds the correct values to utfebcdic.h

M       utf8.h
M       utfebcdic.h

commit 38f7ecb4e30c646b34fb09863c93359ede4584db
Author: Karl Williamson <[email protected]>
Date:   Sat Oct 13 09:20:11 2012 -0600

    regex: White-space, comment only; no code changes
    
    This outdents code that just had its containing block removed, and
    reflows its comments to fill 79 columns; and does some other white space
    adjustments, plus a typo in a comment.

M       regcomp.c
M       regexec.c
M       sv.c

commit 768546135f96488d49361042b132b569c6a4194a
Author: Karl Williamson <[email protected]>
Date:   Sat Oct 13 09:15:37 2012 -0600

    regex: Rename macro to reflect its narrowed use
    
    This macro is now only used under locale; its other use has now been
    removed.  Change the name to reflect its only use.

M       regcomp.c
M       regcomp.h
M       regexec.c

commit 1d3c425d49fc9227091dcf3a4df0ac86891f7d0b
Author: Karl Williamson <[email protected]>
Date:   Sat Oct 13 09:07:05 2012 -0600

    regex: Splice out no longer used array element
    
    A recent commit removed all uses of an array element in the middle of an
    array.  This moves up the elements that followed it.

M       regcomp.c
M       regexec.c

commit f5e70bfbfc9b33f086ad87cd4d26e94bc5d98c25
Author: Karl Williamson <[email protected]>
Date:   Sat Oct 13 08:49:26 2012 -0600

    regex: Remove old code that tried to handle multi-char folds
    
    A recent commit has changed the algorithm used to handle multi-character
    folding in bracketed character classes.  The old code is no longer
    needed.

M       embed.fnc
M       embed.h
M       proto.h
M       regcomp.c
M       regcomp.sym
M       regexec.c
M       regnodes.h

commit 4821ef3702f3d2e0357ce893dde821028cc3422c
Author: Karl Williamson <[email protected]>
Date:   Fri Oct 12 11:42:38 2012 -0600

    regcomp.c: Fix-up indentaion; no code changes
    
    Indent a newly-formed block

M       regcomp.c

commit e24b978fe25c4868610ea643a89105697bfcf4ca
Author: Karl Williamson <[email protected]>
Date:   Thu Oct 11 21:49:31 2012 -0600

    PATCH: [perl #89774] multi-char fold + its fold in char class
    
    The design for handling characters that fold to multiple characters when
    the former are encountered in a bracketed character class is defective.
    The ticket reads, "If a bracketed character class includes a character
    that has a multi-char fold, and it also includes the first character of
    that fold, the multi-char fold will never be matched; just the first
    character of the fold.".   Thus, in the class /[\0-\xff]/i, \xDF will
    never be matched, because its fold is 'ss', the first character of
    which, 's', is also in the class.
    
    The reason the design is defective is that it doesn't allow for
    backtracking and trying the other options.
    
    This commit solves this by effectively rewriting the above to be
    / (?: \xdf | [\0-\xde\xe0-\xff] ) /xi.  And so the backtracking gets
    handled automatcially by the regex engine.

M       embedvar.h
M       intrpvar.h
M       pod/perldelta.pod
M       pod/perlre.pod
M       pod/perlrecharclass.pod
M       regcomp.c
M       sv.c
M       t/re/re_tests

commit 7f62429d27ea2645a9d3f340a322da39bf200309
Author: Karl Williamson <[email protected]>
Date:   Fri Oct 12 11:24:34 2012 -0600

    regen/mk_invlists.pl: Make list for multi-fold chars
    
    This causes charclass_invlists.h to have a new list of all the
    characters whose fold is a sequence of more than one character.

M       charclass_invlists.h
M       regen/mk_invlists.pl

commit b6546165754863fd8eb3bd2363c69047fd24e059
Author: Karl Williamson <[email protected]>
Date:   Fri Oct 12 09:10:10 2012 -0600

    mktables: Add table for chars with multi-char fold
    
    This will be used in a later commit

M       lib/unicore/mktables

commit ebefcf635ddccaae8224cd44688daefae51165a0
Author: Karl Williamson <[email protected]>
Date:   Sat Oct 13 08:31:29 2012 -0600

    regcomp.c: Rename a macro, fix-up comments
    
    This very recently introduced macro's name could be clearer, and it can
    be used in another place, and the comment concerning that is slightly
    inaccurate.

M       regcomp.c
-----------------------------------------------------------------------

--
Perl5 Master Repository

Reply via email to