In perl.git, the branch smoke-me/khw-5.21 has been created

<http://perl5.git.perl.org/perl.git/commitdiff/86e3d236b144399e46761aa7dcc6d393ba9f9e52?hp=0000000000000000000000000000000000000000>

        at  86e3d236b144399e46761aa7dcc6d393ba9f9e52 (commit)

- Log -----------------------------------------------------------------
commit 86e3d236b144399e46761aa7dcc6d393ba9f9e52
Author: Karl Williamson <[email protected]>
Date:   Fri Sep 5 22:12:21 2014 -0600

    smoke

M       regcomp.c

commit 9558b81b831ebb43ee148acf12046149d79b75c9
Author: Karl Williamson <[email protected]>
Date:   Fri Sep 5 10:15:09 2014 -0600

    regcomp.c: Don't doubly do 'use encoding'
    
    When reparsing, values have already been converted (if necessary) to
    native, so don't do it again.

M       regcomp.c

commit 7f6dd6d53378de4b1366949814035f30ad7f378a
Author: Karl Williamson <[email protected]>
Date:   Fri Sep 5 09:45:27 2014 -0600

    regcomp.c: Remove extraneous tests
    
    These two messages used to be warnings, but are now errors, so there is
    no need to test which pass they are being output in.

M       regcomp.c

commit 198e09aa454dc8aa38bd11e7d3a0243e13a636cd
Author: Karl Williamson <[email protected]>
Date:   Fri Sep 5 09:34:26 2014 -0600

    numeric.c: Comment tweak

M       numeric.c

commit 1df52657c68c9b8b688bca07de6d5135a95aabee
Author: Karl Williamson <[email protected]>
Date:   Fri Sep 5 09:09:28 2014 -0600

    XXXdelta Allow \N{named seq} in qr/[...]/
    
    This commit changes the regex handler to properly match in many
    instances a \N{named sequence} in a bracketed character class.
    
    A named sequence is one which consists of a string of multiple
    characters but given one name.  Unicode has hundreds of them, like LATIN
    CAPITAL LETTER A WITH MACRON AND GRAVE.  These are encoded by Unicode
    when there is some user community that thinks of the conglomeration as a
    single unit, but there was no prior standard that had it so, and it is
    possible to encode it in Unicode using other means, typically a sequence
    of a base character followed by some combining marks.  (If there had not
    been such a prior standard, 8859-1, things like LATIN CAPITAL LETTER A
    WITH GRAVE would have been put into Unicode this way too.)  If they did
    not do it this way, they would run out of availble code points much
    sooner.
    
    Not having these as single characters adds a burden to the programmer
    having to deal with them.  Hiding this detail as much as possible makes
    it easier to program.  This commit hides this in one more place than
    previously.
    
    It takes advantage of the infrastructure added some releases ago dealing
    with the fact that the match of some single characters
    case-insensitively can be 2 or even 3 characters.
    
      "ss" =~ /[ß]/i;
    
    is the most prominent example.
    
    We earlier discovered that /[^ß]/ leads to unexpected behavior, and
    using one of these sequences as an endpoint in a range is also unclear
    as to what is meant.  This commit leaves existing behavior for those
    cases.  That behavior is to use just the first code point in the
    sequence for regular [...], and to generate a fatal syntax error for
    (?[...]).

M       lib/diagnostics.t
M       pod/perldiag.pod
M       regcomp.c
M       t/re/pat_advanced.t
M       t/re/reg_mesg.t

commit 80666081502f1b0d47025cd1d21223ead3d356e1
Author: Karl Williamson <[email protected]>
Date:   Thu Sep 4 22:48:22 2014 -0600

    regcomp.c: Extract out functionality into a function
    
    This is in preparation for it being called from a 2nd place.  The code
    was merely moved and outdented, and comments moved within the function
    and added to.

M       embed.fnc
M       embed.h
M       proto.h
M       regcomp.c

commit e0435fad4da8a3a60bb5ed9fc619d16577556029
Author: Karl Williamson <[email protected]>
Date:   Wed Sep 3 20:00:28 2014 -0600

    regcomp.c: White-space only
    
    Properly indent code in blocks newly formed by the previous commit

M       regcomp.c

commit 57d524f07326bcae2275cea1a1279c76470d41c8
Author: Karl Williamson <[email protected]>
Date:   Wed Sep 3 19:52:05 2014 -0600

    regcomp.c: Refactor func so caller handles anomalies
    
    S_grok_bslash_N() is refactored to not know about the strictness level
    required by the caller, and to return things instead so that the caller
    can decide what action to take.
    
    This is in preparation for some changes in the caller's behavior in
    future commits.
    
    This has the effect of changing the parsing position or where a problem
    occurs shown in a warning message.

M       embed.fnc
M       embed.h
M       proto.h
M       regcomp.c
M       t/re/reg_mesg.t

commit d57620bcfc94128564362a6f4e8e9382eb829816
Author: Karl Williamson <[email protected]>
Date:   Wed Sep 3 18:28:25 2014 -0600

    regcomp.c: Comment clarifications, nits

M       regcomp.c

commit 2183d74a38252cb4c677ea2701a4978b6f08bfe5
Author: Karl Williamson <[email protected]>
Date:   Wed Sep 3 17:31:39 2014 -0600

    regcomp.c: Refactor one area to use common subroutine
    
    By using the inline function append_utf8_from_native_byte(), the details
    of this conversion are hidden from here.  Since that routine advances
    the parsing pointer with each byte, this has to be slightly refactored.

M       regcomp.c

commit 70f00bb3cd69815469b8c3a7cc46453df8926887
Author: Karl Williamson <[email protected]>
Date:   Mon Sep 1 20:00:01 2014 -0600

    XXXdelta PATCH: [perl #122671] Many warnings in regcomp.c can occur twice
    
    This solves the problem by moving the warnings to be output only in
    pass2 of compilation.  The problem arises because almost all of pass1
    can be repeated under certain circumstances described in the ticket and
    the added comments of this patch.

M       regcomp.c
M       t/lib/warnings/regcomp
M       t/re/reg_mesg.t

commit b1d686edc4c393bbd407e5d7a4577cc2e09e9f75
Author: Karl Williamson <[email protected]>
Date:   Mon Sep 1 18:54:03 2014 -0600

    recomp.c: Don't output same warning twice
    
    This warning was untested for as well.

M       regcomp.c
M       t/re/reg_mesg.t

commit 0870297bc89f2e697ed71645edafabeea517b2c1
Author: Karl Williamson <[email protected]>
Date:   Mon Sep 1 16:44:38 2014 -0600

    regcomp.c: Vertically stack ternary
    
    for legibility

M       regcomp.c

commit 37ec4ac9512ebd4cdc3dbfc6c0e776a2099ea4e4
Author: Karl Williamson <[email protected]>
Date:   Mon Sep 1 14:57:49 2014 -0600

    regcomp.c: Don't prematurely skip error checking
    
    The assertion in the comment changed by this commit was true only for
    pass1 of the regex compilation; not pass2.  This makes it true in both
    passes by moving it, and the code it was about past some error checking.
    This error checking was executed in pass1, but not pass2.  It also
    changes the warning to only be done in the second pass, part of
    [perl #122671].  A future commit will fix the others

M       regcomp.c

commit afcc7da1a5407e3402b798d12b2d5a1589f940e6
Author: Karl Williamson <[email protected]>
Date:   Mon Sep 1 14:48:02 2014 -0600

    regcomp.c: Move comment closer to code it applies to

M       regcomp.c

commit 8de8562af5b1c570ac90f12961335b56ae79098d
Author: Karl Williamson <[email protected]>
Date:   Thu Aug 28 21:12:39 2014 -0600

    regcomp.c: Remove unnecessary cast
    
    The macro does the appropriate cast, and this is slightly more legible.

M       regcomp.c

commit fbe83b82f7fbe5cfa977f91afe59a144db0acca9
Author: Karl Williamson <[email protected]>
Date:   Tue Aug 26 15:34:25 2014 -0600

    regcomp.c: Make macro a lookup
    
    The recently introduced macro isMNEMONIC_CNTRL has a look-up and several
    tests in it, which occupy time and space.  Since it was only used for
    debugging, that did not matter much, but future commits will use it in
    more mainline code.  This commit changes it to be a single look-up,
    using up one of the spare bits available for that purpose in
    PL_charclass.  There are enough available bits that we aren't likely to
    run out, really ever.  (We can always add a 2nd word of bits if
    necessary.)

M       handy.h
M       l1_char_class_tab.h
M       regcomp.c
M       regen/mk_PL_charclass.pl

commit c3a24a1b52a6f2b5d616bc0d0b2d628cdf1596e0
Author: Karl Williamson <[email protected]>
Date:   Tue Aug 26 17:29:31 2014 -0600

    regcomp.c: Extract functionality into a static function
    
    This is in preparation for it being used in more than one place in a
    future commit.

M       embed.fnc
M       embed.h
M       proto.h
M       regcomp.c

commit ab87b65efb16452eaae30fd9a463322f2d5b2d50
Author: Karl Williamson <[email protected]>
Date:   Thu Aug 28 13:59:01 2014 -0600

    XXXcharbits

M       regcomp.h

commit 4786ef35ba2048d8854e05ad6d1535ad1484cfda
Author: Karl Williamson <[email protected]>
Date:   Tue Jun 17 18:49:53 2014 -0600

    XXX partial perlapi text

M       perlvars.h

commit 3133a837ca32fa3784e4c9760a1ab2fa8f2a6b8c
Author: Karl Williamson <[email protected]>
Date:   Sat May 17 19:37:06 2014 -0600

    XXX Don't push. attempt to tell tries everything at compile time
    
    But, it appears this is thrown away, have to consult with Yves to see if
    is worth pursuing

M       embed.fnc
M       embed.h
M       perl.h
M       proto.h
M       regcomp.c
M       regcomp.h
M       regcomp.sym
-----------------------------------------------------------------------

--
Perl5 Master Repository

Reply via email to