In perl.git, the branch smoke-me/khw-5.21 has been created
<http://perl5.git.perl.org/perl.git/commitdiff/86e3d236b144399e46761aa7dcc6d393ba9f9e52?hp=0000000000000000000000000000000000000000>
at 86e3d236b144399e46761aa7dcc6d393ba9f9e52 (commit)
- Log -----------------------------------------------------------------
commit 86e3d236b144399e46761aa7dcc6d393ba9f9e52
Author: Karl Williamson <[email protected]>
Date: Fri Sep 5 22:12:21 2014 -0600
smoke
M regcomp.c
commit 9558b81b831ebb43ee148acf12046149d79b75c9
Author: Karl Williamson <[email protected]>
Date: Fri Sep 5 10:15:09 2014 -0600
regcomp.c: Don't doubly do 'use encoding'
When reparsing, values have already been converted (if necessary) to
native, so don't do it again.
M regcomp.c
commit 7f6dd6d53378de4b1366949814035f30ad7f378a
Author: Karl Williamson <[email protected]>
Date: Fri Sep 5 09:45:27 2014 -0600
regcomp.c: Remove extraneous tests
These two messages used to be warnings, but are now errors, so there is
no need to test which pass they are being output in.
M regcomp.c
commit 198e09aa454dc8aa38bd11e7d3a0243e13a636cd
Author: Karl Williamson <[email protected]>
Date: Fri Sep 5 09:34:26 2014 -0600
numeric.c: Comment tweak
M numeric.c
commit 1df52657c68c9b8b688bca07de6d5135a95aabee
Author: Karl Williamson <[email protected]>
Date: Fri Sep 5 09:09:28 2014 -0600
XXXdelta Allow \N{named seq} in qr/[...]/
This commit changes the regex handler so that, in many instances, a
\N{named sequence} in a bracketed character class matches properly.
A named sequence is a string of multiple characters that is given a
single name. Unicode has hundreds of them, like LATIN
CAPITAL LETTER A WITH MACRON AND GRAVE. These are encoded by Unicode
when there is some user community that thinks of the conglomeration as a
single unit, but there was no prior standard that had it so, and it is
possible to encode it in Unicode using other means, typically a sequence
of a base character followed by some combining marks. (If there had not
been such a prior standard, 8859-1, things like LATIN CAPITAL LETTER A
WITH GRAVE would have been put into Unicode this way too.) If they did
not do it this way, they would run out of available code points much
sooner.
Not having these as single characters burdens the programmer who has
to deal with them. Hiding this detail as much as possible makes
it easier to program. This commit hides this in one more place than
previously.
It takes advantage of the infrastructure added some releases ago to
handle the fact that some single characters can match 2 or even 3
characters case-insensitively.
"ss" =~ /[ß]/i;
is the most prominent example.
We discovered earlier that /[^ß]/ leads to unexpected behavior, and it
is also unclear what using one of these sequences as an endpoint of a
range should mean. This commit leaves the existing behavior for those
cases. That behavior is to use just the first code point in the
sequence for regular [...], and to generate a fatal syntax error for
(?[...]).
M lib/diagnostics.t
M pod/perldiag.pod
M regcomp.c
M t/re/pat_advanced.t
M t/re/reg_mesg.t
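As a rough illustration of the idea in the \N{named seq} commit above, a
bracketed class that contains a multi-character sequence can be thought
of as a set of single characters plus a short list of strings that are
tried first. The C sketch below is not the regcomp.c implementation: the
names demo_class and demo_class_match are hypothetical, the single
characters are restricted to ASCII, and the sequence is supplied by the
caller as UTF-8 bytes.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for a compiled [...] class: ordinary members are
 * single ASCII characters; multi-character members (such as a named
 * sequence) are kept as separate UTF-8 strings. */
struct demo_class {
    const char *singles;          /* e.g. "xyz" */
    const char *const *multis;    /* multi-character members */
    size_t n_multis;
};

/* Return how many bytes at the start of s the class matches, or 0 for no
 * match.  Multi-character members are tried before single characters so
 * that a whole sequence is consumed as one unit. */
static size_t demo_class_match(const struct demo_class *cls, const char *s)
{
    size_t i;

    assert(cls != NULL && s != NULL);

    for (i = 0; i < cls->n_multis; i++) {
        size_t len = strlen(cls->multis[i]);
        if (len > 0 && strncmp(s, cls->multis[i], len) == 0)
            return len;
    }
    if (*s != '\0' && strchr(cls->singles, *s) != NULL)
        return 1;
    return 0;
}
```

With the sequence U+0100 U+0300 (LATIN CAPITAL LETTER A WITH MACRON AND
GRAVE) stored as the bytes C4 80 CC 80, a subject string starting with
those four bytes matches as one unit, while U+0100 on its own does not
match.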
commit 80666081502f1b0d47025cd1d21223ead3d356e1
Author: Karl Williamson <[email protected]>
Date: Thu Sep 4 22:48:22 2014 -0600
regcomp.c: Extract out functionality into a function
This is in preparation for it being called from a 2nd place. The code
was merely moved and outdented, and comments moved within the function
and added to.
M embed.fnc
M embed.h
M proto.h
M regcomp.c
commit e0435fad4da8a3a60bb5ed9fc619d16577556029
Author: Karl Williamson <[email protected]>
Date: Wed Sep 3 20:00:28 2014 -0600
regcomp.c: White-space only
Properly indent code in blocks newly formed by the previous commit
M regcomp.c
commit 57d524f07326bcae2275cea1a1279c76470d41c8
Author: Karl Williamson <[email protected]>
Date: Wed Sep 3 19:52:05 2014 -0600
regcomp.c: Refactor func so caller handles anomalies
S_grok_bslash_N() is refactored to not know about the strictness level
required by the caller, and to return things instead so that the caller
can decide what action to take.
This is in preparation for some changes in the caller's behavior in
future commits.
As a side effect, this changes the position shown in a warning message
as the point where parsing stopped or where the problem occurs.
M embed.fnc
M embed.h
M proto.h
M regcomp.c
M t/re/reg_mesg.t
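The general shape of the refactoring described in the commit above, where
the parsing helper reports what it found and the caller decides what to
do about it, might be sketched in C as follows. This is only an
illustration of the pattern: the names demo_grok_braces and
demo_caller_accepts, the result codes, and the particular anomalies
(empty or unterminated braces) are invented here and are not taken from
S_grok_bslash_N().

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical result codes returned to the caller. */
enum demo_n_result { DEMO_N_OK, DEMO_N_EMPTY, DEMO_N_UNTERMINATED };

/* Examine the "{...}" that follows \N.  The helper knows nothing about
 * strictness: it only reports what it saw and the length of the name. */
static enum demo_n_result demo_grok_braces(const char *s, size_t *name_len)
{
    const char *close;

    assert(s != NULL && name_len != NULL);

    *name_len = 0;
    if (*s != '{' || (close = strchr(s, '}')) == NULL)
        return DEMO_N_UNTERMINATED;
    *name_len = (size_t)(close - s - 1);
    return *name_len == 0 ? DEMO_N_EMPTY : DEMO_N_OK;
}

/* The caller applies its own policy to the reported anomaly. */
static bool demo_caller_accepts(const char *s, bool strict)
{
    size_t len;

    switch (demo_grok_braces(s, &len)) {
    case DEMO_N_OK:
        return true;
    case DEMO_N_EMPTY:
        return !strict;     /* a lenient caller tolerates an empty name */
    default:
        return false;       /* unterminated braces are always rejected */
    }
}
```

Keeping the policy in the caller means a second caller with different
rules can reuse the helper unchanged.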
commit d57620bcfc94128564362a6f4e8e9382eb829816
Author: Karl Williamson <[email protected]>
Date: Wed Sep 3 18:28:25 2014 -0600
regcomp.c: Comment clarifications, nits
M regcomp.c
commit 2183d74a38252cb4c677ea2701a4978b6f08bfe5
Author: Karl Williamson <[email protected]>
Date: Wed Sep 3 17:31:39 2014 -0600
regcomp.c: Refactor one area to use common subroutine
By using the inline function append_utf8_from_native_byte(), the details
of this conversion are hidden from here. Since that routine advances
the parsing pointer with each byte, this has to be slightly refactored.
M regcomp.c
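For readers unfamiliar with the conversion mentioned in the commit above,
a helper of this kind can be sketched in C as below. This is a
simplified, hypothetical version (demo_append_utf8_from_byte), not the
Perl source: it assumes an ASCII platform, where the native byte value
equals the code point, and it shows why the caller's pointer moves as a
side effect of each call.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical sketch: append the UTF-8 encoding of one Latin-1 byte to
 * *dest and advance *dest past what was written.  Assumes an ASCII
 * platform, where the native byte value equals the code point. */
static void demo_append_utf8_from_byte(unsigned char byte,
                                       unsigned char **dest)
{
    assert(dest != NULL && *dest != NULL);

    if (byte < 0x80) {
        /* ASCII range: one byte, unchanged */
        *(*dest)++ = byte;
    } else {
        /* 0x80..0xFF: a lead byte followed by one continuation byte */
        *(*dest)++ = (unsigned char)(0xC0 | (byte >> 6));
        *(*dest)++ = (unsigned char)(0x80 | (byte & 0x3F));
    }
}
```

Because the helper advances the pointer itself, a calling loop must not
also increment that pointer, which is the kind of adjustment the commit
message refers to.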
commit 70f00bb3cd69815469b8c3a7cc46453df8926887
Author: Karl Williamson <[email protected]>
Date: Mon Sep 1 20:00:01 2014 -0600
XXXdelta PATCH: [perl #122671] Many warnings in regcomp.c can occur twice
This solves the problem by moving the warnings to be output only in
pass2 of compilation. The problem arises because almost all of pass1
can be repeated under certain circumstances described in the ticket and
the added comments of this patch.
M regcomp.c
M t/lib/warnings/regcomp
M t/re/reg_mesg.t
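The reason for deferring warnings to pass2 can be seen in a toy model.
The C sketch below is not regcomp.c; the state structure, the function
names, and the convention that '?' marks a construct deserving a warning
and '!' marks something that forces pass1 to start over are all
assumptions made for illustration.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical two-pass compiler state. */
struct demo_state {
    int pass;        /* 1 = sizing pass, 2 = emitting pass */
    int warnings;    /* number of warnings emitted so far */
    bool restarted;  /* set once pass 1 has been started over */
};

/* Scan the pattern once.  Returns false if pass 1 must be redone. */
static bool demo_scan(struct demo_state *st, const char *pat)
{
    for (; *pat != '\0'; pat++) {
        /* Warn only in pass 2, which runs exactly once. */
        if (*pat == '?' && st->pass == 2)
            st->warnings++;

        /* The first '!' seen in pass 1 forces a restart of pass 1. */
        if (*pat == '!' && st->pass == 1 && !st->restarted) {
            st->restarted = true;
            return false;
        }
    }
    return true;
}

/* Compile the pattern and return the number of warnings emitted. */
static int demo_compile(const char *pat)
{
    struct demo_state st = { 1, 0, false };

    assert(pat != NULL);

    while (!demo_scan(&st, pat)) {
        /* pass 1 is repeated from the beginning */
    }
    st.pass = 2;
    (void)demo_scan(&st, pat);
    return st.warnings;
}
```

If the warning were raised in pass 1 instead, a pattern such as "a?b!c"
would report it once per repetition of pass 1; raising it in pass 2
yields exactly one report.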
commit b1d686edc4c393bbd407e5d7a4577cc2e09e9f75
Author: Karl Williamson <[email protected]>
Date: Mon Sep 1 18:54:03 2014 -0600
regcomp.c: Don't output same warning twice
This warning was also untested.
M regcomp.c
M t/re/reg_mesg.t
commit 0870297bc89f2e697ed71645edafabeea517b2c1
Author: Karl Williamson <[email protected]>
Date: Mon Sep 1 16:44:38 2014 -0600
regcomp.c: Vertically stack ternary
for legibility
M regcomp.c
commit 37ec4ac9512ebd4cdc3dbfc6c0e776a2099ea4e4
Author: Karl Williamson <[email protected]>
Date: Mon Sep 1 14:57:49 2014 -0600
regcomp.c: Don't prematurely skip error checking
The assertion in the comment changed by this commit was true only for
pass1 of the regex compilation, not pass2. This commit makes it true in
both passes by moving it, and the code it describes, past some error
checking. That error checking was executed in pass1, but not pass2. The
commit also changes the warning to be output only in the second pass,
as part of [perl #122671]. A future commit will fix the others.
M regcomp.c
commit afcc7da1a5407e3402b798d12b2d5a1589f940e6
Author: Karl Williamson <[email protected]>
Date: Mon Sep 1 14:48:02 2014 -0600
regcomp.c: Move comment closer to code it applies to
M regcomp.c
commit 8de8562af5b1c570ac90f12961335b56ae79098d
Author: Karl Williamson <[email protected]>
Date: Thu Aug 28 21:12:39 2014 -0600
regcomp.c: Remove unnecessary cast
The macro does the appropriate cast, and this is slightly more legible.
M regcomp.c
commit fbe83b82f7fbe5cfa977f91afe59a144db0acca9
Author: Karl Williamson <[email protected]>
Date: Tue Aug 26 15:34:25 2014 -0600
regcomp.c: Make macro a lookup
The recently introduced macro isMNEMONIC_CNTRL has a look-up and several
tests in it, which occupy time and space. Since it was only used for
debugging, that did not matter much, but future commits will use it in
more mainline code. This commit changes it to be a single look-up,
using up one of the spare bits available for that purpose in
PL_charclass. There are enough available bits that we aren't likely to
run out, really ever. (We can always add a 2nd word of bits if
necessary.)
M handy.h
M l1_char_class_tab.h
M regcomp.c
M regen/mk_PL_charclass.pl
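The trade-off described in the commit above, replacing several tests
with one table look-up that checks a single bit, can be sketched in C as
follows. The table demo_charclass, the bit DEMO_CC_MNEMONIC_CNTRL, and
the function name are hypothetical, and the set of characters marked
here (\a, \b, \t, \n, \f, \r and ESC) is an assumption chosen for
illustration rather than a statement of what PL_charclass contains. The
sketch uses a function where the commit uses a macro, only so that the
argument range can be checked.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical class bit; a real table would carry many such bits. */
#define DEMO_CC_MNEMONIC_CNTRL (1u << 0)

/* One word of class bits per byte value; unlisted entries are zero. */
static const unsigned int demo_charclass[256] = {
    ['\a'] = DEMO_CC_MNEMONIC_CNTRL,
    ['\b'] = DEMO_CC_MNEMONIC_CNTRL,
    ['\t'] = DEMO_CC_MNEMONIC_CNTRL,
    ['\n'] = DEMO_CC_MNEMONIC_CNTRL,
    ['\f'] = DEMO_CC_MNEMONIC_CNTRL,
    ['\r'] = DEMO_CC_MNEMONIC_CNTRL,
    [0x1B] = DEMO_CC_MNEMONIC_CNTRL,   /* ESC */
};

/* A single look-up and bit test, with no chain of comparisons. */
static bool demo_is_mnemonic_cntrl(int c)
{
    assert(c >= 0 && c <= 255);
    return (demo_charclass[c] & DEMO_CC_MNEMONIC_CNTRL) != 0;
}
```

Adding another class later costs one more bit per entry rather than
another chain of comparisons at every use.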
commit c3a24a1b52a6f2b5d616bc0d0b2d628cdf1596e0
Author: Karl Williamson <[email protected]>
Date: Tue Aug 26 17:29:31 2014 -0600
regcomp.c: Extract functionality into a static function
This is in preparation for it being used in more than one place in a
future commit.
M embed.fnc
M embed.h
M proto.h
M regcomp.c
commit ab87b65efb16452eaae30fd9a463322f2d5b2d50
Author: Karl Williamson <[email protected]>
Date: Thu Aug 28 13:59:01 2014 -0600
XXXcharbits
M regcomp.h
commit 4786ef35ba2048d8854e05ad6d1535ad1484cfda
Author: Karl Williamson <[email protected]>
Date: Tue Jun 17 18:49:53 2014 -0600
XXX partial perlapi text
M perlvars.h
commit 3133a837ca32fa3784e4c9760a1ab2fa8f2a6b8c
Author: Karl Williamson <[email protected]>
Date: Sat May 17 19:37:06 2014 -0600
XXX Don't push. attempt to tell tries everything at compile time
But it appears this is thrown away; have to consult with Yves to see if
it is worth pursuing
M embed.fnc
M embed.h
M perl.h
M proto.h
M regcomp.c
M regcomp.h
M regcomp.sym
-----------------------------------------------------------------------
--
Perl5 Master Repository