In perl.git, the branch smoke-me/khw-turkish has been created
<https://perl5.git.perl.org/perl.git/commitdiff/c7964c061e78c59f5f4fec9f57d173dda2943c26?hp=0000000000000000000000000000000000000000>
at c7964c061e78c59f5f4fec9f57d173dda2943c26 (commit)
- Log -----------------------------------------------------------------
commit c7964c061e78c59f5f4fec9f57d173dda2943c26
Author: Karl Williamson <[email protected]>
Date: Wed Jan 30 17:21:16 2019 -0700
smoke
commit e5f51d143c5e1c175740f58246f46b1d87537b37
Author: Karl Williamson <[email protected]>
Date: Wed Jan 30 15:00:30 2019 -0700
regcomp.c: Fix recent optimization of [...] bug
This bug was introduced in b2296192536090829ba6d2cb367456f4e346dcc6
n 5.29.7. Using /il should not result in looking for a [:posix:] class
that matches the code points given.
commit c1ec7af9297157670794d4415333938aa6859fb8
Author: Karl Williamson <[email protected]>
Date: Wed Jan 30 11:24:12 2019 -0700
pp.c: Don't assume worst case memory needs
Since 5.28, there has been a function that will calculate the expansion
of a string when converted into UTF-8, using per-word operations. This
means it runs 8 times faster than doing this count previously would have
taken.
I've come to believe it is better to calculate how much memory we need
than to overallocate based on worst-case scenarios. This is because in
very large strings, over allocating can lead to unnecessary inefficient
processing.
This commit changes several instances in pp.c where a string needs to be
converted to UTF-8 to not assume the worst case, but instead calculate
what's needed using the faster function.
commit 7f560c7a5c93dfa0f910fa6e9e7bbc30eba6cb1d
Author: Karl Williamson <[email protected]>
Date: Wed Jan 30 11:09:01 2019 -0700
pp.c: Don't use function call for easy copy
Like the previous commit, this code is adding the UTF-8 for a Greek
character to a string. It previously used Copy, but this character is
representable as two bytes in both ASCII and EBCDIC UTF-8, the only
character sets that Perl will ever supports, so we can use the
specialized code that is used most everywhere else for two byte UTF-8
characters, avoiding the function overhead, and having to treat this
character as particularly special.
commit cae464ba1f9b89313f8f28aa4f0492af646b242b
Author: Karl Williamson <[email protected]>
Date: Wed Jan 30 10:52:41 2019 -0700
pp.c: Don't use function call for easy copy
This code is adding the UTF-8 for a Greek character to a string. It
previously used Copy, but this character is representable as two bytes
in both ASCII and EBCDIC UTF-8, the only character sets that Perl will
ever supports, so we can use the specialized code that is used most
everywhere else for two byte UTF-8 characters, avoiding the function
overhead, and having to treat this character as particularly special.
commit 95a61adcff6e0d31bf00149c02a796b62c3288f4
Author: Karl Williamson <[email protected]>
Date: Wed Jan 30 10:35:21 2019 -0700
pp.c: pp_fc(): Simplify
The function being called does everything that the code being eliminated
here did. We just pass the function the final destination instead of a
temporary.
commit f5e87e8df25e7e29aedd9b1628d9ef07954c94d0
Author: Karl Williamson <[email protected]>
Date: Wed Jan 30 10:27:17 2019 -0700
pp.c: White-space, comments only
commit 37b53739d739f5609259bc949bc034d54e53986d
Author: Karl Williamson <[email protected]>
Date: Wed Jan 30 10:02:35 2019 -0700
pp.c: Reorder && order in 'if'
This makes the test most likely to fail be first, and adding an
UNLIKELY() to it, thus saving a conditional in most instances.
commit 49cdf26e59b81773d81ca79006908728c03d2955
Author: Karl Williamson <[email protected]>
Date: Wed Jan 30 09:56:15 2019 -0700
XXX move to /locale ws. locale.c white space only
commit 01da0ec65d5b40c728db81ede3e7147b9d4c13f7
Author: Karl Williamson <[email protected]>
Date: Wed Jan 30 09:08:13 2019 -0700
handy.h: Add comment
commit 2f18184efc45188d1aadb405b6eea5824b1035b5
Author: Karl Williamson <[email protected]>
Date: Tue Jan 29 22:25:03 2019 -0700
t/re/fold_grind.pl: White-space only
Just align some logical or clauses for readability.
commit 2f6a4bb7b66adbc6a9d9949367a7c7635c2cc585
Author: Karl Williamson <[email protected]>
Date: Tue Jan 29 22:02:59 2019 -0700
pp.c: Use faster method to convert to UTF-8
There is a special inline function that's used when converting a single
byte to UTF-8, that is faster than the more general one used prior to
this commit.
commit 02472c60ba605c46d031a1a16bcd5b66e922aa77
Author: Karl Williamson <[email protected]>
Date: Tue Jan 29 22:01:18 2019 -0700
pp.c: Add missing assert
The comments say there is an assert, but it wasn't there.
commit 149c0eb7421489764c82a208e27e816d3a8b0627
Author: Karl Williamson <[email protected]>
Date: Fri Jan 25 09:55:58 2019 -0700
handy.h: White-space only
Vertically align the ternary colon with the question mark above it.
commit a0a5ce2cd9198eaeb5e044e2016352e61ebab736
Author: Karl Williamson <[email protected]>
Date: Wed Jan 23 15:42:35 2019 -0700
handy.h: Add void * casts to memEQ, memNE
This change is to allow these macros to be called without having to do
casting in the call.
commit 5a0e4ae3280cd1eb035406f683f46fb7855bc516
Author: Karl Williamson <[email protected]>
Date: Mon Jan 21 09:46:00 2019 -0700
mktables: Make Turkic 'I' chars problematic
In a Turkic locale, these are problematic because their mappings
cross the 255/256 boundary.
This change has the side effect of causing U+307 to be added to the
problematic list, and it normally really isn't problematic, because in
those locales where U+130 and U+131 are problematic, U+307 isn't used.
But applications could switch in and out of Turkic locales, so it's best
to leave it be considered problematic. The consequences of making this
mark problematic are simply slightly less optimized regex pattern code.
-----------------------------------------------------------------------
--
Perl5 Master Repository