In perl.git, the branch smoke-me/khw-encode has been created
<http://perl5.git.perl.org/perl.git/commitdiff/4793b68bfbbdaa8aef6343a78d25c187a94ac8cc?hp=0000000000000000000000000000000000000000>
at 4793b68bfbbdaa8aef6343a78d25c187a94ac8cc (commit)
- Log -----------------------------------------------------------------
commit 4793b68bfbbdaa8aef6343a78d25c187a94ac8cc
Author: Karl Williamson <[email protected]>
Date: Tue Nov 1 22:12:51 2016 -0600
later
M embed.fnc
M embed.h
M ext/XS-APItest/t/utf8.t
M proto.h
M utf8.c
commit 1661b45479230024b42d8b9e126965e0809af507
Author: Karl Williamson <[email protected]>
Date: Tue Nov 1 22:13:21 2016 -0600
APItest/t/utf8.t: Correct to uppercase in print
This worked so long as we didn't have hex digits A-F.
M ext/XS-APItest/t/utf8.t
commit fce821fbe2cdad45e564cf54bd9ec652b5bd29b9
Author: Karl Williamson <[email protected]>
Date: Fri Oct 28 05:03:37 2016 -0600
XXX For EBCDIC debug
M utf8.c
commit 26f79a9239960e0a670b83f8d4ca109b6dac4992
Author: Karl Williamson <[email protected]>
Date: Tue Nov 1 22:23:47 2016 -0600
customized
M t/porting/customized.dat
commit 4d9148e22edb2a6f8430a9ccec7c3deaed634027
Author: Karl Williamson <[email protected]>
Date: Tue Oct 18 14:09:43 2016 -0600
pali
M cpan/Encode/Encode.xs
commit 16828e25446533cb1a5078285f6465fd30bccef2
Author: Karl Williamson <[email protected]>
Date: Wed Oct 12 20:33:29 2016 -0600
later
M utf8.h
commit 3190215308d331de2b66db2ac46750ddb7d501f3
Author: Karl Williamson <[email protected]>
Date: Thu Sep 15 09:09:07 2016 -0600
XXX incomplete: Add sv_utf8_decode_flags
M embed.fnc
M embed.h
M proto.h
M sv.c
M sv.h
commit 75d346fcc7066343d9a6021394ecdc69738cc8db
Author: Karl Williamson <[email protected]>
Date: Wed Sep 14 22:40:23 2016 -0600
customized
M t/porting/customized.dat
commit 8f93191b8fb6a4d2b2f93ab1070e8bdd10c2973a
Author: Karl Williamson <[email protected]>
Date: Thu Sep 1 12:20:52 2016 -0600
Use core REPLACEMENT CHARACTER definition
This allows the code to now work on EBCDIC as well.
M cpan/Encode/Encode/encode.h
commit d1568e20c14b15535d1cd459630de7e86ccd697c
Author: Karl Williamson <[email protected]>
Date: Thu Sep 1 12:16:00 2016 -0600
XXX commit msg: Encode.xs: Rmv unused function
M cpan/Encode/Encode.xs
commit 74e374ad84f61dd0dad8445211773a14574d8d52
Author: Karl Williamson <[email protected]>
Date: Thu Sep 1 12:12:39 2016 -0600
Encode.xs: white-space only
M cpan/Encode/Encode.xs
commit f8d2745f848f6b4e64980258afd0fd3575667e72
Author: Karl Williamson <[email protected]>
Date: Thu Sep 1 12:12:06 2016 -0600
XXX maybe more in commit msg: Speed up Encode UTF-8 validation checking
This replaces the current scheme for checking UTF-8 validity by one
in which normal processing doesn't require having to decode the UTF-8
into code points. The copying of characters individually from the input
to the output is changed to be a single operation for each entire span
of valid input at once.
Thus in the normal case, what ends up happening is a tight loop to
check the validity, and then a memmove of the entire input to the
output, then return.
If an error is found, it copies all the valid input before the error,
then handles the character in error, then positions to the next input
position, and repeats the whole process starting from there.
It uses the functionality available from the Perl 5 core to to look at
just the bytes that comprise the UTF-8 to make the determination,
converting to code points only those that are defective some how in
order to display them in warnings and error messages.
Thus, this does not need to know about the intricacies of UTF-8
malformations, relying on the core to handle this.
This cannot be pushed to CPAN until Devel::PPPort has been updated to
implement all the functions now needed.
M cpan/Encode/Encode.pm
M cpan/Encode/Encode.xs
-----------------------------------------------------------------------
--
Perl5 Master Repository