In perl.git, the branch smoke-me/khw-encode has been created
<http://perl5.git.perl.org/perl.git/commitdiff/e6854ee3aa92e668bf0ded04932ebb4a3900b2ae?hp=0000000000000000000000000000000000000000>
at e6854ee3aa92e668bf0ded04932ebb4a3900b2ae (commit)
- Log -----------------------------------------------------------------
commit e6854ee3aa92e668bf0ded04932ebb4a3900b2ae
Author: Karl Williamson <[email protected]>
Date: Sat Oct 15 11:02:04 2016 -0600
t/op/utf8decode.t: Fix for 32 bit systems
There are extra warnings generated for 32 bit systems because some
things overflow there that don't on 64-bit ones.
M t/op/utf8decode.t
commit 4d3874b0aa373fe839dbc74989aef0ea1c4ce739
Author: Karl Williamson <[email protected]>
Date: Sat Oct 15 11:00:57 2016 -0600
utf8.c: Silence a compiler warning
Some compilers wrongly warn that this is used uninitialized.
M utf8.c
commit 92dcd2f5d97b38cc02598edaeaa32ec23529d977
Author: Karl Williamson <[email protected]>
Date: Wed Oct 12 20:33:29 2016 -0600
later
M utf8.h
commit 43f77a66b5304917899abb0ca042a9ba6a934641
Author: Karl Williamson <[email protected]>
Date: Thu Sep 15 09:09:07 2016 -0600
XXX incomplete: Add sv_utf8_decode_flags
M embed.fnc
M embed.h
M proto.h
M sv.c
M sv.h
commit 6f81448dc2574f379ac36278e339460427320f3f
Author: Karl Williamson <[email protected]>
Date: Wed Sep 14 22:40:23 2016 -0600
customized
M t/porting/customized.dat
commit 4ee9552d5b0157ee430fae6f341da1a3ecf3496d
Author: Karl Williamson <[email protected]>
Date: Thu Sep 1 12:20:52 2016 -0600
Use core REPLACEMENT CHARACTER definition
This allows the code to now work on EBCDIC as well.
M cpan/Encode/Encode/encode.h
commit 22c4f40fa4ed38bf928543fc772e8159b4d88909
Author: Karl Williamson <[email protected]>
Date: Thu Sep 1 12:16:00 2016 -0600
XXX commit msg: Encode.xs: Rmv unused function
M cpan/Encode/Encode.xs
commit 1fb5a578b2ef12c76c4bc65c4290b2a4dfaa541d
Author: Karl Williamson <[email protected]>
Date: Thu Sep 1 12:12:39 2016 -0600
Encode.xs: white-space only
M cpan/Encode/Encode.xs
commit 0d625eb4e542232aa1d4b44ed1ab162318903f6c
Author: Karl Williamson <[email protected]>
Date: Thu Sep 1 12:12:06 2016 -0600
XXX maybe more in commit msg: Speed up Encode UTF-8 validation checking
This replaces the current scheme for checking UTF-8 validity by one
in which normal processing doesn't require having to decode the UTF-8
into code points. The copying of characters individually from the input
to the output is changed to be a single operation for each entire span
of valid input at once.
Thus in the normal case, what ends up happening is a tight loop to
check the validity, and then a memmove of the entire input to the
output, then return.
If an error is found, it copies all the valid input before the error,
then handles the character in error, then positions to the next input
position, and repeats the whole process starting from there.
It uses the functionality available from the Perl 5 core to to look at
just the bytes that comprise the UTF-8 to make the determination,
converting to code points only those that are defective some how in
order to display them in warnings and error messages.
Thus, this does not need to know about the intricacies of UTF-8
malformations, relying on the core to handle this.
This cannot be pushed to CPAN until Devel::PPPort has been updated to
implement all the functions now needed.
M cpan/Encode/Encode.pm
M cpan/Encode/Encode.xs
-----------------------------------------------------------------------
--
Perl5 Master Repository