In perl.git, the branch smoke-me/khw-encode has been created


        at  e6854ee3aa92e668bf0ded04932ebb4a3900b2ae (commit)

- Log -----------------------------------------------------------------
commit e6854ee3aa92e668bf0ded04932ebb4a3900b2ae
Author: Karl Williamson <>
Date:   Sat Oct 15 11:02:04 2016 -0600

    t/op/utf8decode.t: Fix for 32 bit systems

    Extra warnings are generated on 32-bit systems because some things
    overflow there that don't on 64-bit ones.

M       t/op/utf8decode.t

commit 4d3874b0aa373fe839dbc74989aef0ea1c4ce739
Author: Karl Williamson <>
Date:   Sat Oct 15 11:00:57 2016 -0600

    utf8.c: Silence a compiler warning

    Some compilers wrongly warn that this is used uninitialized.

M       utf8.c

commit 92dcd2f5d97b38cc02598edaeaa32ec23529d977
Author: Karl Williamson <>
Date:   Wed Oct 12 20:33:29 2016 -0600


M       utf8.h

commit 43f77a66b5304917899abb0ca042a9ba6a934641
Author: Karl Williamson <>
Date:   Thu Sep 15 09:09:07 2016 -0600

    XXX incomplete: Add sv_utf8_decode_flags

M       embed.fnc
M       embed.h
M       proto.h
M       sv.c
M       sv.h

commit 6f81448dc2574f379ac36278e339460427320f3f
Author: Karl Williamson <>
Date:   Wed Sep 14 22:40:23 2016 -0600


M       t/porting/customized.dat

commit 4ee9552d5b0157ee430fae6f341da1a3ecf3496d
Author: Karl Williamson <>
Date:   Thu Sep 1 12:20:52 2016 -0600

    Use core REPLACEMENT CHARACTER definition

    This allows the code to now work on EBCDIC as well.

M       cpan/Encode/Encode/encode.h

commit 22c4f40fa4ed38bf928543fc772e8159b4d88909
Author: Karl Williamson <>
Date:   Thu Sep 1 12:16:00 2016 -0600

    XXX commit msg: Encode.xs: Rmv unused function

M       cpan/Encode/Encode.xs

commit 1fb5a578b2ef12c76c4bc65c4290b2a4dfaa541d
Author: Karl Williamson <>
Date:   Thu Sep 1 12:12:39 2016 -0600

    Encode.xs: white-space only

M       cpan/Encode/Encode.xs

commit 0d625eb4e542232aa1d4b44ed1ab162318903f6c
Author: Karl Williamson <>
Date:   Thu Sep 1 12:12:06 2016 -0600

    XXX maybe more in commit msg: Speed up Encode UTF-8 validation checking

    This replaces the current scheme for checking UTF-8 validity by one
    in which normal processing doesn't require having to decode the UTF-8
    into code points.  The copying of characters individually from the input
    to the output is changed to be a single operation for each entire span
    of valid input at once.

    Thus in the normal case, what ends up happening is a tight loop to
    check the validity, and then a memmove of the entire input to the
    output, then return.

    If an error is found, it copies all the valid input before the error,
    then handles the character in error, then positions to the next input
    position, and repeats the whole process starting from there.

    It uses the functionality available from the Perl 5 core to look at
    just the bytes that comprise the UTF-8 to make the determination,
    converting to code points only those that are defective somehow in
    order to display them in warnings and error messages.

    Thus, this does not need to know about the intricacies of UTF-8
    malformations, relying on the core to handle this.

    This cannot be pushed to CPAN until Devel::PPPort has been updated to
    implement all the functions now needed.

M       cpan/Encode/
M       cpan/Encode/Encode.xs
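
The span-based approach described in the commit message above can be
sketched roughly as follows. This is only an illustration under stated
assumptions, not the code in Encode.xs: the names valid_seq_len and
copy_valid_spans are hypothetical, the validity check follows the strict
Unicode definition of well-formed UTF-8 instead of calling into the Perl
core, and each malformed byte is simply replaced with U+FFFD, whereas the
real code chooses how to handle the character in error.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: return the length of the well-formed UTF-8
 * sequence starting at s, or 0 if it is malformed or truncated.
 * Only the bytes are inspected; no code point is computed. */
static size_t valid_seq_len(const unsigned char *s, const unsigned char *end)
{
    unsigned char b0 = s[0];
    unsigned char lo = 0x80, hi = 0xBF;   /* allowed range of the 2nd byte */
    size_t len, i;

    if (b0 < 0x80)
        return 1;                         /* ASCII */
    if (b0 >= 0xC2 && b0 <= 0xDF) {
        len = 2;
    } else if (b0 >= 0xE0 && b0 <= 0xEF) {
        len = 3;
        if (b0 == 0xE0) lo = 0xA0;        /* rejects overlong forms */
        else if (b0 == 0xED) hi = 0x9F;   /* rejects surrogates */
    } else if (b0 >= 0xF0 && b0 <= 0xF4) {
        len = 4;
        if (b0 == 0xF0) lo = 0x90;        /* rejects overlong forms */
        else if (b0 == 0xF4) hi = 0x8F;   /* rejects above U+10FFFF */
    } else {
        return 0;                         /* stray continuation or bad start */
    }

    if ((size_t)(end - s) < len)
        return 0;                         /* truncated sequence */
    if (s[1] < lo || s[1] > hi)
        return 0;
    for (i = 2; i < len; i++)
        if (s[i] < 0x80 || s[i] > 0xBF)
            return 0;
    return len;
}

/* Copy n bytes from src to dst, replacing each malformed byte with
 * U+FFFD.  dst must have room for 3 * n bytes.  Returns the number of
 * bytes written. */
size_t copy_valid_spans(unsigned char *dst, const unsigned char *src, size_t n)
{
    const unsigned char *s = src, *end = src + n;
    unsigned char *d = dst;

    while (s < end) {
        const unsigned char *span = s;
        size_t len = 0;

        /* Tight loop: advance over well-formed sequences only. */
        while (s < end && (len = valid_seq_len(s, end)) != 0)
            s += len;

        /* One bulk copy for the whole valid span. */
        memmove(d, span, (size_t)(s - span));
        d += s - span;

        if (s < end) {
            /* Handle the byte in error, then resume after it. */
            memcpy(d, "\xEF\xBF\xBD", 3);
            d += 3;
            s++;
        }
    }
    return (size_t)(d - dst);
}
```

For fully valid input the outer loop runs once: one validation pass, one
memmove, then return. Each error costs one extra pass through the loop.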

Perl5 Master Repository
