In perl.git, the branch khw/ebcdic has been created

<http://perl5.git.perl.org/perl.git/commitdiff/3c07265eb645f793cc9c17358319119d008bcb18?hp=0000000000000000000000000000000000000000>

        at  3c07265eb645f793cc9c17358319119d008bcb18 (commit)

- Log -----------------------------------------------------------------
commit 3c07265eb645f793cc9c17358319119d008bcb18
Author: Karl Williamson <[email protected]>
Date:   Wed Aug 19 22:38:44 2015 -0600

    XXX see if fixes it

M       utfebcdic.h

commit 29561dae57cd0cd087cc561b9fbd14a5839da027
Author: Karl Williamson <[email protected]>
Date:   Sat Aug 15 12:53:57 2015 -0600

    XXX Make EBCDIC cmp work when both operands are UTF-8
    
    XXX fix indent

M       sv.c

commit 9d64b018bf3b0c55cf141eaef928d15c46de0a3e
Author: Karl Williamson <[email protected]>
Date:   Sat Aug 15 12:53:17 2015 -0600

    utf8.h: Add comment; white space changes

M       utf8.h

commit b54a1be06f3d46fbbb101412481ee6d95cdc243b
Author: Karl Williamson <[email protected]>
Date:   Thu Aug 13 20:15:53 2015 -0600

    Revert "XXX Run Unicode's official normalization tests"

M       MANIFEST
M       Makefile.SH
M       charclass_invlists.h
D       lib/Unicode/testnorm.t
D       lib/unicore/NormTest.txt
M       regcharclass.h

commit 0d21c933484e51f42aa800b32b74d0c624a719a9
Author: Karl Williamson <[email protected]>
Date:   Mon Aug 3 22:00:53 2015 -0600

    XXX experimental: op/tr.t

M       t/op/tr.t

commit f7e1ac231bb0f0ecfcb036a8dc6f27b4361e107e
Author: Karl Williamson <[email protected]>
Date:   Mon Aug 3 10:35:26 2015 -0600

    XXX temporary

M       cpan/Encode/Encode.xs

commit 9992fd27249ef488be59256de3a745b0c46b5011
Author: Karl Williamson <[email protected]>
Date:   Mon Aug 3 10:17:08 2015 -0600

    XXX op/sort.t: Add more tests
    
    probably comment.  These should fail currently on ebcdic

M       t/op/sort.t

commit 6ab1aa3835d89a7d3f6cea66601474b0cfe1817d
Author: Karl Williamson <[email protected]>
Date:   Sun Aug 2 22:18:10 2015 -0600

    XXX Test Unicode::Collate and Unicode::Normalize

M       t/TEST

commit 3b4a6e6f94f031e5e0a270f24c8c24934581c186
Author: Karl Williamson <[email protected]>
Date:   Sun Aug 2 21:20:44 2015 -0600

    offuni

M       toke.c
M       utf8.c
M       utf8.h
M       utfebcdic.h

commit 218ad0acc079a2c68e0e9695cb684c015aeb2fa3
Author: Karl Williamson <[email protected]>
Date:   Sun Aug 2 21:21:25 2015 -0600

    invariant

M       utf8.h

commit d495d1e12964f661c4c073f99c1a79cc4f65284f
Author: Karl Williamson <[email protected]>
Date:   Sat Aug 1 22:15:18 2015 -0600

    Change EBCDIC macro definition
    
    This changes the definition of isUTF8_POSSIBLY_PROBLEMATIC() on EBCDIC
    platforms to use PL_charclass[] instead of PL_e2a[].  The new array is
    more likely to be in the memory cache.

M       handy.h
M       l1_char_class_tab.h
M       regen/mk_PL_charclass.pl
M       utf8.h
M       utfebcdic.h

commit 38a783df5d9c72656425368ef7c577fd6c8e7878
Author: Karl Williamson <[email protected]>
Date:   Sun Aug 2 09:02:51 2015 -0600

    Change EBCDIC macro definition
    
    Prior to this commit UVCHR_SKIP() had a single definition shared by
    ASCII and EBCDIC platforms, though it expanded to different things on
    each.  Now it is defined separately for each platform, directly as what
    it expands to, and the EBCDIC version, when fully expanded, uses
    PL_charclass[] instead of PL_e2a[].  The new array is more likely to be
    in the memory cache.

M       utf8.h
M       utfebcdic.h

commit 3f92e9c3c3f6c2ef6b8d1027f99e701496c5074b
Author: Karl Williamson <[email protected]>
Date:   Sat May 16 10:43:40 2015 -0600

    Change EBCDIC macro definition
    
    Prior to this commit UVCHR_IS_INVARIANT() had a single definition shared
    by ASCII and EBCDIC platforms, though it expanded to different things on
    each.  Now it is defined separately for each platform, directly as what
    it expands to, and the EBCDIC version, when fully expanded, uses
    PL_charclass[] instead of PL_e2a[].  The new array is more likely to be
    in the memory cache.

M       utf8.h
M       utfebcdic.h

commit be7ba5b5438ac52596ca906d1dd6009d8f665df9
Author: Karl Williamson <[email protected]>
Date:   Sat May 16 10:31:19 2015 -0600

    utf8.h: Change defn of UNI_IS_INVARIANT
    
    This changes it to be isASCII(), instead of repeating the "special"
    number 0x80.

M       utf8.h

commit a54998e45d2b875172499f8b4cb7b91a22420b3e
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:49:21 2015 -0600

    Remove no longer used #define
    
    The previous commit removed all uses of this non-public #define.

M       regen/unicode_constants.pl
M       unicode_constants.h

commit 7589454a4d4cbc90dc2a0b65fea530dd9c1aa39b
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:48:23 2015 -0600

    Change filter of problematic code points for EBCDIC
    
    There are three classes of problematic Unicode code points that may
    require special handling.  Which code points are problematic is fairly
    complicated, requiring lots of branches.  However, the smallest of them
    is 0xD800, which means that most code points in modern use are below
    them all, and a single test can be used to exclude just about everything
    likely to be encountered.  The problem was that the way this test was
    done on EBCDIC caused way too many things to pass and have to be checked
    with the more complicated branches.  The digits 0-9 and some capital
    letters were not filtered out.  This commit changes the EBCDIC test to
    transform into I8 (an array lookup), and this fixes it to exclude things
    that shouldn't have passed before.

M       utf8.c
M       utf8.h
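
The filtering idea in the commit message above can be sketched roughly as
follows.  This is only an illustration, not the code from utf8.c or utf8.h:
it works on already-decoded code points rather than on UTF-8 or UTF-EBCDIC
bytes, it assumes the three problematic classes are surrogates,
non-characters, and code points above the Unicode maximum, and every name in
it is hypothetical.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical name: the smallest code point in any problematic class. */
#define FIRST_PROBLEMATIC 0xD800u

/* Assumed class 1: UTF-16 surrogates. */
static bool is_surrogate(uint32_t cp)
{
    return cp >= 0xD800u && cp <= 0xDFFFu;
}

/* Assumed class 2: non-characters, i.e. U+FDD0..U+FDEF and the last two
 * code points of every plane. */
static bool is_nonchar(uint32_t cp)
{
    return (cp >= 0xFDD0u && cp <= 0xFDEFu)
        || ((cp & 0xFFFEu) == 0xFFFEu && cp <= 0x10FFFFu);
}

/* Assumed class 3: above the highest legal Unicode code point. */
static bool is_above_unicode(uint32_t cp)
{
    return cp > 0x10FFFFu;
}

static bool is_problematic(uint32_t cp)
{
    /* Cheap filter: one comparison rules out most text in common use. */
    if (cp < FIRST_PROBLEMATIC) {
        return false;
    }

    /* Only the rare survivors pay for the branchier checks. */
    return is_surrogate(cp) || is_nonchar(cp) || is_above_unicode(cp);
}
```

With this shape, is_problematic('A') returns after the first comparison,
while is_problematic(0xD800) falls through to the detailed checks.  The fix
described in the commit concerns making the equivalent first comparison
effective on EBCDIC, which this sketch does not model.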

commit 3f94459bc6d698267c9606375bd77ebadfd04aa5
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:35:45 2015 -0600

    Change some UTF-EBCDIC macro handling defns
    
    This commit changes the definitions of some macros for UTF-8 handling on
    EBCDIC platforms.  The previous definitions transformed the bytes into
    I8 and did tests on the transformed values.  The change is to use
    previously unused bits in l1_char_class_tab.h so the transform isn't
    needed, and generally only one branch is.  These macros are called from
    the inner loops of, for example, regex backtracking.

M       l1_char_class_tab.h
M       regen/mk_PL_charclass.pl
M       utfebcdic.h
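
As a rough illustration of the technique described above (classifying a byte
with a flag bit in a 256-entry table instead of transforming it first), here
is a minimal sketch.  It is not the contents of l1_char_class_tab.h: the
table, the flag names, and the functions are hypothetical, and the byte
ranges used to fill the table are the ordinary UTF-8 ones on an ASCII
platform, chosen only so the example is self-contained.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical flag bits stored per byte value. */
enum {
    CC_UTF8_START        = 1u << 0,
    CC_UTF8_CONTINUATION = 1u << 1
};

/* Hypothetical classification table, one entry per possible byte. */
static uint8_t char_class[256];

/* Fill the table once.  The ranges are those of standard UTF-8, used here
 * as a stand-in; a generated header would hold precomputed values. */
static void init_char_class(void)
{
    for (unsigned b = 0; b < 256; b++) {
        uint8_t flags = 0;

        if (b >= 0x80 && b <= 0xBF) {
            flags |= CC_UTF8_CONTINUATION;
        }
        if (b >= 0xC2 && b <= 0xF4) {
            flags |= CC_UTF8_START;
        }
        char_class[b] = flags;
    }
}

/* Each test is one table load, one mask, and a single branch at the
 * call site. */
static bool is_continuation(uint8_t b)
{
    return (char_class[b] & CC_UTF8_CONTINUATION) != 0;
}

static bool is_start(uint8_t b)
{
    return (char_class[b] & CC_UTF8_START) != 0;
}
```

The point of the design is that the per-byte work in an inner loop no longer
depends on how irregular the underlying byte ranges are; any irregularity is
absorbed when the table is built.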

commit 9d1753ead5da76fe974d1d62f2b46800f8ff4418
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:23:12 2015 -0600

    l1_char_class_tab.h: Add bits for UTF-EBCDIC
    
    This is for the next commit.

M       handy.h
M       l1_char_class_tab.h
M       regen/mk_PL_charclass.pl

commit 2b60205b634658fee19d39ab8ef899481e8018fb
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:21:25 2015 -0600

    regen/mk_PL_charclass.pl: Refactor a print
    
    This is in preparation for the next commits.

M       regen/mk_PL_charclass.pl

commit 391d8e6712a4daab62774c971b363d76c3f86a41
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 10:59:54 2015 -0600

    Add macro for converting Latin1 to UTF-8, and use it
    
    This adds a macro that converts a code point in the ASCII 128-255 range
    to UTF-8, and changes existing code to use it when the range is known to
    be restricted to this one, rather than the previous macro which accepted
    a wider range (any code point representable by 2 bytes), but had an
    extra test on EBCDIC platforms, hence was larger than necessary and
    slightly slower.

M       handy.h
M       hv.c
M       pp.c
M       regcomp.c
M       regexec.c
M       toke.c
M       utf8.c
M       utf8.h
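
For readers unfamiliar with the conversion, the sketch below shows what
encoding a code point in the 128-255 range as UTF-8 looks like on an ASCII
platform.  It is an assumption-laden illustration rather than the macro this
commit adds: the function name is hypothetical, and the EBCDIC case, which
needs an additional translation step, is not shown.

```c
#include <stdint.h>

/* Hypothetical helper: encode a code point in 0x80..0xFF as two UTF-8
 * bytes.  The caller guarantees the range, so no range test is needed. */
static void latin1_to_utf8(uint8_t c, uint8_t out[2])
{
    /* Lead byte: 110xxxxx, carrying the top two bits of c. */
    out[0] = (uint8_t)(0xC0 | (c >> 6));

    /* Continuation byte: 10xxxxxx, carrying the low six bits of c. */
    out[1] = (uint8_t)(0x80 | (c & 0x3F));
}
```

For example, 0xE9 (LATIN SMALL LETTER E WITH ACUTE) becomes the byte pair
0xC3 0xA9.  Because the input range is known in advance, the lead byte is
always 0xC2 or 0xC3.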

commit 5339a00ec4e7d2207ea6934cb912211293d71453
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 10:55:30 2015 -0600

    utf8.h: Add assertions to macro

M       utf8.h

commit 138b24c7fb156fb91011b700007331e41a90d592
Author: Karl Williamson <[email protected]>
Date:   Wed May 13 17:38:08 2015 -0600

    Change to use UVCHR_SKIP over UNI_SKIP
    
    UNI_SKIP is somewhat ambiguous.  Perl has long used 'uvchr' as part of a
    name to mean the unsigned values using the native character set plus
    Unicode values for those above 255.
    
    This also changes two calls (one in dquote_static.c and one in
    dquote_inline.h) to use UVCHR_SKIP; they should not have been OFFUNI, as
    they are dealing with native values.

M       dquote.c
M       dquote_inline.h
M       op.c
M       perl.c
M       pp.c
M       regcomp.c
M       regexec.c
M       toke.c
M       utf8.c

commit 047e1cfbfc366928c45b44642d73bb7ca4d4035e
Author: Karl Williamson <[email protected]>
Date:   Sat Aug 1 08:52:52 2015 -0600

    XXX Run Unicode's official normalization tests

M       MANIFEST
M       Makefile.SH
M       charclass_invlists.h
A       lib/Unicode/testnorm.t
A       lib/unicore/NormTest.txt
M       regcharclass.h

commit f816fd2e59ce137fc0d65d04db5e97a9a92fed37
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 11:16:18 2015 -0600

    lib/utf8.t: EBCDIC fixes
    
    Some of the tests chose code points that did not match their
    assumptions about how those code points are classified.
    
    Some of the tests were also extended to work on EBCDIC code page 1047.

M       lib/utf8.t

commit 577c2cf7dd938982aadfbd7de25ba4807922f8c5
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:47:50 2015 -0600

    lib/utf8.t: Add some tests to stress EBCDIC
    
    The tests for the Latin1 \xFF aren't a fair test of UTF-8 on EBCDIC
    platforms, because it is generally a UTF-8 invariant character, so is
    the same regardless of being in UTF-8 or not.  This adds some tests
    where the UTF-EBCDIC version is 2 bytes (as well as the UTF-8 version).

M       lib/utf8.t

commit cfa5db8246adf406eab67c689abf690585c2df8a
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:45:10 2015 -0600

    XXX t/uni/lex_utf8.t: Do some of the tests on EBCDIC
    
    XXX prob. the \xA2 and \377 will fail

M       t/uni/lex_utf8.t

commit 1731a3b26cbcdb0d8a4496003c9951b9b3c6252f
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:30:57 2015 -0600

    t/op/utf8decode.t: Better SKIP message

M       t/op/utf8decode.t

commit 73f1cb1698c344e6d6c3938b10fc36f59a30ff79
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:24:11 2015 -0600

    XXX experimental t/op/tr.t

M       t/op/tr.t

commit 6b0e4c57f669999ceaff6cc17abef3d34652a808
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:18:51 2015 -0600

    XXX experimental t/op/split.t

M       t/op/split.t

commit fa118bc6216db3ad1e0f4fb61819f2b1bf515bf6
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:02:08 2015 -0600

    t/base/lex.t: Use more standard test for EBCDIC
    
    This makes it easier to grep for these things.  The typical test is for
    the ord("A"), not some other character.  Since this is in t/base, it
    doesn't use helper scripts.

M       t/base/lex.t

commit a2dacbfd5b77a1c0b6ef32175de1c0bebdec8dee
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 09:52:59 2015 -0600

    XXX t/io/utf8.t: Experimental

M       t/io/utf8.t

commit a7cf3f9e1fbc0dc18ef7b0ed6c8071d3182771c2
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 08:49:37 2015 -0600

    XXX japh/abigail.t
    
    Experiment with running on EBCDIC, and using test.pl's skip()

M       t/japh/abigail.t

commit 892d13c06d46e72f49e860edb72bdd681cf591b9
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 13:34:07 2015 -0600

    t/op/tr.t: Clarify skip tests message

M       t/op/tr.t

commit eac1373fb52e15a882a6384375752d3837620b8c
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:35:12 2015 -0600

    perlapi: Nits

M       sv.c
M       util.c

commit 0e13a83eb90fc31c5ba5f54b7bd5a955841f5a0b
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:25:33 2015 -0600

    XXX look for more has X bit set

M       pad.c
M       sv.c

commit de15cb58e4f13fdcdae8497532bbaf12520f1b14
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:22:32 2015 -0600

    XXX look for more perlapi: Add L<>

M       op.h

commit c9c90ded41954a689df426a9c11e2c30230de331
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:21:51 2015 -0600

    perlapi: Add link

M       hv.c

commit 0e659180ba2359d87de9aa15439d71904679e070
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:21:17 2015 -0600

    XXX look for more perlapi UTF-8

M       gv.c
M       hv.c
M       sv.h

commit 34e3547e1d9bb9ca1f8f78635d298f916c2cdbcb
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:19:13 2015 -0600

    XXX look for more eg to e.g.

M       cv.h
M       mg.c

commit fcdc195e9f0f59c5fb7ae71614a29279db3becf2
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:17:48 2015 -0600

    XXX check and look more to come Add S<>

M       av.c
M       mg.c
M       op.c
M       pad.c
M       sv.c
M       utf8.h
M       util.h

commit ee72898936d14004980b38577da6b0c0f8fdd898
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:12:55 2015 -0600

    vutil.c Nits, C<> L<>, XXX cpan upstream

M       vutil.c

commit 7f475e89c89527bfb258847421952897b38f4ac4
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:10:16 2015 -0600

    XXX C<> for mro.xs

M       ext/mro/mro.xs

commit bb2db76d86fd1409e0ded4a6c36da6674f70c885
Author: Karl Williamson <[email protected]>
Date:   Thu May 7 10:58:54 2015 -0600

    XXX perlapi: Add C<> around
    
    Look through the code again, like for NUL(L)?
    
    Removes 'the' in front of parameter in some instances.

M       XSUB.h
M       av.c
M       dump.c
M       gv.c
M       handy.h
M       hv.c
M       hv.h
M       inline.h
M       intrpvar.h
M       mathoms.c
M       mg.c
M       mro_core.c
M       numeric.c
M       op.c
M       op.h
M       pad.c
M       pad.h
M       perl.c
M       pp_ctl.c
M       pp_pack.c
M       pp_sort.c
M       pp_sys.c
M       regexp.h
M       sv.c
M       sv.h
M       utf8.c
M       util.c

commit 7a593260ae60299118b2f2a1aa4aa651d9ed1150
Author: Karl Williamson <[email protected]>
Date:   Tue Aug 18 19:49:02 2015 -0600

    op/chr.t: Better skip message

M       t/op/chr.t

commit 81522e5e389da6f424809b89399d3c5b29346080
Author: Karl Williamson <[email protected]>
Date:   Fri Apr 3 12:06:39 2015 -0600

    Various .t files: Use globals to see if on EBCDIC
    
    These globals are already available; using them instead of rolling
    our own makes it easier to grep for these kinds of instances.

M       lib/bytes.t
M       lib/utf8.t
M       t/op/chr.t
M       t/op/lex.t
M       t/op/oct.t
M       t/op/split.t
M       t/op/sub_lval.t
M       t/op/tr.t
M       t/op/vec.t
M       t/op/ver.t

commit 4b0ddcb47f9841d167633773bc8acd40a6e64a89
Author: Karl Williamson <[email protected]>
Date:   Sat Aug 1 10:02:19 2015 -0600

    Revert "XXX Temp Configure to point to bash"
    
    This reverts commit

M       Configure

commit b7b672ed80a3da53c4242b013fb8173698e09bfd
Author: Karl Williamson <[email protected]>
Date:   Wed Dec 17 13:28:56 2014 -0700

    XXX Temp Configure to point to bash

M       Configure
-----------------------------------------------------------------------

--
Perl5 Master Repository
