In perl.git, the branch khw/ebcdic has been created

<http://perl5.git.perl.org/perl.git/commitdiff/c43a3daf4e28e3838e6751761435c38ed85a2044?hp=0000000000000000000000000000000000000000>

        at  c43a3daf4e28e3838e6751761435c38ed85a2044 (commit)

- Log -----------------------------------------------------------------
commit c43a3daf4e28e3838e6751761435c38ed85a2044
Author: Karl Williamson <[email protected]>
Date:   Wed Aug 19 22:38:44 2015 -0600

    XXX see if fixes it

M       utfebcdic.h

commit e8078691fbef1bae0c29b525a14540c85c6c325c
Author: Karl Williamson <[email protected]>
Date:   Sat Aug 15 12:53:57 2015 -0600

    XXX Make EBCDIC cmp work when both operands are UTF-8
    
    XXX fix indent

M       sv.c

commit 8eedaf5c5db1a9e8d04ef2c2434587e946848924
Author: Karl Williamson <[email protected]>
Date:   Sat Aug 15 12:53:17 2015 -0600

    utf8.h: Add comment; white space changes

M       utf8.h

commit d2fb78a679a7b0c3f0b1f9d0514c97644e8db152
Author: Karl Williamson <[email protected]>
Date:   Thu Aug 13 20:15:53 2015 -0600

    Revert "XXX Run Unicode's official normalization tests"

M       MANIFEST
M       Makefile.SH
M       charclass_invlists.h
D       lib/Unicode/testnorm.t
D       lib/unicore/NormTest.txt
M       regcharclass.h

commit f01064c5e5a414ad4d02661bc0575ba89e70b3f6
Author: Karl Williamson <[email protected]>
Date:   Mon Aug 3 22:00:53 2015 -0600

    XXX experimental: op/tr.t

M       t/op/tr.t

commit 290c4a0ef927933d1f008989efc25aa405930c1e
Author: Karl Williamson <[email protected]>
Date:   Mon Aug 3 10:35:26 2015 -0600

    XXX temporary

M       cpan/Encode/Encode.xs

commit 461cd0db7d306e94af29b77ab9f2e466e21db1fc
Author: Karl Williamson <[email protected]>
Date:   Mon Aug 3 10:17:08 2015 -0600

    XXX op/sort.t: Add more tests
    
    probably comment.  These should fail currently on ebcdic

M       t/op/sort.t

commit 5470af3f61cbbb9c7594e1ea5a3ca033a6fb89d8
Author: Karl Williamson <[email protected]>
Date:   Sun Aug 2 22:18:10 2015 -0600

    XXX Test Unicode::Collate and Unicode::Normalize

M       t/TEST

commit cc2499a8bdbbb0365e4bddf62b9110a6ea7e92c4
Author: Karl Williamson <[email protected]>
Date:   Sun Aug 2 21:20:44 2015 -0600

    offuni

M       toke.c
M       utf8.c
M       utf8.h
M       utfebcdic.h

commit a2b8da1d0d32ea9992e6c539abcdd6ceace5a7c9
Author: Karl Williamson <[email protected]>
Date:   Sun Aug 2 21:21:25 2015 -0600

    invariant

M       utf8.h

commit dbcdb7b188678cce915312fcc1331082a90c9d52
Author: Karl Williamson <[email protected]>
Date:   Sat Aug 1 22:15:18 2015 -0600

    Change EBCDIC macro definition
    
    This changes the definition of isUTF8_POSSIBLY_PROBLEMATIC() on EBCDIC
    platforms to use PL_charclass[] instead of PL_e2a[].  The new array is
    more likely to be in the memory cache.

M       handy.h
M       l1_char_class_tab.h
M       regen/mk_PL_charclass.pl
M       utf8.h
M       utfebcdic.h

commit e1dd68f490b2e1962ff67ce209f05eee6e2e419e
Author: Karl Williamson <[email protected]>
Date:   Sun Aug 2 09:02:51 2015 -0600

    Change EBCDIC macro definition
    
    Prior to this commit UVCHR_SKIP() was defined the same in both ASCII and
    EBCDIC, but the two definitions expanded to different things.  Now each
    platform defines it separately, directly as what it expands to, and the
    EBCDIC version, when fully expanded, uses PL_charclass[] instead of
    PL_e2a[].  The new array is more likely to be in the memory cache.

M       utf8.h
M       utfebcdic.h

commit e631543e56ed194378170eacf4484fed516933f3
Author: Karl Williamson <[email protected]>
Date:   Sat May 16 10:43:40 2015 -0600

    Change EBCDIC macro definition
    
    Prior to this commit UVCHR_IS_INVARIANT() was defined the same in both
    ASCII and EBCDIC, but the two definitions expanded to different things.
    Now each platform defines it separately, directly as what it expands to,
    and the EBCDIC version, when fully expanded, uses PL_charclass[] instead
    of PL_e2a[].  The new array is more likely to be in the memory cache.

M       utf8.h
M       utfebcdic.h

commit 1a8a28195a6dce5c9a0632bffad463c5b8116360
Author: Karl Williamson <[email protected]>
Date:   Sat May 16 10:31:19 2015 -0600

    utf8.h: Change defn of UNI_IS_INVARIANT
    
    This changes it to be isASCII(), instead of repeating the "special"
    number 0x80.

M       utf8.h

commit c3030d392ce92ee421eebe90b7675a5b0fd8515f
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:49:21 2015 -0600

    Remove no longer used #define
    
    The previous commit removed all uses of this non-public #define.

M       regen/unicode_constants.pl
M       unicode_constants.h

commit fce9db40bc2cc179574889d5c5b36c0e1adaf57d
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:48:23 2015 -0600

    Change filter of problematic code points for EBCDIC
    
    There are three classes of problematic Unicode code points that may
    require special handling.  Which code points are problematic is fairly
    complicated, requiring lots of branches.  However, the smallest of them
    is 0xD800, which means that most code points in modern use are below
    them all, and a single test can be used to exclude just about everything
    likely to be encountered.  The problem was that the way this test was
    done on EBCDIC caused way too many things to pass and have to be checked
    with the more complicated branches.  The digits 0-9 and some capital
    letters were not filtered out.  This commit changes the EBCDIC test to
    transform into I8 (an array lookup), and this fixes it to exclude things
    that shouldn't have passed before.
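
The two-stage check described above can be sketched roughly as follows.  This
is only an illustration: it works on code points rather than on encoded bytes,
all function names are hypothetical (they are not the macros in utf8.h), and
it assumes the three problematic classes are surrogates, non-characters, and
code points above the Unicode maximum.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helper: one cheap comparison that rejects everything below
 * the smallest problematic code point, 0xD800. */
static bool possibly_problematic(uint32_t cp) { return cp >= 0xD800; }

/* The more expensive per-class checks (assumed classes, see lead-in). */
static bool is_surrogate(uint32_t cp) { return cp >= 0xD800 && cp <= 0xDFFF; }

static bool is_nonchar(uint32_t cp)
{
    return (cp >= 0xFDD0 && cp <= 0xFDEF)
        || ((cp & 0xFFFE) == 0xFFFE && cp <= 0x10FFFF);
}

static bool is_above_unicode(uint32_t cp) { return cp > 0x10FFFF; }

/* Only code points that pass the cheap filter reach the branchy checks. */
static bool is_problematic(uint32_t cp)
{
    if (!possibly_problematic(cp))
        return false;
    return is_surrogate(cp) || is_nonchar(cp) || is_above_unicode(cp);
}

/* Small usage example. */
static void self_check(void)
{
    assert(!is_problematic(0x41));     /* 'A' is rejected by the filter */
    assert(is_problematic(0xD800));    /* first surrogate */
    assert(!is_problematic(0xE000));   /* passes the filter, but is fine */
}
```

The point of the filter is that it must be tight: if digits or capital letters
slip through it, as the message says happened on EBCDIC, every such character
pays for the slower checks.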

M       utf8.c
M       utf8.h

commit 47ed0b8484bfe31f915cb847f58f2dffb052dbab
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:35:45 2015 -0600

    Change some UTF-EBCDIC macro handling defns
    
    This commit changes the definitions of some macros for UTF-8 handling on
    EBCDIC platforms.  The previous definitions transformed the bytes into
    I8 and did tests on the transformed values.  The change is to use
    previously unused bits in l1_char_class_tab.h so the transform isn't
    needed, and generally only one branch is.  These macros are called from
    the inner loops of, for example, regex backtracking.
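
As a rough illustration of the table-lookup idea, the sketch below classifies
bytes with a single indexed load and a mask.  It is not the layout of
l1_char_class_tab.h: the table, the bit names, and the functions are all
hypothetical, and for simplicity it classifies standard UTF-8 bytes on an
ASCII platform rather than UTF-EBCDIC bytes.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical class bits; the real table assigns its own bit positions. */
enum { CLASS_CONTINUATION = 1u << 0, CLASS_START = 1u << 1 };

static uint8_t byte_class[256];

/* Fill the table once.  Assumption: standard UTF-8, where continuation
 * bytes are 0x80..0xBF and multi-byte start bytes are 0xC2..0xF4. */
static void init_byte_class(void)
{
    for (int b = 0; b < 256; b++) {
        uint8_t bits = 0;
        if ((b & 0xC0) == 0x80)
            bits |= CLASS_CONTINUATION;
        if (b >= 0xC2 && b <= 0xF4)
            bits |= CLASS_START;
        byte_class[b] = bits;
    }
}

/* Each test is one array lookup and one mask; no transform is needed. */
static bool is_continuation(uint8_t b)
{
    return (byte_class[b] & CLASS_CONTINUATION) != 0;
}

static bool is_start(uint8_t b)
{
    return (byte_class[b] & CLASS_START) != 0;
}

/* Small usage example. */
static void self_check(void)
{
    init_byte_class();
    assert(is_start(0xC3) && !is_continuation(0xC3));
    assert(is_continuation(0xA9) && !is_start(0xA9));
}
```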

M       l1_char_class_tab.h
M       regen/mk_PL_charclass.pl
M       utfebcdic.h

commit bf560d5fb0f561c675e5ecfa71a0e8b93475f1c7
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:23:12 2015 -0600

    l1_char_class_tab.h: Add bits for UTF-EBCDIC
    
    This is for the next commit.

M       handy.h
M       l1_char_class_tab.h
M       regen/mk_PL_charclass.pl

commit 31385bfe81deaf6b7b70106cda9c33d874c1ae41
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 14:21:25 2015 -0600

    regen/mk_PL_charclass.pl: Refactor a print
    
    This is in preparation for the next commits.

M       regen/mk_PL_charclass.pl

commit c54f25ef314738c81550aaf3d821862feac2120a
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 10:59:54 2015 -0600

    Add macro for converting Latin1 to UTF-8, and use it
    
    This adds a macro that converts a code point in the ASCII 128-255 range
    to UTF-8, and changes existing code to use it when the range is known to
    be restricted to this one, rather than the previous macro which accepted
    a wider range (any code point representable by 2 bytes), but had an
    extra test on EBCDIC platforms, hence was larger than necessary and
    slightly slower.
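
For illustration, a restricted conversion of this kind might look like the
sketch below.  The function name is hypothetical (it is not the macro this
commit adds), and the bit layout shown is ordinary UTF-8 on an ASCII platform;
UTF-EBCDIC uses a different layout that is not shown here.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: encode a code point known to be in 0x80..0xFF as
 * two UTF-8 bytes.  Because the range is restricted, no length decision
 * or extra range branch is needed, only an assertion in debug builds. */
static void latin1_upper_to_utf8(uint32_t cp, uint8_t out[2])
{
    assert(cp >= 0x80 && cp <= 0xFF);
    out[0] = (uint8_t)(0xC0 | (cp >> 6));   /* 110xxxxx: top bits */
    out[1] = (uint8_t)(0x80 | (cp & 0x3F)); /* 10xxxxxx: low six bits */
}
```

Under these assumptions, U+00E9 becomes the two bytes 0xC3 0xA9.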

M       handy.h
M       hv.c
M       pp.c
M       regcomp.c
M       regexec.c
M       toke.c
M       utf8.c
M       utf8.h

commit 3f091f55c8323961978ef01c7865b6a6277ca01e
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 10:55:30 2015 -0600

    utf8.h: Add assertions to macro

M       utf8.h

commit 97bd0c69232f077f26c7c276aea07909c850f12c
Author: Karl Williamson <[email protected]>
Date:   Wed May 13 17:38:08 2015 -0600

    Change to use UVCHR_SKIP over UNI_SKIP
    
    UNI_SKIP is somewhat ambiguous.  Perl has long used 'uvchr' as part of a
    name to mean the unsigned values using the native character set plus
    Unicode values for those above 255.
    
    This also changes two calls (one in dquote_static.c and one in
    dquote_inline.h) to use UVCHR_SKIP; they should not have been OFFUNI, as
    they are dealing with native values.

M       dquote.c
M       dquote_inline.h
M       op.c
M       perl.c
M       pp.c
M       regcomp.c
M       regexec.c
M       toke.c
M       utf8.c

commit 4f31dd57179ee2e6324922462d79c0698942b6c5
Author: Karl Williamson <[email protected]>
Date:   Sat Aug 1 08:52:52 2015 -0600

    XXX Run Unicode's official normalization tests

M       MANIFEST
M       Makefile.SH
M       charclass_invlists.h
A       lib/Unicode/testnorm.t
A       lib/unicore/NormTest.txt
M       regcharclass.h

commit 0a823a77617aff10cbd953663b0ea66f5a0d4302
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 11:16:18 2015 -0600

    lib/utf8.t: EBCDIC fixes
    
    Some of the tests chose code points that did not match their
    assumptions as to their classifications.
    
    And some of the tests were extended to work on 1047 EBCDIC.

M       lib/utf8.t

commit 93b407b45111e14547cc95a3856ef86c4be22181
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:47:50 2015 -0600

    lib/utf8.t: Add some tests to stress EBCDIC
    
    The tests for the Latin1 \xFF aren't a fair test of UTF-8 on EBCDIC
    platforms, because it is generally a UTF-8 invariant character, so is
    the same regardless of being in UTF-8 or not.  This adds some tests
    where the UTF-EBCDIC version is 2 bytes (as well as the UTF-8 version).

M       lib/utf8.t

commit ac63898cdade83ce1339e38706b55da46e0c5e9a
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:45:10 2015 -0600

    XXX t/uni/lex_utf8.t: Do some of the tests on EBCDIC
    
    XXX prob. the \xA2 and \377 will fail

M       t/uni/lex_utf8.t

commit 6dafb9d4988bd6b2d3f56210a70744c511fe4f51
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:30:57 2015 -0600

    t/op/utf8decode.t: Better SKIP message

M       t/op/utf8decode.t

commit d193504a7b53dabdd63108acd1b8101b05886d29
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:24:11 2015 -0600

    XXX experimental t/op/tr.t

M       t/op/tr.t

commit d4e338d6df6c4d7de435e8bbab0d89a17de25879
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:18:51 2015 -0600

    XXX experimental t/op/split.t

M       t/op/split.t

commit 0acb2d1f0e0da03b79bf8910eeaff11e428f3980
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 10:02:08 2015 -0600

    t/base/lex.t: Use more standard test for EBCDIC
    
    This makes it easier to grep for these things.  The typical test is for
    the ord("A"), not some other character.  Since this is in t/base, it
    doesn't use helper scripts.

M       t/base/lex.t

commit d016d99313015c8998198ca14ef23a1b85cc19ca
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 09:52:59 2015 -0600

    XXX t/io/utf8.t: Experimental

M       t/io/utf8.t

commit ff08984f50cc168b8400b4ef3996e6e2f49a74e6
Author: Karl Williamson <[email protected]>
Date:   Mon May 18 08:49:37 2015 -0600

    XXX japh/abigail.t
    
    Experiment with running on EBCDIC, and using test.pl's skip()

M       t/japh/abigail.t

commit 78944eece957f35f8ee977c0887aaa1a353309f9
Author: Karl Williamson <[email protected]>
Date:   Fri May 15 13:34:07 2015 -0600

    t/op/tr.t: Clarify skip tests message

M       t/op/tr.t

commit 40a01f036b99ae585b4f01c79bcd9a1e17b069e2
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:35:12 2015 -0600

    perlapi: Nits

M       sv.c
M       util.c

commit e74601af51a3fcde8904402a7356a5018659cdb5
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:25:33 2015 -0600

    XXX look for more has X bit set

M       pad.c
M       sv.c

commit f0627927547add5a90bdb0ffb9387f55b070dc55
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:22:32 2015 -0600

    XXX look for more perlapi: Add L<>

M       op.h

commit c6891bd1cced54b2fffc9bab23b0b7bf42103566
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:21:51 2015 -0600

    perlapi: Add link

M       hv.c

commit 2f949b72c3202bc2190a7618c39b6181dcb603e8
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:21:17 2015 -0600

    XXX look for more perlapi UTF-8

M       gv.c
M       hv.c
M       sv.h

commit 32af485e52467e04a403116f9c530865e01216d6
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:19:13 2015 -0600

    XXX look for more eg to e.g.

M       cv.h
M       mg.c

commit 3d5b06ebe7dbffe5b8e2633e081d38d4110eb174
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:17:48 2015 -0600

    XXX check and look more to come Add S<>

M       av.c
M       mg.c
M       op.c
M       pad.c
M       sv.c
M       utf8.h
M       util.h

commit 436f3faa295d3b72898f08f5c2143c36be5802b1
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:12:55 2015 -0600

    vutil.c Nits, C<> L<>, XXX cpan upstream

M       vutil.c

commit c93410a0325b0db39d56d7530dd21b517281eaab
Author: Karl Williamson <[email protected]>
Date:   Fri May 8 21:10:16 2015 -0600

    XXX C<> for mro.xs

M       ext/mro/mro.xs

commit 57b618d56b7d58494bf1c1887370c931a9c1e92c
Author: Karl Williamson <[email protected]>
Date:   Thu May 7 10:58:54 2015 -0600

    XXX perlapi: Add C<> around
    
    Look through the code again, like for NUL(L)?
    
    Removes 'the' in front of parameter in some instances.

M       XSUB.h
M       av.c
M       dump.c
M       gv.c
M       handy.h
M       hv.c
M       hv.h
M       inline.h
M       intrpvar.h
M       mathoms.c
M       mg.c
M       mro_core.c
M       numeric.c
M       op.c
M       op.h
M       pad.c
M       pad.h
M       perl.c
M       pp_ctl.c
M       pp_pack.c
M       pp_sort.c
M       pp_sys.c
M       regexp.h
M       sv.c
M       sv.h
M       utf8.c
M       util.c

commit cb864b3fb2c7ec25f6c616693863716915395a0f
Author: Karl Williamson <[email protected]>
Date:   Tue Aug 18 19:49:02 2015 -0600

    op/chr.t: Better skip message

M       t/op/chr.t

commit 11fdfddca14f276d09f285ac064991d30fbb8f80
Author: Karl Williamson <[email protected]>
Date:   Fri Apr 3 12:06:39 2015 -0600

    Various .t files: Use globals to see if on EBCDIC
    
    These globals are already available; by using them instead of rolling
    our own, it makes it easier to grep for these kinds of instances.

M       lib/bytes.t
M       lib/utf8.t
M       t/op/chr.t
M       t/op/lex.t
M       t/op/oct.t
M       t/op/split.t
M       t/op/sub_lval.t
M       t/op/tr.t
M       t/op/vec.t
M       t/op/ver.t

commit d4ba31ea6bf07ce0b7be127169d6290c22d78193
Author: Karl Williamson <[email protected]>
Date:   Sat Aug 1 10:02:19 2015 -0600

    Revert "XXX Temp Configure to point to bash"
    
    This reverts commit

M       Configure

commit 945085b3641e5cfe556fd68423b0b8cc98f3f730
Author: Karl Williamson <[email protected]>
Date:   Wed Dec 17 13:28:56 2014 -0700

    XXX Temp Configure to point to bash

M       Configure
-----------------------------------------------------------------------

--
Perl5 Master Repository
