In perl.git, the branch khw/ebcdic has been created
<http://perl5.git.perl.org/perl.git/commitdiff/cd09f6fc5db18661a62404c610e7230774cc8ff3?hp=0000000000000000000000000000000000000000>
at cd09f6fc5db18661a62404c610e7230774cc8ff3 (commit)
- Log -----------------------------------------------------------------
commit cd09f6fc5db18661a62404c610e7230774cc8ff3
Author: Karl Williamson <[email protected]>
Date: Tue Oct 6 10:05:56 2015 -0600
XXX IV also need pod changes iv
M ext/XS-APItest/t/utf8.t
M lib/utf8.t
M pod/perldiag.pod
M pod/perlunicode.pod
M t/lib/warnings/utf8
M t/op/bop.t
M t/op/chop.t
M t/op/index.t
M t/op/tr.t
M t/op/ver.t
M t/opbasic/qq.t
M t/re/pat_advanced.t
M utf8.c
commit 71c96bcc676716a8ce28617e215c73c66f340ba1
Author: Karl Williamson <[email protected]>
Date: Wed Oct 28 16:15:59 2015 -0600
XXX Extend UTF-EBCDIC to handle up to 2**32-1 later flesh out
This uses the same mechanism that Perl does to extend UTF-8 beyond what
might be its natural maximum. That is, when the UTF-8 start byte is
0xFF, it adds 6 more bytes to the character. For this purpose, only 1
byte was needed, but using 6 makes the handling between the two
character sets uniform, so some macros can now be shared instead of
having different values, and I thought there might be some subtetly that
I am unaware of, but which tripped up earlier implementors. I did
notice that if I added fewer bytes, there was a lot more possibility of
overlongs. XXX Example from utf8.out
M ebcdic_tables.h
M regen/ebcdic.pl
M utf8.c
M utf8.h
M utfebcdic.h
commit d907c9aae2dc3027b04a68b56295e0387ef18ccd
Author: Karl Williamson <[email protected]>
Date: Tue Oct 27 19:07:51 2015 -0600
XXX re/pat_advanced.t
M t/re/pat_advanced.t
commit 0e96382d610d77b3a6b6ceee868ed2b5cb7816bf
Author: Karl Williamson <[email protected]>
Date: Tue Oct 27 17:56:59 2015 -0600
Unicode::Collate: Generalize to work on EBCDIC
The data that this module gets is directly from unicode.org, and hence
is in Unicode order. But the input data is from whatever platform it's
running on, which may be in EBCDIC order. We just need to convert to
Unicode on input.
M cpan/Unicode-Collate/Collate.pm
commit 868cd526c13a5a735a2bf7c803d8f4126410f19f
Author: Karl Williamson <[email protected]>
Date: Mon Oct 26 12:02:42 2015 -0600
Deprecate 3 utf8 functions
These functions will generally lead to the wrong answers when used on
EBCDIC platforms in modern perls. The docs now give guidance on what to
use instead.
M embed.fnc
M proto.h
M utf8.c
commit 73283f93cfd164fc37494c75b993c5eac2a34107
Author: Karl Williamson <[email protected]>
Date: Sat Oct 24 20:08:47 2015 -0600
re/pat_advanced.t: Fix a test so also runs on EBCDIC
M t/re/pat_advanced.t
commit b71630e836001f017ec09a309e63ab0d383fe4c9
Author: Karl Williamson <[email protected]>
Date: Sat Oct 24 20:06:58 2015 -0600
t/TEST: Test Encode
M t/TEST
commit bc3759c8628957635f087a3cde013919f33a6e62
Author: Karl Williamson <[email protected]>
Date: Sat Oct 24 20:04:17 2015 -0600
Devel::Peek: document that uses STDERR
M ext/Devel-Peek/Peek.pm
commit 734a06cbb9d55a1f54691227bab738bb27003b3a
Author: Karl Williamson <[email protected]>
Date: Sat Oct 24 20:03:31 2015 -0600
Dumpvalue: Generalize for EBCDIC
I overlooked this module until now. It turns out that much of the code
I had changed had a common ancestor with the code I had already changed
to work on EBCDIC in lib/dumpvar.pl. So I just copied that changing the
things that needed to be different. It appears that Dumpvalue had a bug
in that it did not escape NUL, of all the C0 controls. I changed it to
do so.
M dist/Dumpvalue/lib/Dumpvalue.pm
M dist/Dumpvalue/t/Dumpvalue.t
commit 8c539b30a54e6845b0e66bdde4f4062938bf8f56
Author: Karl Williamson <[email protected]>
Date: Sat Oct 24 20:02:56 2015 -0600
XXX experiment cpan/Pod-Simple/t/fcodes_s.t
M cpan/Pod-Simple/t/fcodes_s.t
commit c8eba95f5fd9c93632e6be2f991e44f989a9ebb3
Author: Karl Williamson <[email protected]>
Date: Sat Oct 24 20:01:36 2015 -0600
XXX cpan decode.t: Generalize to run on EBCDIC
Just use another code point. The previous code point was a UTF-8
invariant on EBCDIC systems, so did not trigger what was being tested.
M cpan/Encode/t/decode.t
commit 7446f819f682151512376e328a6882145f8961a8
Author: Karl Williamson <[email protected]>
Date: Sat Oct 24 20:00:40 2015 -0600
XXX cpan cow.t: Fix so also runs on EBCDIC 1047
M cpan/Encode/t/cow.t
commit 39a5c44541d9943feb2f84d6846f4e74a3bcaf0e
Author: Karl Williamson <[email protected]>
Date: Tue Oct 6 10:49:18 2015 -0600
XXX lib/open.t: todo test until encode is fixed
M lib/open.t
commit 5fea4231d92aee5e7403ab16407e3e53ed0d1291
Author: Karl Williamson <[email protected]>
Date: Mon Oct 5 17:08:59 2015 -0600
XXX See if lib/open.t now works
M lib/open.t
commit 43da944eba136ca45884e758e19726994d4a1787
Author: Karl Williamson <[email protected]>
Date: Thu Oct 8 12:54:54 2015 -0600
MANIFEST, etc for prev. commit
M Porting/Maintainers.pl
commit b9a09b32ca77e7ac04296cd3333c255d0f0d77fb
Author: Karl Williamson <[email protected]>
Date: Thu Oct 1 11:19:22 2015 -0600
XXX cpan MIME::Base64::unicode.t: Generalize to work on EBCDIC
M cpan/MIME-Base64/t/unicode.t
commit e35c825b6b4f6ce4dd45f1eaec7dc684986c7652
Author: Karl Williamson <[email protected]>
Date: Tue Oct 6 16:53:53 2015 -0600
XXX cpan quoted-print.t: Generalize for non-ASCII
This had hard-coded in one EBCDIC code page, and would not work on
others. The ASCII and EBCDIC code was disjoint, and over the years,
changes had been made to the ASCII version, but not the EBCDIC.
This commit joins the disparate paths so that they will stay in sync,
and modernizes to use the is() test function, which it defines itself
rather than adding dependencies.
Because the diff listing is so disjointed, I manually checked that the
final result included the tests from the ASCII version, which had more
tests than the EBCDIC, and some revisions.
The one change is that I changed "\x20" occurrences to " ". I know
enough about perl parsing to know that the \x version will be
transformed into the literal blank before the module sees it, so that
keeping the \x version doesn't actually test anything.
M cpan/MIME-Base64/t/quoted-print.t
commit 99797af9e82ca95abf1ad8581c3ee6c701c639f9
Author: Karl Williamson <[email protected]>
Date: Tue Oct 6 14:32:54 2015 -0600
XXX cpan quoted-print.t: Add in an is() testing function
This function will allow this .t to be modernized to use more current
testing infrastructure. I added this separately from the changes in the
next commit, as my diff tool got confused and showed lots of change
fragments. This will make the diff in the following commit cleaner,
but it still is not very good.
M cpan/MIME-Base64/t/quoted-print.t
commit 44ce353e6f242a330516c89ecab7cdfe1ca6299b
Author: Karl Williamson <[email protected]>
Date: Tue Oct 6 14:20:33 2015 -0600
XXX cpan quoted-print.t: Test that interior \t is output as-is
Tabs aren't encoded except if they are at the end of line. But there
was no test for non-end of line tabs.
M cpan/MIME-Base64/t/quoted-print.t
commit 0ec276fb09d93f17ac3190043b3c381208e3269a
Author: Karl Williamson <[email protected]>
Date: Thu Oct 1 11:16:44 2015 -0600
XXX Debug cpan/MIME-Base64/t/base64.t
M cpan/MIME-Base64/t/base64.t
commit e7e21e00d4c821443dfe5889091330555ff42627
Author: Karl Williamson <[email protected]>
Date: Thu Oct 1 12:07:33 2015 -0600
Run official Unicode normalization tests if available
This will skip the test if the optional generated file is not built.
It is built by changing the Makefile options to run mktables.
M MANIFEST
A t/uni/normalization.t
commit 63b3d5a8de03c2cd22ad3f08fc7b0d9cf62d06c9
Author: Karl Williamson <[email protected]>
Date: Sat Oct 3 10:23:46 2015 -0600
Revert "Generate official normtest file"
M MANIFEST
M Makefile.SH
commit 0d4ab56906b2efa0b99daf10d4dbedd58fbf023c
Author: Karl Williamson <[email protected]>
Date: Mon Sep 21 13:31:45 2015 -0600
Generate official Unicode normalization test file
M MANIFEST
M Makefile.SH
M charclass_invlists.h
M lib/unicore/mktables
M regcharclass.h
A unicore/mktables
commit 4460ccdf42892b5d6cd1e2e6233283dc9cba031e
Author: Karl Williamson <[email protected]>
Date: Fri Sep 25 12:20:40 2015 -0600
XXX debug lib/open.t
M lib/open.t
commit 3876055b4c12a60a12344263aa93bf28d0866e95
Author: Karl Williamson <[email protected]>
Date: Sat Oct 3 08:46:28 2015 -0600
XXX cpan 07locale.t: Skip if locales not fully available
Some systems will have partial locale handling available, but not enough
to actually use; this commit improves the detection of them. Also, the
perl can be configured to not use locales, even if available, and this
commit now honors that.
M cpan/version/t/07locale.t
commit ab0e4b68be63a81b8930ed8be174b865ae9d30f9
Author: Karl Williamson <[email protected]>
Date: Mon Feb 25 17:04:24 2013 -0700
XXX CPAN and unsure Encode
M cpan/Encode/Encode.xs
M cpan/Encode/Unicode/Unicode.xs
commit 5aff58c8548a4fea669500e8e17b762b94e8ff88
Author: Karl Williamson <[email protected]>
Date: Tue Feb 3 14:04:48 2015 -0700
XXX Encode
M cpan/Encode/t/cow.t
M cpan/Encode/t/decode.t
commit ef4bb1c33e236894e910b01a9c598d9fdb5afa57
Author: Karl Williamson <[email protected]>
Date: Sun Dec 21 23:00:26 2014 -0700
XXX cpan: cpan/MIME-Base64/Base64.xs: Generalize for EBCDIC
M cpan/MIME-Base64/Base64.xs
commit 784a1c9ec413609789733d4c8e5d7eb90bd54a40
Author: Karl Williamson <[email protected]>
Date: Mon Apr 21 10:09:57 2014 -0600
cpan/ExtUtils-Install/t/InstallWithMM.t: White-space only
Indent for new block formed by previous commit
M cpan/ExtUtils-Install/t/InstallWithMM.t
commit d20bbfd08ba8302e1cacc59a5174ec558c258b68
Author: Karl Williamson <[email protected]>
Date: Tue Apr 16 09:03:47 2013 -0600
cpan/ExtUtils-Install/t/InstallWithMM.t: Skip if EBCDIC
Because it uses JSON
M cpan/ExtUtils-Install/t/InstallWithMM.t
commit ac294af851687d43357e697b31f360d370d58a30
Author: Karl Williamson <[email protected]>
Date: Mon Sep 7 22:39:09 2015 -0600
XXX remove all but CP 1047 EBCDIC tables
This makes things smaller and faster to regenerate during the debugging
stage.
M charclass_invlists.h
M ebcdic_tables.h
M l1_char_class_tab.h
M regcharclass.h
M regen/charset_translations.pl
M unicode_constants.h
commit 264b286a9c84d659dc8a7e3ea69a19f1a9656789
Author: Karl Williamson <[email protected]>
Date: Fri Sep 25 12:22:13 2015 -0600
t/TEST
M t/TEST
commit fbf5717315019674758623006a3205d40c2104d3
Author: Karl Williamson <[email protected]>
Date: Sun Aug 2 22:18:10 2015 -0600
XXX Test Unicode::Collate and Unicode::Normalize
M t/TEST
commit 840d0b43b20dd1b58e018163b6c5d8c09edcecc3
Author: Karl Williamson <[email protected]>
Date: Tue Mar 26 15:44:59 2013 -0600
t/TEST: Stop after 10 errors
M t/TEST
commit 2c5fe4caf890e6e770372058b85be601cd3eec9e
Author: Karl Williamson <[email protected]>
Date: Fri May 8 21:12:55 2015 -0600
XXX cpan vutil.c Nits, C<> L<>
M vutil.c
commit e74b750bc9ac4f644258dfbb826984a3083fc786
Author: Karl Williamson <[email protected]>
Date: Wed Sep 2 18:21:39 2015 -0600
XXX temp ebcdic tables in hex
M ebcdic_tables.h
M regen/ebcdic.pl
commit 4fa2d97a077d1f25b98e70034b0cdd669c72a647
Author: Karl Williamson <[email protected]>
Date: Sat Oct 3 09:34:54 2015 -0600
Import cpan's version/t/07locale.t
M cpan/version/t/07locale.t
-----------------------------------------------------------------------
--
Perl5 Master Repository