  Branch: refs/heads/blead
  Home:   https://github.com/Perl/perl5
  Commit: 2e778df261ad19a99df22ae42d2aab87bcfda96d
      https://github.com/Perl/perl5/commit/2e778df261ad19a99df22ae42d2aab87bcfda96d
  Author: Karl Williamson <k...@cpan.org>
  Date:   2025-04-17 (Thu, 17 Apr 2025)

  Changed paths:
    M charclass_invlists.inc
    M lib/unicore/mktables
    M lib/unicore/uni_keywords.pl
    M regcharclass.h
    M regexp_constants.h
    M uni_keywords.h

  Log Message:
  -----------
  mktables: White-space, comment only

Add comments, and rewrap comment lines to fit 80 columns


  Commit: 0fe4301063b58e4ac24dc721442fe600af9d2f2c
      https://github.com/Perl/perl5/commit/0fe4301063b58e4ac24dc721442fe600af9d2f2c
  Author: Karl Williamson <k...@cpan.org>
  Date:   2025-04-17 (Thu, 17 Apr 2025)

  Changed paths:
    M charclass_invlists.inc
    M lib/unicore/mktables
    M lib/unicore/uni_keywords.pl
    M regcharclass.h
    M regexp_constants.h
    M uni_keywords.h

  Log Message:
  -----------
  mktables: Handle new property NFKC_Simple_Casefold

Unicode 15.1 introduces this new property, which needs the same special
handling as plain NFKC_Casefold does.


  Commit: 7bff8910b72865c593c0694159654c9d73f710b7
      https://github.com/Perl/perl5/commit/7bff8910b72865c593c0694159654c9d73f710b7
  Author: Karl Williamson <k...@cpan.org>
  Date:   2025-04-17 (Thu, 17 Apr 2025)

  Changed paths:
    M charclass_invlists.inc
    M lib/unicore/mktables
    M lib/unicore/uni_keywords.pl
    M regcharclass.h
    M regexp_constants.h
    M uni_keywords.h

  Log Message:
  -----------
  mktables: Ignore missings entries in two files

These files are changed in 15.1 to have @missing lines, whereas they
didn't before. This leads to some warning messages, so turn off looking
at them, as we do for a number of other files.


  Commit: 15be213b1d4b3f7a5a253b0aa1bb6e79c342371d
      https://github.com/Perl/perl5/commit/15be213b1d4b3f7a5a253b0aa1bb6e79c342371d
  Author: Karl Williamson <k...@cpan.org>
  Date:   2025-04-17 (Thu, 17 Apr 2025)

  Changed paths:
    M charclass_invlists.inc
    M lib/unicore/mktables
    M lib/unicore/uni_keywords.pl
    M regcharclass.h
    M regexp_constants.h
    M uni_keywords.h

  Log Message:
  -----------
  mktables: Handle Unicode 16.0 DoNotEmit.txt

We handle it by ignoring this file, new to Unicode 16.0.
It consists of lists of characters that, to put it less delicately than
Unicode would like, they regret creating. But there are no rules
associated with them.

It would be nice to have a \p{DoNotEmit} property so that applications
could handle situations where this occurs. But I'm fearful that if we
did something like this, Unicode would later come up with something that
had the same intention but would be subtly or unsubtly different. That
has happened before, to our detriment. So I think we should wait to see
what they do do, in future releases.


  Commit: 8d9de0d5ea1f54d306dc6b3a9d8b9d196327dd4c
      https://github.com/Perl/perl5/commit/8d9de0d5ea1f54d306dc6b3a9d8b9d196327dd4c
  Author: Karl Williamson <k...@cpan.org>
  Date:   2025-04-17 (Thu, 17 Apr 2025)

  Changed paths:
    M charclass_invlists.inc
    M lib/unicore/mktables
    M lib/unicore/uni_keywords.pl
    M regcharclass.h
    M regexp_constants.h
    M uni_keywords.h

  Log Message:
  -----------
  mktables: Handle Unicode 16.0 Unikemet.txt file

This includes several new properties, some of which are considered
"provisional" by Unicode, which means they can be heavily revised or
withdrawn.

These properties are designed for use by scholars of hieroglyphics.


  Commit: 23e7edd7852f4ce0bbcd65b5baa5fc326b90bf09
      https://github.com/Perl/perl5/commit/23e7edd7852f4ce0bbcd65b5baa5fc326b90bf09
  Author: Karl Williamson <k...@cpan.org>
  Date:   2025-04-17 (Thu, 17 Apr 2025)

  Changed paths:
    M charclass_invlists.inc
    M lib/unicore/mktables
    M lib/unicore/uni_keywords.pl
    M regcharclass.h
    M regexp_constants.h
    M uni_keywords.h

  Log Message:
  -----------
  mktables: Support new Unicode 16.0 properties ID_Compat_Math_foo

These new properties are automatically handled, but there is a problem.
They have no short form names. Files are written for them based on
their names, and those files are not distinguishable on a DOS 8.3 file
system.

The solution here is to manually override the automatically generated
file names with distinguishable ones.

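The 8.3 collision described in the log message above can be sketched as follows. This is an illustration in Python, not the Perl code in mktables. The truncation rule (first eight characters, case-insensitive) is a simplifying assumption, `ID_Compat_Math_Start` and `ID_Compat_Math_Continue` are taken as the two instances of `ID_Compat_Math_foo`, and the override names `IDCMStart` and `IDCMCont` are hypothetical rather than the names the commit actually chose.

```python
from collections import defaultdict


def stem_8_3(name: str) -> str:
    """Assumed rule: an 8.3 file system distinguishes only the first
    eight characters of the stem, ignoring case."""
    return name[:8].upper()


def find_collisions(names, overrides=None):
    """Return {stem: [property names]} for every stem shared by two or
    more properties. 'overrides' maps a property name to a manually
    chosen file name that replaces the automatically derived one."""
    overrides = overrides or {}
    by_stem = defaultdict(list)
    for name in names:
        file_name = overrides.get(name, name)
        by_stem[stem_8_3(file_name)].append(name)
    return {stem: props for stem, props in by_stem.items() if len(props) > 1}


PROPERTIES = ["ID_Compat_Math_Start", "ID_Compat_Math_Continue"]

# Hypothetical override names, chosen only so that the first eight
# characters differ.
OVERRIDES = {
    "ID_Compat_Math_Start": "IDCMStart",
    "ID_Compat_Math_Continue": "IDCMCont",
}

if __name__ == "__main__":
    print(find_collisions(PROPERTIES))             # both names share one stem
    print(find_collisions(PROPERTIES, OVERRIDES))  # no collisions remain
```

Without overrides, both names truncate to the same stem; with them, each property gets its own file.
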
  Commit: 6b753e0de9baf64e3c65d7e0e55a792e281ca25c
      https://github.com/Perl/perl5/commit/6b753e0de9baf64e3c65d7e0e55a792e281ca25c
  Author: Karl Williamson <k...@cpan.org>
  Date:   2025-04-17 (Thu, 17 Apr 2025)

  Changed paths:
    M charclass_invlists.inc
    M lib/unicore/mktables
    M lib/unicore/uni_keywords.pl
    M regcharclass.h
    M regexp_constants.h
    M uni_keywords.h

  Log Message:
  -----------
  mktables: Handle Unicode 16.0 new \d ranges

mktables does a lot of sanity checks on the data it gets fed. One of
those is to make sure any \d group of code points is 10 long. This
verifies that Unicode has given us enough code points to form 0-9. It
assumes that if it got this much right, their numeric values are also
0-9.

This check has uncovered issues with the Unicode Standard in the past.
Nowadays, they've cleaned up their act, and it's been many releases
since there have been problems. But our checks remain, and I think they
should.

What happened in Unicode 16.0 was that there was a range of \d
characters that contains two consecutive groups of 0-9 values. The check
could be changed to verify that the count is divisible by 10, but
checking for this particular range is a bit safer.


  Commit: 407191920f62631584e33c5274db8dc4e867f826
      https://github.com/Perl/perl5/commit/407191920f62631584e33c5274db8dc4e867f826
  Author: Karl Williamson <k...@cpan.org>
  Date:   2025-04-17 (Thu, 17 Apr 2025)

  Changed paths:
    M charclass_invlists.inc
    M lib/unicore/mktables
    M lib/unicore/uni_keywords.pl
    M regcharclass.h
    M regexp_constants.h
    M uni_keywords.h

  Log Message:
  -----------
  mktables: Add count() method to Range class

There is already this method for lists of Ranges, so this is just so
callers don't need to know which they are operating on.


Compare: https://github.com/Perl/perl5/compare/326b81edc663...407191920f62
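
The last two log messages can be illustrated together. The sketch below is Python, not the Perl in mktables. The class names `Range` and `RangeList`, the function `digit_range_ok`, and the exception range are all hypothetical; in particular, the code points in `KNOWN_DOUBLE_RUNS` are private-use placeholders, not the actual Unicode 16.0 range. It shows a `count()` that works the same on a single range and on a list of ranges, and a digit check that accepts exactly 10 code points or one explicitly listed 20-long range.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Range:
    """A run of consecutive code points; 'end' is inclusive."""
    start: int
    end: int

    def count(self) -> int:
        return self.end - self.start + 1


class RangeList:
    """A list of ranges that offers the same count() interface, so
    callers need not know which kind of object they hold."""

    def __init__(self, ranges):
        self.ranges = list(ranges)

    def count(self) -> int:
        return sum(r.count() for r in self.ranges)


# Placeholder (start, end) pair standing in for the one range known to
# hold two consecutive 0-9 groups. These code points are NOT the real ones.
KNOWN_DOUBLE_RUNS = frozenset({(0xF0000, 0xF0013)})


def digit_range_ok(r: Range, known_double_runs=KNOWN_DOUBLE_RUNS) -> bool:
    """Accept a run of \\d code points only if it is exactly 10 long, or
    it is an explicitly listed range that is exactly 20 long."""
    n = r.count()
    if n == 10:
        return True
    return n == 20 and (r.start, r.end) in known_double_runs


if __name__ == "__main__":
    ascii_digits = Range(0x30, 0x39)
    double_run = Range(0xF0000, 0xF0013)
    print(digit_range_ok(ascii_digits))
    print(digit_range_ok(double_run))
    print(RangeList([ascii_digits, double_run]).count())
```

The looser alternative mentioned in the log message would be `r.count() % 10 == 0`, which accepts any multiple of 10; listing the one known range keeps the check able to flag an unexpected 20-long or 30-long run elsewhere.
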