Branch: refs/heads/blead
  Home:   https://github.com/Perl/perl5
  Commit: 2e778df261ad19a99df22ae42d2aab87bcfda96d
      
https://github.com/Perl/perl5/commit/2e778df261ad19a99df22ae42d2aab87bcfda96d
  Author: Karl Williamson <k...@cpan.org>
  Date:   2025-04-17 (Thu, 17 Apr 2025)

  Changed paths:
    M charclass_invlists.inc
    M lib/unicore/mktables
    M lib/unicore/uni_keywords.pl
    M regcharclass.h
    M regexp_constants.h
    M uni_keywords.h

  Log Message:
  -----------
  mktables: White-space, comment only

Add comments, and rewrap comment lines to fit 80 columns


  Commit: 0fe4301063b58e4ac24dc721442fe600af9d2f2c
      
https://github.com/Perl/perl5/commit/0fe4301063b58e4ac24dc721442fe600af9d2f2c
  Author: Karl Williamson <k...@cpan.org>
  Date:   2025-04-17 (Thu, 17 Apr 2025)

  Changed paths:
    M charclass_invlists.inc
    M lib/unicore/mktables
    M lib/unicore/uni_keywords.pl
    M regcharclass.h
    M regexp_constants.h
    M uni_keywords.h

  Log Message:
  -----------
  mktables: Handle new property NFKC_Simple_Casefold

Unicode 15.1 introduces this new property, which needs the same special
handling as plain NFKC_Casefold does.


  Commit: 7bff8910b72865c593c0694159654c9d73f710b7
      
https://github.com/Perl/perl5/commit/7bff8910b72865c593c0694159654c9d73f710b7
  Author: Karl Williamson <k...@cpan.org>
  Date:   2025-04-17 (Thu, 17 Apr 2025)

  Changed paths:
    M charclass_invlists.inc
    M lib/unicore/mktables
    M lib/unicore/uni_keywords.pl
    M regcharclass.h
    M regexp_constants.h
    M uni_keywords.h

  Log Message:
  -----------
  mktables: Ignore missings entries in two files

These files changed in Unicode 15.1 to have @missing lines, which they
didn't have before.  This leads to some warning messages, so turn off
looking at those lines, as we already do for a number of other files.
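
As a rough illustration of what "turning off looking at them" can mean,
here is a minimal Python sketch of a per-file switch that skips
"# @missing:" comment lines while still reading the ordinary records.
The function name, the flag name, and the sample lines are hypothetical;
mktables itself is written in Perl and its real option is not shown in
this message.

```python
def parse_lines(lines, ignore_missings=False):
    """Split data-file lines into (@missing defaults, ordinary records).

    Hypothetical sketch: when ignore_missings is true, '# @missing:'
    lines are treated like any other comment and dropped.
    """
    prefix = "# @missing:"
    defaults, records = [], []
    for line in lines:
        line = line.strip()
        if line.startswith(prefix):
            if not ignore_missings:
                defaults.append(line[len(prefix):].strip())
            continue
        if not line or line.startswith("#"):
            continue  # blank line or plain comment
        records.append(line)
    return defaults, records


# Made-up sample content, not copied from any Unicode data file.
sample = [
    "# @missing: 0000..10FFFF; SomeDefault",
    "# an ordinary comment",
    "0041; SomeValue",
]
print(parse_lines(sample))
print(parse_lines(sample, ignore_missings=True))
```

With the flag set, the defaults list comes back empty, so nothing
downstream has a reason to warn about them.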


  Commit: 15be213b1d4b3f7a5a253b0aa1bb6e79c342371d
      
https://github.com/Perl/perl5/commit/15be213b1d4b3f7a5a253b0aa1bb6e79c342371d
  Author: Karl Williamson <k...@cpan.org>
  Date:   2025-04-17 (Thu, 17 Apr 2025)

  Changed paths:
    M charclass_invlists.inc
    M lib/unicore/mktables
    M lib/unicore/uni_keywords.pl
    M regcharclass.h
    M regexp_constants.h
    M uni_keywords.h

  Log Message:
  -----------
  mktables: Handle Unicode 16.0 DoNotEmit.txt

We handle it by ignoring this file, new to Unicode 16.0.

It consists of lists of characters that, to put it less delicately than
Unicode would like, they regret creating.

But there are no rules associated with them.  It would be nice to have a
\p{DoNotEmit} property so that applications could handle situations
where this occurs.  But I'm fearful that if we did something like this,
Unicode would later come up with something that had the same intention
but was subtly or unsubtly different.

That has happened before, to our detriment.

So I think we should wait to see what they do do, in future releases.


  Commit: 8d9de0d5ea1f54d306dc6b3a9d8b9d196327dd4c
      
https://github.com/Perl/perl5/commit/8d9de0d5ea1f54d306dc6b3a9d8b9d196327dd4c
  Author: Karl Williamson <k...@cpan.org>
  Date:   2025-04-17 (Thu, 17 Apr 2025)

  Changed paths:
    M charclass_invlists.inc
    M lib/unicore/mktables
    M lib/unicore/uni_keywords.pl
    M regcharclass.h
    M regexp_constants.h
    M uni_keywords.h

  Log Message:
  -----------
  mktables: Handle Unicode 16.0 Unikemet.txt file

This includes several new properties, some of which are considered
"provisional" by Unicode, which means they can be heavily revised or
withdrawn.

These properties are designed for use by scholars of hieroglyphics.


  Commit: 23e7edd7852f4ce0bbcd65b5baa5fc326b90bf09
      
https://github.com/Perl/perl5/commit/23e7edd7852f4ce0bbcd65b5baa5fc326b90bf09
  Author: Karl Williamson <k...@cpan.org>
  Date:   2025-04-17 (Thu, 17 Apr 2025)

  Changed paths:
    M charclass_invlists.inc
    M lib/unicore/mktables
    M lib/unicore/uni_keywords.pl
    M regcharclass.h
    M regexp_constants.h
    M uni_keywords.h

  Log Message:
  -----------
  mktables: Support new Unicode 16.0 properties ID_Compat_Math_foo

These new properties are automatically handled, but there is a problem.
They have no short form names.  Files are written for them based on
their names, and those files are not distinguishable on a DOS 8.3 file
system.  The solution here is to manually override the automatically
generated file names with distinguishable ones.
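
To make the collision concrete, here is a minimal Python sketch.  It
assumes a simplified view of an 8.3 file system in which only the first
eight characters of the stem matter, compared case-insensitively.  The
file stems and the override table are hypothetical and are not the names
mktables actually generates.

```python
def dos_83_stem(name):
    """Simplified 8.3 view: first 8 characters of the stem, upper-cased."""
    return name[:8].upper()


def collisions(names):
    """Group names that become indistinguishable under dos_83_stem()."""
    seen = {}
    for name in names:
        seen.setdefault(dos_83_stem(name), []).append(name)
    return {stem: group for stem, group in seen.items() if len(group) > 1}


# Hypothetical automatically generated stems for two long property names.
auto = ["IDCompatMathStart", "IDCompatMathContinue"]

# Hypothetical manual overrides that differ within the first 8 characters.
overrides = {
    "IDCompatMathStart": "IDCMStrt",
    "IDCompatMathContinue": "IDCMCont",
}

print(collisions(auto))
print(collisions([overrides[n] for n in auto]))
```

The first call reports both names under the same truncated stem; the
second reports no collisions once the overrides are applied.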


  Commit: 6b753e0de9baf64e3c65d7e0e55a792e281ca25c
      
https://github.com/Perl/perl5/commit/6b753e0de9baf64e3c65d7e0e55a792e281ca25c
  Author: Karl Williamson <k...@cpan.org>
  Date:   2025-04-17 (Thu, 17 Apr 2025)

  Changed paths:
    M charclass_invlists.inc
    M lib/unicore/mktables
    M lib/unicore/uni_keywords.pl
    M regcharclass.h
    M regexp_constants.h
    M uni_keywords.h

  Log Message:
  -----------
  mktables: Handle Unicode 16.0 new \d ranges

mktables does a lot of sanity checks on the data it gets fed.  One of
those is to make sure any \d group of code points is 10 long.  This
verifies that Unicode has given us enough code points to form 0-9.  It
assumes that if Unicode got this much right, the numeric values are
also 0-9.  This check has uncovered issues with the Unicode Standard in
the past.

Nowadays, they've cleaned up their act, and it's been many releases
since there have been problems.  But our checks remain, and I think they
should.

What happened in Unicode 16.0 is that there is a range of \d characters
that contains two consecutive groups of 0-9 values.  The check could be
changed to verify that the count is divisible by 10, but checking for
this particular range is a bit safer.
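
The trade-off between the two checks can be sketched in Python.  This is
only an illustration under stated assumptions: the function name and the
allow-list are hypothetical, the code points come from the Private Use
Area and are not the real Unicode 16.0 range, and mktables itself is
written in Perl.

```python
def digit_run_ok(start, end, known_long_runs=frozenset()):
    """Return True if the inclusive range start..end is an acceptable
    run of \\d code points.

    A run of exactly 10 is always accepted.  A longer run is accepted
    only if it is explicitly listed, which is stricter than merely
    testing that the count is divisible by 10.
    """
    count = end - start + 1
    if count == 10:
        return True
    return count % 10 == 0 and (start, end) in known_long_runs


# Made-up range standing in for the one special case.
allowed = frozenset({(0xE000, 0xE013)})

print(digit_run_ok(0x30, 0x39))               # ASCII 0-9
print(digit_run_ok(0xE000, 0xE013, allowed))  # listed run of 20
print(digit_run_ok(0xE100, 0xE113, allowed))  # unlisted run of 20
```

An unlisted run of 20 still fails here, whereas a plain
divisible-by-10 test would let it through unnoticed.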


  Commit: 407191920f62631584e33c5274db8dc4e867f826
      
https://github.com/Perl/perl5/commit/407191920f62631584e33c5274db8dc4e867f826
  Author: Karl Williamson <k...@cpan.org>
  Date:   2025-04-17 (Thu, 17 Apr 2025)

  Changed paths:
    M charclass_invlists.inc
    M lib/unicore/mktables
    M lib/unicore/uni_keywords.pl
    M regcharclass.h
    M regexp_constants.h
    M uni_keywords.h

  Log Message:
  -----------
  mktables: Add count() method to Range class

There is already this method for lists of Ranges, so this is just so
callers don't need to know which they are operating on.
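
The idea is that a single range and a list of ranges answer the same
question through the same method.  A minimal Python sketch follows; the
class and attribute names are hypothetical stand-ins, not the Perl
classes in mktables.

```python
class Range:
    """One inclusive run of code points."""

    def __init__(self, start, end):
        self.start = start
        self.end = end

    def count(self):
        """Number of code points in this range."""
        return self.end - self.start + 1


class RangeList:
    """An ordered collection of non-overlapping Range objects."""

    def __init__(self, ranges):
        self.ranges = list(ranges)

    def count(self):
        """Total number of code points across all ranges."""
        return sum(r.count() for r in self.ranges)


def how_many(obj):
    """Caller that works on either type without checking which it has."""
    return obj.count()


digits = Range(0x30, 0x39)
both = RangeList([digits, Range(0x660, 0x669)])
print(how_many(digits), how_many(both))
```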


Compare: https://github.com/Perl/perl5/compare/326b81edc663...407191920f62
