Hi Andrew,

ironically, the patch did not apply for me because you sent it with

  Content-Type: text/plain; charset=iso-8859-1

so when i saved it to disk, it remained in ISO-Latin-1.
In the license comment you are changing, there is a U+00A9 COPYRIGHT
SIGN encoded as UTF-8.  I suggest you change that to "(c)" in both
files before commit to avoid similar issues in the future.


Andrew Fresh wrote on Sat, Feb 15, 2020 at 11:26:39AM -0800:

> This is two patches, the first updates to the latest Unicode licence

I confirm that the license change is correct.
Nice to see people actually making a license better,
in this case by dropping an awkward modification clause.

> (and adds a comment with the version of unicode this file contains)

That makes sense to me.

> https://www.unicode.org/license.html
> 
> The second is the output of running the script with perl 5.30.1, which I
> usually remember to do earlier.

I compared to the output of the script generated on my machine, and
i looked through the changes and found nothing suspicious.  There
is one new rare latin letter (LATIN CAPITAL LETTER S WITH HOOK),
and some other latin letters that seem even more unusual.  The rest
are mostly new characters and new character properties for non-latin
scripts, and of course lots of new symbols and pictographs, including
a whole block of chess symbols.  There are also some changes that
may plausibly be bugfixes, like an Ethiopic special mark no longer
being iswalpha(3) and one Lepcha letter going in just the opposite
direction.

The most notable changes may include:

 * Surrogates are no longer considered control characters.
   I do not expect fallout from that change because they should
   not appear in UTF-8 text in the first place.
 * There are whole new blocks of Georgian, Tamil, Rohingya, Dogra,
   Gondi, and of some historical scripts.
 
> Comments, OK?

A very minor detail: it appears you only applied parts of the license
change patch before running the script, i got the difference shown
below relative to what you sent.

OK schwarze@ either way, please go ahead!
  Ingo


--- en_US.UTF-8.src.bentley     Sun Feb 16 23:39:19 2020
+++ en_US.UTF-8.src     Mon Feb 17 07:47:55 2020
@@ -1,11 +1,11 @@
-/*     $OpenBSD: en_US.UTF-8.src,v 1.9 2019/02/22 16:35:16 afresh1 Exp $       
*/
+/*     $OpenBSD$       */
 
 /*
  * COPYRIGHT AND PERMISSION NOTICE
  *
- * Copyright © 1991-2015 Unicode, Inc. All rights reserved.
+ * Copyright (c) 1991-2019 Unicode, Inc. All rights reserved.
  * Distributed under the Terms of Use in
- * http://www.unicode.org/copyright.html.
+ * https://www.unicode.org/copyright.html.
  *
  * Permission is hereby granted, free of charge, to any person obtaining
  * a copy of the Unicode data files and any associated documentation
@@ -41,7 +41,6 @@
 VARIABLE        CODESET=UTF-8
 
 /* Unicode Version 12.1.0 */
-
 
 /*
  * U+0000 - U+007F : Basic Latin

Reply via email to