Re: domain names and internationalization?

2022-09-21 Thread Rick Macklem
Christoph Moench-Tegeder  wrote:
>## Rick Macklem (rmack...@uoguelph.ca):
>
>> I am hoping someone knows what DNS does in this area (the
>> working group list uses terms like umlaut, which I have never
>> even heard of;-).
>
>The dry start on that topic is RFC5890
>https://datatracker.ietf.org/doc/html/rfc5890
>The Wikipedia overview looks really decent:
>https://en.wikipedia.org/wiki/Internationalized_domain_name
>What else?
Thanks. Both the RFC and wikipedia article were useful.

It turns out that the messy part for NFSv4 is that the RFCs
specified that the labels were in Unicode (U-labels) and
not A-labels.
--> RFC 7530 also wants the code to translate an A-label
   to a U-label and then compare U-labels. (Not sure why
   that is preferable to a case independent comparison of
   the A-labels, but maybe the intent was that an A-label
   would compare the same as a U-label in a domain name?)

The FreeBSD man page for hostname(1) and gethostname(3)
don't seem to limit the labels in the name to A-label format.

Does anyone use non-ascii (ie. a U-label with multibyte characters
in it) in a machine's hostname?

Right now the coding of nfsuserd(8) does not conform to the RFCs,
but should work for domain names where all the labels are
A-labels (either LDN or "xn--" followed by a Punycode encoded
unicode string).

rick

Gruß <-- see? eszett :)
Christoph

--
Spare Space



Re: domain names and internationalization?

2022-09-20 Thread Stefan Esser

Am 19.09.22 um 22:27 schrieb Rick Macklem:

Hi,

Recently there has been discussion on the NFSv4 IETF working
group email list w.r.t. internationalization for the domain name
it uses for users/groups.


Hi Rick,

I do assume that you know about RFC 3492 (Punycode):

https://datatracker.ietf.org/doc/html/rfc3492


Right now, I am pretty sure the FreeBSD nfsuserd(8) only works
for ascii domain names, but...


You can manually translate domain names into their Punycode
representation. The NFS code could work with them and only
translate them back to UTF-8 (or whatever) for display purposes.

For pure ASCII this is an identity transformation, for names
that actually represent UTF-8 strings, the value to send to
DNS servers (and to locally store in the daemon) could be the
internally stored Punycode representation.


I am hoping someone knows what DNS does in this area (the
working group list uses terms like umlaut, which I have never
even heard of;-).


That's the contraction of "ae", "oe", "ue" that has long ago
been introduced into the German writing system, with the "e"
abbreviated to two dots above the vocal, e.g. "ae" --> "ä".
Just a convenience rule to speed up manually copying the bible
in monasteries in medieval times ;-)

But there are many other accented letters in other languages,
that can be used in internationalized domain names, and the
whole set of Unicode characters can be represented using
Punycode.


I know essentially nothing about internationalization, so any hints
will be appreciated.


For a start:

https://en.wikipedia.org/wiki/Internationalized_domain_name
https://en.wikipedia.org/wiki/Punycode

There are C implementations of the transformations, e.g. in the
dns/libidn2 port.

We do not seem to have equivalent library functions in the
FreeBSD base system yet, but probably should provide them.

Best regards, STefan



Re: domain names and internationalization?

2022-09-19 Thread Alexander Leidinger
Quoting Rick Macklem  (from Mon, 19 Sep 2022  
20:27:29 +):



Hi,

Recently there has been discussion on the NFSv4 IETF working
group email list w.r.t. internationalization for the domain name
it uses for users/groups.

Right now, I am pretty sure the FreeBSD nfsuserd(8) only works
for ascii domain names, but...

I am hoping someone knows what DNS does in this area (the
working group list uses terms like umlaut, which I have never
even heard of;-).


DNS does this:
https://en.wikipedia.org/wiki/Punycode
This page also shows some umlauts (German ones to be precise, e.g.  
"Bücher") and other things like chinese and other characters.


There are libs which do the conversation, e.g.  
https://www.gnu.org/software/libidn/doxygen/index.html

I don't know if there are libs with more preferred licenses.

Bye,
Alexander.

--
http://www.Leidinger.net alexan...@leidinger.net: PGP 0x8F31830F9F2772BF
http://www.FreeBSD.orgnetch...@freebsd.org  : PGP 0x8F31830F9F2772BF


pgpb4gSj3Lh6y.pgp
Description: Digitale PGP-Signatur


Re: domain names and internationalization?

2022-09-19 Thread Mehmet Erol Sanliturk
On Mon, Sep 19, 2022 at 11:27 PM Rick Macklem  wrote:

> Hi,
>
> Recently there has been discussion on the NFSv4 IETF working
> group email list w.r.t. internationalization for the domain name
> it uses for users/groups.
>
> Right now, I am pretty sure the FreeBSD nfsuserd(8) only works
> for ascii domain names, but...
>
> I am hoping someone knows what DNS does in this area (the
> working group list uses terms like umlaut, which I have never
> even heard of;-).
>
> I know essentially nothing about internationalization, so any hints
> will be appreciated.
>
> Thanks, rick
>
>




https://en.wikipedia.org/wiki/Internationalization_and_localization
Internationalization and localization

https://www.google.com/search?q=internationalization+%28i18n%29=ALiCzsbsXi8Z_tScj_8BZPTxNfpQvwPEYw%3A1663641967630=bykpY8iJJpeBxc8PjpCF8AY=internationalization_lcp=Cgdnd3Mtd2l6EAEYADIKCAAQRxDWBBCwAzIKCAAQRxDWBBCwAzIKCAAQRxDWBBCwAzIKCAAQRxDWBBCwAzIKCAAQRxDWBBCwAzIKCAAQRxDWBBCwAzIKCAAQRxDWBBCwAzIKCAAQRxDWBBCwAzIHCAAQsAMQQzIHCAAQsAMQQ0oECEEYAEoECEYYAFAAWABgr11oAXABeACAAQCIAQCSAQCYAQDIAQrAAQE=gws-wiz

internationalization (i18n)



With my best wishes  for all .


Mehmet Erol Sanliturk


domain names and internationalization?

2022-09-19 Thread Rick Macklem
Hi,

Recently there has been discussion on the NFSv4 IETF working
group email list w.r.t. internationalization for the domain name
it uses for users/groups.

Right now, I am pretty sure the FreeBSD nfsuserd(8) only works
for ascii domain names, but...

I am hoping someone knows what DNS does in this area (the
working group list uses terms like umlaut, which I have never
even heard of;-).

I know essentially nothing about internationalization, so any hints
will be appreciated.

Thanks, rick