unicode combinig mark/ std.uni question

ikod via Digitalmars-d Tue, 05 Dec 2017 12:06:01 -0800

Hello,

I have to create very basic IDNA (Internationalized Domain Namesin Applications) library. There are two parts in IDNA - userinput checks and punycode encoding/decoding.

Punycode part already completed, and now I have to add somechecks but I'm weak in unicode and cant find proper way toexpress these tests using std.uni.

Here are list of prohibited domain labels(https://tools.ietf.org/html/rfc5891):

o Labels whose first character is a combining mark (see TheUnicode

      Standard, Section 2.11 [Unicode]).

o Labels containing prohibited code points, i.e., those thatare

      assigned to the "DISALLOWED" category of the Tables document
      [RFC5892].

o Labels containing code points that are identified in theTablesdocument as "CONTEXTJ", i.e., requiring exceptionalcontextualrule processing on lookup, but that do not conform to thoserules.Note that this implies that a rule must be defined, notnull: acharacter that requires a contextual rule but for which theruleis null is treated in this step as having failed to conformto the

      rule.

o Labels containing code points that are identified in theTablesdocument as "CONTEXTO", but for which no such rule appearsin thetable of rules. Applications resolving DNS names orcarrying outequivalent operations are not required to test contextualrulesfor "CONTEXTO" characters, only to verify that a rule isdefined(although they MAY make such tests to provide betterprotection or

      give better information to the user).

o Labels containing code points that are unassigned in theversionof Unicode being used by the application, i.e., in theUNASSIGNED

      category of the Tables document.

Can anybody help with this task?

Thanks!

unicode combinig mark/ std.uni question

Reply via email to