On Saturday, 20 June 2020 at 01:41:50 UTC, Paul Backus wrote:
It sounds like maybe what you are looking for is Unicode character categories: https://en.wikipedia.org/wiki/Unicode_character_property#General_Category
The character validation step could indeed be expressed using Unicode properties:
1. Allow Unicode White_Space
2. Reject Unicode Control
3. Allow everything else
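
For illustration, here is a rough Python sketch of those three rules applied in order, first match wins. It rests on a few assumptions: "Control" is taken to mean general category Cc, the White_Space set is hard-coded from the Unicode property list because Python's standard library does not expose that property directly (`str.isspace()` is close but not identical), and the names `WHITE_SPACE`, `is_allowed` and `is_valid` are made up for this example.

```python
import unicodedata

# Code points carrying the Unicode White_Space property, hard-coded here
# (assumption: this matches the PropList.txt of the Unicode version in use).
WHITE_SPACE = frozenset([
    *range(0x0009, 0x000E),  # TAB, LF, VT, FF, CR
    0x0020, 0x0085, 0x00A0, 0x1680,
    *range(0x2000, 0x200B),  # EN QUAD .. HAIR SPACE
    0x2028, 0x2029, 0x202F, 0x205F, 0x3000,
])


def is_allowed(ch: str) -> bool:
    """Apply the three rules in order to a single character."""
    if ord(ch) in WHITE_SPACE:
        return True   # rule 1: allow White_Space, even if it is also Cc
    if unicodedata.category(ch) == "Cc":
        return False  # rule 2: reject remaining control characters
    return True       # rule 3: allow everything else


def is_valid(text: str) -> bool:
    """A string passes only if every character is allowed."""
    return all(is_allowed(ch) for ch in text)
```

The order matters because the two sets overlap: tab, line feed and NEL (U+0085) are both White_Space and Control, so checking White_Space first is what lets them through.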