likyh commented on issue #3564: URL: https://github.com/apache/incubator-devlake/issues/3564#issuecomment-1294389877
Unicode | UTF-8 ----------------------+--------------------------------------------- 0000 0000-0000 007F | 0xxxxxxx 0000 0080-0000 07FF | 110xxxxx 10xxxxxx 0000 0800-0000 FFFF | 1110xxxx 10xxxxxx 10xxxxxx 0001 0000-0010 FFFF | 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx All multi-byte char in utf8 are start with byte `1`. So all ASCII char will not exist in other char. https://www.ietf.org/rfc/rfc3629.txt -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
