I'd like to fix more of our unicode mess while we're at it.
For example Mn (non-spacing combining marks) should allowed in
varid_{cont}, so it won't be like this:
h> let é=() in é
()
h> let x́=() in x́
:6:6: lexical error at character '\769'
(that’s because x́ is denormalized and is ac
Back in 2008 or so, GHC changed the behavior of unicode characters in the
parser that parse as OtherLetter to make them parse as lower case so that
languages like Japanese that lack case could be used in identifier names:
https://ghc.haskell.org/trac/ghc/ticket/1103
In a recent thread on reddit L