Hyunsik Choi created TAJO-230:
---------------------------------

             Summary: Support unicode identifiers
                 Key: TAJO-230
                 URL: https://issues.apache.org/jira/browse/TAJO-230
             Project: Tajo
          Issue Type: New Feature
          Components: parser
            Reporter: Hyunsik Choi


The current parse only recognizes a combination of alphabet characters and 
underscore as an identifier used for function names, column names, and table 
names. This is because of the following antlr lexer rules:
{code}
Identifier
  : Nonreserved_keywords
  | Regular_Identifier
  ;

Regular_Identifier
  : ('a'..'z'|'A'..'Z'|'_') ('a'..'z'|'A'..'Z'|Digit|'_')*
  ;
{code}

In some CJK country, their characters can be used as identifiers. We need to 
support unicode identifier.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to