[ 
https://issues.apache.org/jira/browse/IMPALA-14551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18037945#comment-18037945
 ] 

woosuk.ro commented on IMPALA-14551:
------------------------------------

* Root Cause
 ** The alias mapping logic checks if the expression can serve as an identifier 
via HiveLexer. With ANTLR 3.3, recovery does not consume input after a 
RecognitionException, leading to an infinite loop. Hive 3.1.3’s allowQuotedId 
default further hinders quoted Unicode identifier handling.
 * Proposed Fixes
 ** Upgrade ANTLR runtime to 3.5.x so recover() consumes input

>  Query hangs when selecting an expression that contains Unicode Letters
> -----------------------------------------------------------------------
>
>                 Key: IMPALA-14551
>                 URL: https://issues.apache.org/jira/browse/IMPALA-14551
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Frontend
>            Reporter: woosuk.ro
>            Priority: Major
>             Fix For: Impala 4.4.0
>
>
> * Summary
>  ** When a complex expression that contains Unicode Letters is selected 
> without an alias, an error during alias mapping causes the query to hang 
> before the planning phase. CANCEL QUERY is ineffective.
>  * Environment
>  ** Impala 4.4.0, Hive 3.1.3.
>  * Steps to Reproduce
>  ** In the SELECT list, use an expression containing Unicode Letters without 
> an alias.
>  ** Example : `select 누적합 - lag (누적합) over (partition by day order by day) 
> from base`
>  * Actual Behavior
>  ** During alias mapping, invoking HiveLexer triggers repeated retries 
> without consuming input, and the query hangs. Threads remain RUNNABLE and 
> cannot be canceled



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to