[ 
https://issues.apache.org/jira/browse/IMPALA-10415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256339#comment-17256339
 ] 

Quanlong Huang commented on IMPALA-10415:
-----------------------------------------

Similar with IMPALA-1130 but the error details are different. Current error is:
{code:java}
[localhost:21050] default> select 'è'
                         > ;
Traceback (most recent call last):
  File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 2063, in 
<module>
    impala_shell_main()
  File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 2028, in 
impala_shell_main
    shell.cmdloop(intro)
  File 
"/home/quanlong/workspace/Impala/toolchain/toolchain-packages-gcc7.5.0/python-2.7.16/lib/python2.7/cmd.py",
 line 141, in cmdloop
    line = self.precmd(line)
  File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 632, in 
precmd
    args = self.sanitise_input(args.decode('utf-8'))  # python2
  File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 436, in 
sanitise_input
    tokens = args.strip().split(' ')
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 8: ordinal 
not in range(128)
{code}
I also verified that impala-3.4 doesn't have this issue. So maybe some changes 
in impala-4.0 cause this.

> impala-shell crash in parsing multiline queries that contain UTF-8 characters
> -----------------------------------------------------------------------------
>
>                 Key: IMPALA-10415
>                 URL: https://issues.apache.org/jira/browse/IMPALA-10415
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Clients
>    Affects Versions: Impala 4.0
>            Reporter: Quanlong Huang
>            Assignee: Quanlong Huang
>            Priority: Critical
>
> I'm at master branch (commit a6a244099502329d9193b316ea26d5fd6451b6bd) and 
> hit this error:
> {code:java}
> [localhost:21050] default> select "你好";
> Query: select "你好"
> Query submitted at: 2020-12-30 11:00:40 (Coordinator: 
> http://quanlong-OptiPlex-BJ:25000)
> Query progress can be monitored at: 
> http://quanlong-OptiPlex-BJ:25000/query_plan?query_id=554d2348a28884c6:30835a4800000000
> +--------+
> | '你好' |
> +--------+
> | 你好   |
> +--------+
> Fetched 1 row(s) in 0.12s
> [localhost:21050] default> select
>                          > "你好";
> Traceback (most recent call last):
>   File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 2062, in 
> <module>
>     impala_shell_main()
>   File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 2027, in 
> impala_shell_main
>     shell.cmdloop(intro)
>   File 
> "/home/quanlong/workspace/Impala/toolchain/toolchain-packages-gcc7.5.0/python-2.7.16/lib/python2.7/cmd.py",
>  line 141, in cmdloop
>     line = self.precmd(line)
>   File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 631, in 
> precmd
>     args = self.sanitise_input(args.decode('utf-8'))  # python2
>   File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 435, in 
> sanitise_input
>     tokens = args.strip().split(' ')
> UnicodeDecodeError: 'ascii' codec can't decode byte 0xe4 in position 8: 
> ordinal not in range(128) {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to