Quanlong Huang created IMPALA-10415:
---------------------------------------
Summary: impala-shell crash in parsing multiline queries that
contain UTF-8 characters
Key: IMPALA-10415
URL: https://issues.apache.org/jira/browse/IMPALA-10415
Project: IMPALA
Issue Type: Bug
Components: Clients
Reporter: Quanlong Huang
Assignee: Quanlong Huang
Reproducing the issue by:
{code:java}
[localhost:21050] default> select "你好";
Query: select "你好"
Query submitted at: 2020-12-30 11:00:40 (Coordinator:
http://quanlong-OptiPlex-BJ:25000)
Query progress can be monitored at:
http://quanlong-OptiPlex-BJ:25000/query_plan?query_id=554d2348a28884c6:30835a4800000000
+--------+
| '你好' |
+--------+
| 你好 |
+--------+
Fetched 1 row(s) in 0.12s
[localhost:21050] default> select
> "你好";
Traceback (most recent call last):
File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 2062, in
<module>
impala_shell_main()
File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 2027, in
impala_shell_main
shell.cmdloop(intro)
File
"/home/quanlong/workspace/Impala/toolchain/toolchain-packages-gcc7.5.0/python-2.7.16/lib/python2.7/cmd.py",
line 141, in cmdloop
line = self.precmd(line)
File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 631, in
precmd
args = self.sanitise_input(args.decode('utf-8')) # python2
File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 435, in
sanitise_input
tokens = args.strip().split(' ')
UnicodeDecodeError: 'ascii' codec can't decode byte 0xe4 in position 8: ordinal
not in range(128) {code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)