Quanlong Huang created IMPALA-10299:
---------------------------------------
Summary: Impala-shell hangs in printing partial bytes of UTF-8
charactors
Key: IMPALA-10299
URL: https://issues.apache.org/jira/browse/IMPALA-10299
Project: IMPALA
Issue Type: Bug
Components: Clients
Reporter: Quanlong Huang
Saw the hanging when playing around with Chinese charactors:
{code:sql}
[localhost:21050] default> select substr("引擎", 1, 3);
Query: select substr("引擎", 1, 3)
Query submitted at: 2020-10-28 11:54:26 (Coordinator:
http://quanlong-OptiPlex-BJ:25000)
Query progress can be monitored at:
http://quanlong-OptiPlex-BJ:25000/query_plan?query_id=464f44d405bb16a1:5b41e00e00000000
+----------------------+
| substr('引擎', 1, 3) |
+----------------------+
| 引 |
+----------------------+
Fetched 1 row(s) in 0.13s
[localhost:21050] default> select substr("引擎", 1, 2);
Query: select substr("引擎", 1, 2)
Query submitted at: 2020-10-28 11:54:32 (Coordinator:
http://quanlong-OptiPlex-BJ:25000)
Query progress can be monitored at:
http://quanlong-OptiPlex-BJ:25000/query_plan?query_id=d048231be08f29b4:860c959500000000
Caught exception 'utf8' codec can't decode bytes in position 0-1: unexpected
end of data, type=<type 'exceptions.UnicodeDecodeError'> in FetchResults.
{code}
This seems not a problem in previous versions. E.g. see the examples in the
description of IMPALA-9662.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)