[
https://issues.apache.org/jira/browse/IMPALA-14653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18063626#comment-18063626
]
Michael Smith edited comment on IMPALA-14653 at 3/6/26 6:29 PM:
----------------------------------------------------------------
{code:java}
-- 2026-03-03 23:19:26,277 INFO FetchingThread: minimal_hs2: Start fetching results
-- 2026-03-03 23:19:26,278 INFO MainThread: Could not connect to ('::1', 21050, 0, 0)
Traceback (most recent call last):
  File "/data/jenkins/workspace/impala-asf-master-core-asan/repos/Impala/infra/python/env-gcc10.4.0-py3/lib/python3.11/site-packages/thrift/transport/TSocket.py", line 137, in open
    handle.connect(sockaddr)
ConnectionRefusedError: [Errno 111] Connection refused
-- 2026-03-03 23:19:26,290 INFO FetchingThread: d249189faf721e15:0c4ee3d000000000: fetching results
-- 2026-03-03 23:19:26,317 INFO MainThread: d249189faf721e15:0c4ee3d000000000: canceling operation
-- 2026-03-03 23:19:26,318 INFO MainThread: minimal_hs2: closing connection to: localhost:21050
-- 2026-03-03 23:19:26,318 INFO MainThread: Could not connect to ('::1', 21050, 0, 0)
Traceback (most recent call last):
  File "/data/jenkins/workspace/impala-asf-master-core-asan/repos/Impala/infra/python/env-gcc10.4.0-py3/lib/python3.11/site-packages/thrift/transport/TSocket.py", line 137, in open
    handle.connect(sockaddr)
ConnectionRefusedError: [Errno 111] Connection refused
-- 2026-03-03 23:19:26,389 INFO MainThread: d249189faf721e15:0c4ee3d000000000: closing query for operation
-- 2026-03-03 23:19:36,330 INFO FetchingThread: minimal_hs2: Stop fetching results
{code}
This happens around
[https://github.com/apache/impala/blob/master/tests/util/cancel_util.py#L129-L142].
It's not obvious that impyla should get the response it's getting. Maybe this
happens when the connection is closed in the middle of a fetch?
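To make the failure mode concrete: the AttributeError in the reported trace means the row set handed to the batch constructor was None. Below is a minimal sketch of that shape using stand-in classes. FakeRowSet, FakeFetchResp, and both wrap_results_* functions are hypothetical names for illustration, not impyla's API, and the guard is only one possible way a client could surface the condition, assuming a missing row set means the operation already ended.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class FakeRowSet:
    # Hypothetical stand-in for a Thrift row set; only 'columns' matters here.
    columns: List[list] = field(default_factory=list)


@dataclass
class FakeFetchResp:
    # Hypothetical stand-in for a fetch response; 'results' may be unset.
    results: Optional[FakeRowSet] = None
    hasMoreRows: bool = False


def wrap_results_unguarded(resp):
    # Same shape as the failing line in the trace: enumerate(<rowset>.columns).
    return [(i, col) for (i, col) in enumerate(resp.results.columns)]


def wrap_results_guarded(resp):
    # Assumption: a missing row set means the operation ended (e.g. cancelled),
    # so raise an error whose text a caller can match on.
    if resp.results is None:
        raise RuntimeError("Cancelled: fetch returned no row set")
    return [(i, col) for (i, col) in enumerate(resp.results.columns)]
```

With the unguarded version, an empty response raises the same "'NoneType' object has no attribute 'columns'" message seen in the report; the guarded version raises an error containing "Cancelled" instead.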
This case is
{code:java}
query_test.test_cancellation.TestCancellationParallel.test_cancel_select["table_format: text/none" | "protocol: hs2" | "query: select l_returnflag from lineitem" | "query_type: SELECT" | "cancel_delay: 0" | "wait_action: None" | "fail_rpc_action: COORD_CANCEL_QUERY_FINSTANCES_RPC:FAIL" | "join_before_close: False" | "buffer_pool_limit: 0" | "cpu_limit_s: 100000" | "mt_dop: 0" | "use_kill_query_statement: False" | "num_cancellation_iterations: 1" | "exec_option: {'abort_on_error': 1, 'batch_size': 0, 'disable_codegen': False, 'disable_codegen_rows_threshold': 0, 'exec_single_node_rows_threshold': 0, 'num_nodes': 0, 'test_replan': 1}"]
{code}
So join_before_close=False. But "closing query for operation" is logged after
the "Could not connect", and then we wait 10s for the FetchingThread to join. So
maybe this is a path we can hit when fetching on a cancelled query.
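The ordering described above (cancel, close, then a bounded join on the fetch thread) can be sketched as follows. This is a simplified illustration, not the code in cancel_util.py: the FetchingThread class here, run_cancel_without_join, and the fetch_fn/cancel_fn/close_fn callables are all hypothetical stand-ins.

```python
import threading
import time


class FetchingThread(threading.Thread):
    """Hypothetical stand-in for the test's fetch thread; not the real class."""

    def __init__(self, fetch_fn):
        super().__init__(daemon=True)
        self.fetch_fn = fetch_fn
        self.fetch_error = None

    def run(self):
        try:
            self.fetch_fn()
        except Exception as e:
            # Keep the error so the main thread can inspect it after join().
            self.fetch_error = e


def run_cancel_without_join(fetch_fn, cancel_fn, close_fn, join_timeout_s):
    """Cancel and close while the fetch may still be in flight, then join."""
    thread = FetchingThread(fetch_fn)
    thread.start()
    cancel_fn()
    close_fn()  # join_before_close=False: close before joining the fetch thread
    start = time.monotonic()
    thread.join(timeout=join_timeout_s)
    waited_s = time.monotonic() - start
    return thread, waited_s
```

If fetch_fn never returns after the cancel, join() blocks for the full timeout, which would match the roughly 10s gap before "Stop fetching results" in the log. If fetch_fn raises instead, the error lands in fetch_error, where a check such as 'Cancelled' in str(fetch_error) would fail for the AttributeError in this report.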
> test_cancel_select is failing with AttributeError
> -------------------------------------------------
>
> Key: IMPALA-14653
> URL: https://issues.apache.org/jira/browse/IMPALA-14653
> Project: IMPALA
> Issue Type: Bug
> Components: Clients
> Reporter: Pranav Yogi Lodha
> Assignee: Michael Smith
> Priority: Major
> Labels: broken-build
>
> Error Message
> query_test/test_cancellation.py:276: in test_cancel_select
> self.execute_cancel_test(vector)
> query_test/test_cancellation.py:187: in execute_cancel_test
> cancel_query_and_validate_state(
> util/cancel_util.py:195: in cancel_query_and_validate_state
> __run_cancel_query_and_validate_state(
> util/cancel_util.py:261: in __run_cancel_query_and_validate_state
> assert 'Cancelled' in str_fetch_error or \
> E AssertionError: Exception in fetch_results: 'NoneType' object has no
> attribute 'columns'
> E Traceback (most recent call last):
> E File
> "/data0/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/tests/util/cancel_util.py",
> line 133, in run
> E result = self.client.fetch(self.query, self.handle)
> E File
> "/data0/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/tests/common/impala_connection.py",
> line 1109, in fetch
> E return self.__get_operation(operation_handle).fetch(max_rows=max_rows)
> E File
> "/data/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/infra/python/env-gcc10.4.0-py3/lib/python3.8/site-packages/impala/hiveserver2.py",
> line 1507, in fetch
> E return self._wrap_results(resp.results, resp.hasMoreRows, schema,
> E File
> "/data/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/infra/python/env-gcc10.4.0-py3/lib/python3.8/site-packages/impala/hiveserver2.py",
> line 1515, in _wrap_results
> E return CBatch(results, expect_more_rows, schema,
> convert_types=convert_types,
> E File
> "/data/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/infra/python/env-gcc10.4.0-py3/lib/python3.8/site-packages/impala/hiveserver2.py",
> line 1042, in __init__
> E for (i, col) in enumerate(trowset.columns)]
> E AttributeError: 'NoneType' object has no attribute 'columns'
> Stacktrace
> query_test/test_cancellation.py:276: in test_cancel_select
> self.execute_cancel_test(vector)
> query_test/test_cancellation.py:187: in execute_cancel_test
> cancel_query_and_validate_state(
> util/cancel_util.py:195: in cancel_query_and_validate_state
> __run_cancel_query_and_validate_state(
> util/cancel_util.py:261: in __run_cancel_query_and_validate_state
> assert 'Cancelled' in str_fetch_error or \
> E AssertionError: Exception in fetch_results: 'NoneType' object has no
> attribute 'columns'
> E Traceback (most recent call last):
> E File
> "/data0/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/tests/util/cancel_util.py",
> line 133, in run
> E result = self.client.fetch(self.query, self.handle)
> E File
> "/data0/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/tests/common/impala_connection.py",
> line 1109, in fetch
> E return self.__get_operation(operation_handle).fetch(max_rows=max_rows)
> E File
> "/data/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/infra/python/env-gcc10.4.0-py3/lib/python3.8/site-packages/impala/hiveserver2.py",
> line 1507, in fetch
> E return self._wrap_results(resp.results, resp.hasMoreRows, schema,
> E File
> "/data/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/infra/python/env-gcc10.4.0-py3/lib/python3.8/site-packages/impala/hiveserver2.py",
> line 1515, in _wrap_results
> E return CBatch(results, expect_more_rows, schema,
> convert_types=convert_types,
> E File
> "/data/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/infra/python/env-gcc10.4.0-py3/lib/python3.8/site-packages/impala/hiveserver2.py",
> line 1042, in __init__
> E for (i, col) in enumerate(trowset.columns)]
> E AttributeError: 'NoneType' object has no attribute 'columns'