[ 
https://issues.apache.org/jira/browse/IMPALA-14653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18063626#comment-18063626
 ] 

Michael Smith edited comment on IMPALA-14653 at 3/6/26 6:29 PM:
----------------------------------------------------------------

{code:java}
-- 2026-03-03 23:19:26,277 INFO     FetchingThread: minimal_hs2: Start fetching 
results
-- 2026-03-03 23:19:26,278 INFO     MainThread: Could not connect to ('::1', 
21050, 0, 0)
Traceback (most recent call last):
  File 
"/data/jenkins/workspace/impala-asf-master-core-asan/repos/Impala/infra/python/env-gcc10.4.0-py3/lib/python3.11/site-packages/thrift/transport/TSocket.py",
 line 137, in open
    handle.connect(sockaddr)
ConnectionRefusedError: [Errno 111] Connection refused
-- 2026-03-03 23:19:26,290 INFO     FetchingThread: 
d249189faf721e15:0c4ee3d000000000: fetching results
-- 2026-03-03 23:19:26,317 INFO     MainThread: 
d249189faf721e15:0c4ee3d000000000: canceling operation
-- 2026-03-03 23:19:26,318 INFO     MainThread: minimal_hs2: closing connection 
to: localhost:21050
-- 2026-03-03 23:19:26,318 INFO     MainThread: Could not connect to ('::1', 
21050, 0, 0)
Traceback (most recent call last):
  File 
"/data/jenkins/workspace/impala-asf-master-core-asan/repos/Impala/infra/python/env-gcc10.4.0-py3/lib/python3.11/site-packages/thrift/transport/TSocket.py",
 line 137, in open
    handle.connect(sockaddr)
ConnectionRefusedError: [Errno 111] Connection refused
-- 2026-03-03 23:19:26,389 INFO     MainThread: 
d249189faf721e15:0c4ee3d000000000: closing query for operation
-- 2026-03-03 23:19:36,330 INFO     FetchingThread: minimal_hs2: Stop fetching 
results{code}
happens around 
[https://github.com/apache/impala/blob/master/tests/util/cancel_util.py#L129-L142.]

It's not obvious impyla should get the response it's getting. Maybe this is 
happening when the connection is closed in the middle of a fetch?

This case is
{code:java}
query_test.test_cancellation.TestCancellationParallel.test_cancel_select["table_format:
 text/none" | "protocol: hs2" | "query: select l_returnflag from lineitem" | 
"query_type: SELECT" | "cancel_delay: 0" | "wait_action: None" | 
"fail_rpc_action: COORD_CANCEL_QUERY_FINSTANCES_RPC:FAIL" | "join_before_close: 
False" | "buffer_pool_limit: 0" | "cpu_limit_s: 100000" | "mt_dop: 0" | 
"use_kill_query_statement: False" | "num_cancellation_iterations: 1" | 
"exec_option: {'abort_on_error': 1, 'batch_size': 0, 'disable_codegen': False, 
'disable_codegen_rows_threshold': 0, 'exec_single_node_rows_threshold': 0, 
'num_nodes': 0, 'test_replan': 1}"] {code}
so join_before_close=False. But "closing query for operation" happens after the 
"Could not connect". And then we wait 10s for the FetchThread to join. So maybe 
this is a path we can hit when fetching on a cancelled query.


was (Author: JIRAUSER288956):
{code:java}
-- 2026-03-03 23:19:26,277 INFO     FetchingThread: minimal_hs2: Start fetching 
results
-- 2026-03-03 23:19:26,278 INFO     MainThread: Could not connect to ('::1', 
21050, 0, 0)
Traceback (most recent call last):
  File 
"/data/jenkins/workspace/impala-asf-master-core-asan/repos/Impala/infra/python/env-gcc10.4.0-py3/lib/python3.11/site-packages/thrift/transport/TSocket.py",
 line 137, in open
    handle.connect(sockaddr)
ConnectionRefusedError: [Errno 111] Connection refused
-- 2026-03-03 23:19:26,290 INFO     FetchingThread: 
d249189faf721e15:0c4ee3d000000000: fetching results
-- 2026-03-03 23:19:26,317 INFO     MainThread: 
d249189faf721e15:0c4ee3d000000000: canceling operation
-- 2026-03-03 23:19:26,318 INFO     MainThread: minimal_hs2: closing connection 
to: localhost:21050
-- 2026-03-03 23:19:26,318 INFO     MainThread: Could not connect to ('::1', 
21050, 0, 0)
Traceback (most recent call last):
  File 
"/data/jenkins/workspace/impala-asf-master-core-asan/repos/Impala/infra/python/env-gcc10.4.0-py3/lib/python3.11/site-packages/thrift/transport/TSocket.py",
 line 137, in open
    handle.connect(sockaddr)
ConnectionRefusedError: [Errno 111] Connection refused
-- 2026-03-03 23:19:26,389 INFO     MainThread: 
d249189faf721e15:0c4ee3d000000000: closing query for operation
-- 2026-03-03 23:19:36,330 INFO     FetchingThread: minimal_hs2: Stop fetching 
results{code}
happens around 
[https://github.com/apache/impala/blob/master/tests/util/cancel_util.py#L129-L142.]

It's not obvious impyla should get the response it's getting. Maybe this is 
happening when the connection is closed in the middle of a fetch?

> test_cancel_select is failing with AttributeError
> -------------------------------------------------
>
>                 Key: IMPALA-14653
>                 URL: https://issues.apache.org/jira/browse/IMPALA-14653
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Clients
>            Reporter: Pranav Yogi Lodha
>            Assignee: Michael Smith
>            Priority: Major
>              Labels: broken-build
>
> Error Message
> query_test/test_cancellation.py:276: in test_cancel_select     
> self.execute_cancel_test(vector) query_test/test_cancellation.py:187: in 
> execute_cancel_test     cancel_query_and_validate_state( 
> util/cancel_util.py:195: in cancel_query_and_validate_state     
> __run_cancel_query_and_validate_state( util/cancel_util.py:261: in 
> __run_cancel_query_and_validate_state     assert 'Cancelled' in 
> str_fetch_error or \ E   AssertionError: Exception in fetch_results: 
> 'NoneType' object has no attribute 'columns' E   Traceback (most recent call 
> last): E     File 
> "/data0/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/tests/util/cancel_util.py",
>  line 133, in run E       result = self.client.fetch(self.query, self.handle) 
> E     File 
> "/data0/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/tests/common/impala_connection.py",
>  line 1109, in fetch E       return 
> self.__get_operation(operation_handle).fetch(max_rows=max_rows) E     File 
> "/data/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/infra/python/env-gcc10.4.0-py3/lib/python3.8/site-packages/impala/hiveserver2.py",
>  line 1507, in fetch E       return self._wrap_results(resp.results, 
> resp.hasMoreRows, schema, E     File 
> "/data/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/infra/python/env-gcc10.4.0-py3/lib/python3.8/site-packages/impala/hiveserver2.py",
>  line 1515, in _wrap_results E       return CBatch(results, expect_more_rows, 
> schema, convert_types=convert_types, E     File 
> "/data/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/infra/python/env-gcc10.4.0-py3/lib/python3.8/site-packages/impala/hiveserver2.py",
>  line 1042, in __init__ E       for (i, col) in enumerate(trowset.columns)] E 
>   AttributeError: 'NoneType' object has no attribute 'columns'
> Stacktrace
> query_test/test_cancellation.py:276: in test_cancel_select
>     self.execute_cancel_test(vector)
> query_test/test_cancellation.py:187: in execute_cancel_test
>     cancel_query_and_validate_state(
> util/cancel_util.py:195: in cancel_query_and_validate_state
>     __run_cancel_query_and_validate_state(
> util/cancel_util.py:261: in __run_cancel_query_and_validate_state
>     assert 'Cancelled' in str_fetch_error or \
> E   AssertionError: Exception in fetch_results: 'NoneType' object has no 
> attribute 'columns'
> E   Traceback (most recent call last):
> E     File 
> "/data0/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/tests/util/cancel_util.py",
>  line 133, in run
> E       result = self.client.fetch(self.query, self.handle)
> E     File 
> "/data0/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/tests/common/impala_connection.py",
>  line 1109, in fetch
> E       return self.__get_operation(operation_handle).fetch(max_rows=max_rows)
> E     File 
> "/data/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/infra/python/env-gcc10.4.0-py3/lib/python3.8/site-packages/impala/hiveserver2.py",
>  line 1507, in fetch
> E       return self._wrap_results(resp.results, resp.hasMoreRows, schema,
> E     File 
> "/data/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/infra/python/env-gcc10.4.0-py3/lib/python3.8/site-packages/impala/hiveserver2.py",
>  line 1515, in _wrap_results
> E       return CBatch(results, expect_more_rows, schema, 
> convert_types=convert_types,
> E     File 
> "/data/jenkins/workspace/impala-cdw-master-staging-core-ubsan-arm/repos/Impala/infra/python/env-gcc10.4.0-py3/lib/python3.8/site-packages/impala/hiveserver2.py",
>  line 1042, in __init__
> E       for (i, col) in enumerate(trowset.columns)]
> E   AttributeError: 'NoneType' object has no attribute 'columns'



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to