[
https://issues.apache.org/jira/browse/IMPALA-10109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17273322#comment-17273322
]
Wenzhe Zhou commented on IMPALA-10109:
--------------------------------------
Loop to run the test case TestQueryRetries::test_retries_from_cancellation_pool
for 1000 times, but the issue could not be re-produced.
This issue happened when the retried query still failed.
Before the fixing for[
IMPALA-9636|http://issues.apache.org/jira/browse/IMPALA-9636] was merged, it's
possible to schedule the retried query on the blacklisted nodes if the
blacklisting was timeout. This cause retried query failed again. Guess this is
the root cause. This issue should be already fixed by
[IMPALA-9636.|http://issues.apache.org/jira/browse/IMPALA-9636]
> fetch fails in TestQueryRetries.test_retries_from_cancellation_pool
> -------------------------------------------------------------------
>
> Key: IMPALA-10109
> URL: https://issues.apache.org/jira/browse/IMPALA-10109
> Project: IMPALA
> Issue Type: Bug
> Components: Distributed Exec
> Reporter: Quanlong Huang
> Assignee: Wenzhe Zhou
> Priority: Major
> Labels: broken-build, flaky
> Attachments:
> impalad.ip-172-31-37-16.ubuntu.log.INFO.20200826-035614.115389,
> impalad.ip-172-31-37-16.ubuntu.log.INFO.20200826-035614.115392,
> impalad.ip-172-31-37-16.ubuntu.log.INFO.20200826-035614.115397
>
>
> Encountered a failure on an unrelated patch
> [https://jenkins.impala.io/job/ubuntu-16.04-from-scratch/11795/testReport/junit/custom_cluster.test_query_retries/TestQueryRetries/test_retries_from_cancellation_pool/]
> {code}
> custom_cluster/test_query_retries.py:83: in
> test_retries_from_cancellation_pool
> results = self.client.fetch(query, handle)
> common/impala_connection.py:254: in fetch
> sql_stmt, operation_handle.get_handle(), max_rows)
> beeswax/impala_beeswax.py:452: in fetch_results
> exec_result = self.__fetch_results(query_handle, max_rows)
> beeswax/impala_beeswax.py:463: in __fetch_results
> results = self.__do_rpc(lambda: self.imp_service.fetch(handle, False,
> fetch_rows))
> beeswax/impala_beeswax.py:520: in __do_rpc
> raise ImpalaBeeswaxException(self.__build_error_message(b), b)
> E ImpalaBeeswaxException: ImpalaBeeswaxException:
> E INNER EXCEPTION: <class 'beeswaxd.ttypes.BeeswaxException'>
> E MESSAGE: Query 24486bb27fc574a1:a00ce3ae00000000 cancelled due to
> unresponsive backend: 127.0.0.1:27001 has not sent a report in 784806ms (max
> allowed lag is 720000ms)
> {code}
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]