[ 
https://issues.apache.org/jira/browse/FLINK-32785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17759381#comment-17759381
 ] 

Sergey Nuyanzin edited comment on FLINK-32785 at 8/27/23 5:20 PM:
------------------------------------------------------------------

Checked plan in 1.18: it contains {{TTL}} as mentioned above.
Compiled in 1.17 and then executed in 1.18: the job was successfully running in 
1.18

However there is an issue that after about 1 min 30 sec the job fails with 
{noformat}
Caused by: java.util.concurrent.TimeoutException: Heartbeat of TaskManager with 
id localhost:45987-ac10bb timed out.
        ... 30 more
{noformat}

I checked it with 1.17.1, 1.16 and it is reproduced there as well, so probably 
it is not related to current change

UPD: probably the reason is not enough resources for tasks
in 1.16 it fails as
{noformat}
Caused by: java.util.concurrent.CompletionException: 
java.util.concurrent.CompletionException: 
org.apache.flink.runtime.jobmanager.scheduler.NoResourceAvailableException: 
Could not acquire the minimum required resources.
        at 
org.apache.flink.runtime.scheduler.DefaultExecutionDeployer.lambda$assignResource$4(DefaultExecutionDeployer.java:227)
        ... 39 more
Caused by: java.util.concurrent.CompletionException: 
org.apache.flink.runtime.jobmanager.scheduler.NoResourceAvailableException: 
Could not acquire the minimum required resources.
        at 
java.util.concurrent.CompletableFuture.encodeThrowable(CompletableFuture.java:292)
        at 
java.util.concurrent.CompletableFuture.completeThrowable(CompletableFuture.java:308)
        at 
java.util.concurrent.CompletableFuture.uniApply(CompletableFuture.java:607)
        at 
java.util.concurrent.CompletableFuture$UniApply.tryFire(CompletableFuture.java:591)
        ... 37 more
Caused by: 
org.apache.flink.runtime.jobmanager.scheduler.NoResourceAvailableException: 
Could not acquire the minimum required resources.
{noformat}


was (Author: sergey nuyanzin):
Checked plan in 1.18: it contains {{TTL}} as mentioned above.
Compiled in 1.17 and then executed in 1.18: the job was successfully running in 
1.18

However there is an issue that after about 1 min 30 sec the job fails with 
{noformat}
Caused by: java.util.concurrent.TimeoutException: Heartbeat of TaskManager with 
id localhost:45987-ac10bb timed out.
        ... 30 more
{noformat}

I checked it with 1.17.1, 1.16 and it is reproduced there as well, so probably 
it is not related to current change

UPD: probably the reason is not enough resources for tasks

> Release Testing: Verify FLIP-292: Enhance COMPILED PLAN to support 
> operator-level state TTL configuration
> ---------------------------------------------------------------------------------------------------------
>
>                 Key: FLINK-32785
>                 URL: https://issues.apache.org/jira/browse/FLINK-32785
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Tests
>    Affects Versions: 1.18.0
>            Reporter: Qingsheng Ren
>            Assignee: Sergey Nuyanzin
>            Priority: Major
>             Fix For: 1.18.0
>
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to