[
https://issues.apache.org/jira/browse/FLINK-15280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16997245#comment-16997245
]
Yun Tang commented on FLINK-15280:
----------------------------------
[https://api.travis-ci.org/v3/job/624983426/log.txt] fails due to error:
{code:java}
2019-12-14 20:58:51,531 WARN
org.apache.flink.streaming.runtime.tasks.StreamTask - Error while
canceling task.
java.lang.IllegalStateException: Released
at
org.apache.flink.runtime.io.network.partition.consumer.SingleInputGate.getNextBufferOrEvent(SingleInputGate.java:483)
at
org.apache.flink.runtime.io.network.partition.consumer.SingleInputGate.pollNext(SingleInputGate.java:474)
at
org.apache.flink.runtime.taskmanager.InputGateWithMetrics.pollNext(InputGateWithMetrics.java:75)
at
org.apache.flink.streaming.runtime.io.CheckpointedInputGate.pollNext(CheckpointedInputGate.java:125)
at
org.apache.flink.streaming.runtime.io.StreamTaskNetworkInput.emitNext(StreamTaskNetworkInput.java:133)
at
org.apache.flink.streaming.runtime.io.StreamOneInputProcessor.processInput(StreamOneInputProcessor.java:69)
at
org.apache.flink.streaming.runtime.tasks.StreamTask.processInput(StreamTask.java:311)
at
org.apache.flink.streaming.runtime.tasks.mailbox.MailboxProcessor.runMailboxLoop(MailboxProcessor.java:187)
at
org.apache.flink.streaming.runtime.tasks.StreamTask.runMailboxLoop(StreamTask.java:488)
at
org.apache.flink.streaming.runtime.tasks.StreamTask.invoke(StreamTask.java:470)
at org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:702)
at org.apache.flink.runtime.taskmanager.Task.run(Task.java:527)
at java.lang.Thread.run(Thread.java:748)
{code}
I think this exception should be considered as one of the white list.
However, another instance [https://api.travis-ci.org/v3/job/625281873/log.txt]
fails due to some unknown error which is inundated due to maximum length limit.
To avoid unknown errors, I think we should either print errors out once grepped
or avoid to cat metrics out. CC [~trohrmann] , [~liyu]
> Checkpoint end-to-end test failed
> ---------------------------------
>
> Key: FLINK-15280
> URL: https://issues.apache.org/jira/browse/FLINK-15280
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Checkpointing
> Affects Versions: 1.10.0
> Reporter: wangxiyuan
> Priority: Major
>
> The Checkpoint e2e test failed. The error log is very long. Please check the
> link to find the error message.
> Some logs in these two days:
> [https://api.travis-ci.org/v3/job/624983426/log.txt]
> Running 'Resuming Externalized Checkpoint (rocks, incremental, no parallelism
> change) end-to-end test'
>
> [https://api.travis-ci.org/v3/job/625281873/log.txt]
> Running 'Resuming Externalized Checkpoint (file, sync, scale down) end-to-end
> test'
>
> And arm as well:
> [https://logs.openlabtesting.org/logs/periodic-20-flink-mail/github.com/apache/flink/master/flink-end-to-end-test-arm64-checkpoints-and-sticky/3da575a/job-output.txt.gz]
> Running 'Resuming Externalized Checkpoint (rocks, incremental, no parallelism
> change) end-to-end test'
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)