[GitHub] [flink] flinkbot edited a comment on pull request #13458: FLINK-18851 [runtime] Add checkpoint type to checkpoint history entries in Web UI
flinkbot edited a comment on pull request #13458: URL: https://github.com/apache/flink/pull/13458#issuecomment-697041219 ## CI report: * 3a9c127acf7d5615f5a32c7af35f8611b93cd6ab Azure: [SUCCESS](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7216) Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] flinkbot edited a comment on pull request #13542: [FLINK-19501][runtime-web] Missing state in enum type in rest_api_v1.…
flinkbot edited a comment on pull request #13542: URL: https://github.com/apache/flink/pull/13542#issuecomment-703993218 ## CI report: * 8666e10ce4b127d2a81c437c2540b5338f818a8b Azure: [PENDING](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7218) * 7dea0131bfb5ce7eaab4ba7e0b1e198c944920cd UNKNOWN * a4019c50ca9bdea3baa0f3e0b10efd9d21b51877 Azure: [PENDING](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7219) Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] flinkbot edited a comment on pull request #13542: [FLINK-19501][runtime-web] Missing state in enum type in rest_api_v1.…
flinkbot edited a comment on pull request #13542: URL: https://github.com/apache/flink/pull/13542#issuecomment-703993218 ## CI report: * 8666e10ce4b127d2a81c437c2540b5338f818a8b Azure: [PENDING](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7218) * 7dea0131bfb5ce7eaab4ba7e0b1e198c944920cd UNKNOWN * a4019c50ca9bdea3baa0f3e0b10efd9d21b51877 UNKNOWN Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] flinkbot edited a comment on pull request #13542: [FLINK-19501][runtime-web] Missing state in enum type in rest_api_v1.…
flinkbot edited a comment on pull request #13542: URL: https://github.com/apache/flink/pull/13542#issuecomment-703993218 ## CI report: * 8666e10ce4b127d2a81c437c2540b5338f818a8b Azure: [PENDING](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7218) * 7dea0131bfb5ce7eaab4ba7e0b1e198c944920cd UNKNOWN Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] carp84 commented on pull request #13393: [FLINK-19238] [RocksDB] Sanity check for arena block size
carp84 commented on pull request #13393: URL: https://github.com/apache/flink/pull/13393#issuecomment-704020362 > Hey. I'm going to lose access to the GitHub account soon, and not be able to modify the branch anymore. If changes are needed before merge, I guess I could create a new PR using another GH account. @juha-mynttinen-king sorry for the late reply due to the National Day Holiday here, will check the updated changes asap. About the GH account, yes it's ok to create another PR and link it with this one, while I suggest to use the same account as long as possible in the future (if it's impossible with the existing one) so it could better reflect your coding career (smile). This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] gm7y8 commented on pull request #13458: FLINK-18851 [runtime] Add checkpoint type to checkpoint history entries in Web UI
gm7y8 commented on pull request #13458: URL: https://github.com/apache/flink/pull/13458#issuecomment-704014916 > Thanks for this improvement @gm7y8 > Overall, it looks good to me. > > @XComp @vthinkxie Did someone check that the change works in UI? > > I left comments about changes which are probably unrelated. > After addressing these comments/question, I can merge the PR. > > Also, not sure what the first 'merge' commit means, I can remove it before merging. First Merge commit is created when I synced fork with the main repo. This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||StateBackEnd||result|| |Normal Checkpoint|FsStateBackend|succeed| |Incremental Checkpoint|RocksDBStateBackend|failed| ||My Other Effort||Content||Result|| |①|*I {color:#de350b}hava subscribed to mailing list{color}*|*David Anderson has answered this several days ago. But he did not tell me how I can fix it.* *I guess he's very busy.* |②|{color:#de350b}stackoverflow account{color}|it is forbidden to ask questions. |③|alibaba {color:#de350b}Flink dingtalk group{color}|no one can answer this question |④|I have read [document|https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html] carefully|I can NOT succeed in it| |⑤|apologise for post it here|how can I make sure this question is a support instead of a bug or due to my own mistake? no successful example in Google/Baidu,am I right?| |⑥|about less than 1000 people visit my [failure record|https://yuchi.blog.csdn.net/article/details/108896441] about this experiment.|in vain| *Please help,thanks~* -original posting content- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps for incremental experiment are: ||step||content|| |①|mvn clean scala:compile compile package| |②|nc -lk | |③|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b| |④|input the following conents in nc -lk before error error error error |⑤|flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||StateBackEnd||result|| |Normal Checkpoint|FsStateBackend|succeed| |Incremental Checkpoint|RocksDBStateBackend|failed| ||My Other Effort||Content||Result|| |①|*I {color:#de350b}hava subscribed to mailing list{color}*|*David Anderson has answered this several days ago. But he did not tell me how I can fix it.* *I guess he's very busy.* |②|{color:#de350b}stackoverflow account{color}|it is forbidden to ask questions. |③|alibaba {color:#de350b}Flink dingtalk group{color}|no one can answer this question |④|I have read [document|https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html] carefully|I can NOT succeed in it| |⑤|apologise for post it here|how can I make sure this question is a support instead of a bug or due to my own mistake? no successful example in Google/Baidu,am I right?| |⑥|about less than 1000 people visit my [failure record|https://yuchi.blog.csdn.net/article/details/108896441] about this experiment.|in vain| *Please help,thanks~* --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |①|mvn clean scala:compile compile package| |②|nc -lk | |③|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b| |④|input the following conents in nc -lk before error error error error |⑤|flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! > expected: class
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||StateBackEnd||result|| |Normal Checkpoint|FsStateBackend|succeed| |Incremental Checkpoint|RocksDBStateBackend|failed| ||My Other Effort||Content||Result|| |①|*I {color:#de350b}hava subscribed to mailing list{color}*|*David Anderson has answered this several days ago. But he did not tell me how I can fix it.* *I guess he's very busy.* |②|{color:#de350b}stackoverflow account{color}|it is forbidden to ask questions. |③|alibaba {color:#de350b}Flink dingtalk group{color}|no one can answer this question |④|I have read [document|https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html] carefully|I can NOT succeed in it| |⑤|apologise for post it here|how can I make sure this question is a support instead of a bug or due to my own mistake? no successful example in Google/Baidu,am I right?| |⑥|about less than 1000 people visit my [failure record|https://yuchi.blog.csdn.net/article/details/108896441] about this experiment.|in vain| *Please help,thanks~* --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |①|mvn clean scala:compile compile package| |②|nc -lk | |③|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b| |④|input the following conents in nc -lk before error error error error |⑤|flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||StateBackEnd||result|| |Normal Checkpoint|FsStateBackend|succeed| |Incremental Checkpoint|RocksDBStateBackend|failed| ||My Other Effort||Content||Result|| |①|*I {color:#de350b}hava subscribed to mailing list{color}*|*David Anderson haved answered this several days ago. But he did not tell me how I can fix it.* *I guess he's very busy.* |②|{color:#de350b}stackoverflow account{color}|it is forbidden to ask questions. |③|alibaba {color:#de350b}Flink dingtalk group{color}|no one can answer this question |④|I have read [document|https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html] carefully|I can NOT succeed in it| |⑤|apologise for post it here|how can I make sure this question is a support instead of a bug or due to my own mistake? no successful example in Google/Baidu,am I right?| |⑥|about less than 1000 people visit my [failure record|https://yuchi.blog.csdn.net/article/details/108896441] about this experiment.|in vain| *Please help,thanks~* --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |①|mvn clean scala:compile compile package| |②|nc -lk | |③|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b| |④|input the following conents in nc -lk before error error error error |⑤|flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! > expected: class
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Fix Version/s: 1.11.1 > expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but > found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle > -- > > Key: FLINK-19486 > URL: https://issues.apache.org/jira/browse/FLINK-19486 > Project: Flink > Issue Type: Bug >Affects Versions: 1.11.1 >Reporter: appleyuchi >Priority: Major > Fix For: 1.11.1 > > Attachments: pom.xml > > > *The reason why I reopen it is that:* > > *I can {color:#de350b}not{color} {color:#de350b}resume from incremental > checkpoint manually{color} when I stop the following program by force(maybe > NOT supported officially?).* > ||My Experience||StateBackEnd||result|| > |Normal Checkpoint|FsStateBackend|succeed| > |Incremental Checkpoint|RocksDBStateBackend|failed| > ||My Other Effort||Content||Result|| > |①|*I {color:#de350b}hava subscribed to mailing list{color}*|*David Anderson > haved answered this several days ago. But he did not tell me how I can fix > it.* > *I guess he's very busy.* > |②|{color:#de350b}stackoverflow account{color}|it is forbidden to ask > questions. > |③|alibaba {color:#de350b}Flink dingtalk group{color}|no one can answer this > question > |④|I have read > [document|https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html] > carefully|I can NOT succeed in it| > |⑤|apologise for post it here|how can I make sure this question is a support > instead of a bug or due to my own mistake? > no successful example in Google/Baidu,am I right?| > |⑥|about less than 1000 people visit my [failure > record|https://yuchi.blog.csdn.net/article/details/108896441] about this > experiment.|in vain| > *Please help,thanks~* > > --original posting content-- > > *I want to do an experiment of"incremental checkpoint"* > > ||information||content|| > |pom.xml|[ is in the attachment] > |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] > |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] > > *some of the above error is:* > *Caused by: java.lang.IllegalStateException: Unexpected state handle type, > expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but > found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* > > > The steps are: > ||step||content|| > |①|mvn clean scala:compile compile package| > |②|nc -lk | > |③|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar > Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b| > |④|input the following conents in nc -lk > before > error > error > error > error > |⑤|flink run -s > hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c > StateWordCount datastream_api-1.0-SNAPSHOT.jar > Then the above error happens. > Please help,Thanks~! > -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Priority: Major (was: Minor) > expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but > found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle > -- > > Key: FLINK-19486 > URL: https://issues.apache.org/jira/browse/FLINK-19486 > Project: Flink > Issue Type: Bug >Affects Versions: 1.11.1 >Reporter: appleyuchi >Priority: Major > Attachments: pom.xml > > > *The reason why I reopen it is that:* > > *I can {color:#de350b}not{color} {color:#de350b}resume from incremental > checkpoint manually{color} when I stop the following program by force(maybe > NOT supported officially?).* > ||My Experience||StateBackEnd||result|| > |Normal Checkpoint|FsStateBackend|succeed| > |Incremental Checkpoint|RocksDBStateBackend|failed| > ||My Other Effort||Content||Result|| > |①|*I {color:#de350b}hava subscribed to mailing list{color}*|*David Anderson > haved answered this several days ago. But he did not tell me how I can fix > it.* > *I guess he's very busy.* > |②|{color:#de350b}stackoverflow account{color}|it is forbidden to ask > questions. > |③|alibaba {color:#de350b}Flink dingtalk group{color}|no one can answer this > question > |④|I have read > [document|https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html] > carefully|I can NOT succeed in it| > |⑤|apologise for post it here|how can I make sure this question is a support > instead of a bug or due to my own mistake? > no successful example in Google/Baidu,am I right?| > |⑥|about less than 1000 people visit my [failure > record|https://yuchi.blog.csdn.net/article/details/108896441] about this > experiment.|in vain| > *Please help,thanks~* > > --original posting content-- > > *I want to do an experiment of"incremental checkpoint"* > > ||information||content|| > |pom.xml|[ is in the attachment] > |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] > |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] > > *some of the above error is:* > *Caused by: java.lang.IllegalStateException: Unexpected state handle type, > expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but > found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* > > > The steps are: > ||step||content|| > |①|mvn clean scala:compile compile package| > |②|nc -lk | > |③|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar > Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b| > |④|input the following conents in nc -lk > before > error > error > error > error > |⑤|flink run -s > hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c > StateWordCount datastream_api-1.0-SNAPSHOT.jar > Then the above error happens. > Please help,Thanks~! > -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||StateBackEnd||result|| |Normal Checkpoint|FsStateBackend|succeed| |Incremental Checkpoint|RocksDBStateBackend|failed| ||My Other Effort||Content||Result|| |①|*I {color:#de350b}hava subscribed to mailing list{color}*|*David Anderson haved answered this several days ago. But he did not tell me how I can fix it.* *I guess he's very busy.* |②|{color:#de350b}stackoverflow account{color}|it is forbidden to ask questions. |③|alibaba {color:#de350b}Flink dingtalk group{color}|no one can answer this question |④|I have read [document|https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html] carefully|I can NOT succeed in it| |⑤|apologise for post it here|how can I make sure this question is a support instead of a bug or due to my own mistake? no successful example in Google/Baidu,am I right?| |⑥|about less than 1000 people visit my [failure record|https://yuchi.blog.csdn.net/article/details/108896441] about this experiment.|in vain| *Please help,thanks~* --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |①|mvn clean scala:compile compile package| |②|nc -lk | |③|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b| |④|input the following conents in nc -lk before error error error error |⑤|flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||StateBackEnd||result|| |Normal Checkpoint|FsStateBackend|succeed| |Incremental Checkpoint|RocksDBStateBackend|failed| ||My Other Effort||Content||Result|| |①|*I {color:#de350b}hava subscribed to mailing list{color}*|*David Anderson haved answered this several days ago. But he did not tell me how I can fix it.* *I guess he's very busy.* |②|{color:#de350b}stackoverflow account{color}|it is forbidden to ask questions. |③|alibaba {color:#de350b}Flink dingtalk group{color}|no one can answer this question |④|I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully|I can NOT succeed in it| |⑤|apologise for post it here|how can I make sure this question is a support instead of a bug or due to my own mistake? no successful example in Google/Baidu,am I right?| |⑥|about less than 1000 people visit my [failure record|https://yuchi.blog.csdn.net/article/details/108896441] about this experiment.|in vain| *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |①|mvn clean scala:compile compile package| |②|nc -lk | |③|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||StateBackEnd||result|| |Normal Checkpoint|FsStateBackend|succeed| |Incremental Checkpoint|RocksDBStateBackend|failed| ||My Other Effort||Content||Result|| |①|*I {color:#de350b}hava subscribed to mailing list{color}*|*David Anderson haved answered this several days ago. But he did not tell me how I can fix it.* *I guess he's very busy.* |②|{color:#de350b}stackoverflow account{color}|it is forbidden to ask questions. |③|alibaba {color:#de350b}Flink dingtalk group{color}|no one can answer this question |④|I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully|I can NOT succeed in it| |⑤|apologise for post it here|how can I make sure this question is a support instead of a bug or due to my own mistake? no successful example in Google/Baidu,am I right?| |⑥|about less than 1000 people visit my [failure record|https://yuchi.blog.csdn.net/article/details/108896441] about this experiment.|in vain| *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |①|mvn clean scala:compile compile package| |②|nc -lk | |③|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b| |④|input the following conents in nc -lk before error error error error |⑤|flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||StateBackEnd||result|| |Normal Checkpoint|FsStateBackend|succeed| |Incremental Checkpoint|RocksDBStateBackend|failed| ||My Other Effort||Content||Result|| |①|*I {color:#de350b}hava subscribed to mailing list{color}*|*David Anderson haved answered this several days ago. But he did not tell me how I can fix it.* *I guess he's very busy.* |②|{color:#de350b}stackoverflow account{color}|it is forbidden to ask questions. |③|alibaba {color:#de350b}Flink dingtalk group{color}|no one can answer this question *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by:
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||StateBackEnd||result|| |Normal Checkpoint|FsStateBackend|succeed| |Incremental Checkpoint|RocksDBStateBackend|failed| ||My Other Effort||Content||Result|| |①|*I {color:#de350b}hava subscribed to mailing list{color}*|*David Anderson haved answered this several days ago. But he did not tell me how I can fix it.* *I guess he's very busy.* |②|{color:#de350b}stackoverflow account{color}|it is forbidden to ask questions. |③|alibaba {color:#de350b}Flink dingtalk group{color}|no one can answer this question *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |①|mvn clean scala:compile compile package| |②|nc -lk | |③|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b| |④|input the following conents in nc -lk before error error error error |⑤|flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||StateBackEnd||result|| |Normal Checkpoint|FsStateBackend|succeed| |Incremental Checkpoint|RocksDBStateBackend|failed| ||My Other Effort||Content||Result|| |①|*I {color:#de350b}hava subscribed to mailing list{color}*|*David Anderson haved answered this several days ago. But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/]
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||StateBackEnd||result|| |Normal Checkpoint|FsStateBackend|succeed| |Incremental Checkpoint|RocksDBStateBackend|failed| ||My Other Effort||Content||Result|| |①|*I {color:#de350b}hava subscribed to mailing list{color}*|*David Anderson haved answered this several days ago. But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |①|mvn clean scala:compile compile package| |②|nc -lk | |③|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b| |④|input the following conents in nc -lk before error error error error |⑤|flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||StateBackEnd||result|| |Normal Checkpoint|FsStateBackend|succeed| |Incremental Checkpoint|RocksDBStateBackend|failed| ||My Other Effort||Result|| |*①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.*|列 A2|*But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/]
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||StateBackEnd||result|| |Normal Checkpoint|FsStateBackend|succeed| |Incremental Checkpoint|RocksDBStateBackend|failed| ||My Other Effort||Result|| |*①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.*|列 A2|*But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |①|mvn clean scala:compile compile package| |②|nc -lk | |③|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b| |④|input the following conents in nc -lk before error error error error |⑤|flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||StateBackEnd||result|| |Normal Checkpoint|FsStateBackend|succeed| |Incremental Checkpoint|RocksDBStateBackend|failed| ||My Other Effort||Result|| |列 A1|列 A2| *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/]
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||StateBackEnd||result|| |Normal Checkpoint|FsStateBackend|succeed| |Incremental Checkpoint|RocksDBStateBackend|failed| ||My Other Effort||Result|| |列 A1|列 A2| *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |①|mvn clean scala:compile compile package| |②|nc -lk | |③|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b| |④|input the following conents in nc -lk before error error error error |⑤|flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||StateBackEnd||result|| |Normal Checkpoint|FsStateBackend|succeed| |Incremental Checkpoint|rRocksDBStateBackend|failed| *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||StateBackEnd||result|| |Normal Checkpoint|FsStateBackend|succeed| |Incremental Checkpoint|rRocksDBStateBackend|failed| *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |①|mvn clean scala:compile compile package| |②|nc -lk | |③|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b| |④|input the following conents in nc -lk before error error error error |⑤|flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||Content||result|| |Normal Checkpoint|succeed| |Incremental Checkpoint|failed| *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* ||My Experience||Content||result|| |Normal Checkpoint|succeed| |Incremental Checkpoint|failed| *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |①|mvn clean scala:compile compile package| |②|nc -lk | |③|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b| |④|input the following conents in nc -lk before error error error error |⑤|flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |①|mvn clean scala:compile compile package| |②|nc -lk | |③|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b| |④|input the following conents in nc -lk before error error error error |⑤|flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |①|mvn clean scala:compile compile
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |①|mvn clean scala:compile compile package| |②|nc -lk | |③|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b| |④|input the following conents in nc -lk before error error error error 5. flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |1|mvn clean scala:compile
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |1|mvn clean scala:compile compile package| |2|nc -lk | |3|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b| 4.input the following conents in nc -lk before error error error error 5. flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |1|mvn clean scala:compile
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: ||step||content|| |1|mvn clean scala:compile compile package| |2|nc -lk | |3|flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jarJob has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b| 4.input the following conents in nc -lk before error error error error 5. flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* ||information||content|| |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] |pom.xml|[ is in the attachment] |code| [https://paste.ubuntu.com/p/DpTyQKq6Vk/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk 3.flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b 4.input the following conents in nc -lk before error error error error 5. flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] ||information||content|| |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] |pom.xml|[ is in the attachment] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] ||information||content|| |error|[https://paste.ubuntu.com/p/49HRYXFzR2/] |pom.xml|[ is in the attachment] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk 3.flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b 4.input the following conents in nc -lk before error error error error 5. flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is in the attachment the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* ||标题 1||标题 2|| |列 A1|列 A2| The steps are: 1.mvn clean scala:compile compile package 2.nc
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is in the attachment the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk 3.flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b 4.input the following conents in nc -lk before error error error error 5. flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html]) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is in the attachment the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk 3.flink run -c
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is in the attachment the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* ||标题 1||标题 2|| |列 A1|列 A2| The steps are: 1.mvn clean scala:compile compile package 2.nc -lk 3.flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b 4.input the following conents in nc -lk before error error error error 5. flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is in the attachment the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document](https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html]) carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is in the attachment the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk 3.flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b 4.input the following conents in nc -lk before error error error error 5. flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document|[https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html]] carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is in the attachment the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk 3.flink run -c
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from incremental checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document|[https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html]] carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with* {color:#de350b} *incremental* {color}*{color:#de350b}checkpoint{color} and then {color:#de350b}resume from{color}* {color:#de350b} *incremental* *checkpoint manually*{color} --original posting content-- *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is in the attachment the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk 3.flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b 4.input the following conents in nc -lk before error error error error 5. flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document|[https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html]] carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with checkpoint and then resume from checkpoint manually* *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is in the attachment the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk 3.flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b 4.input the following conents in nc -lk before error error error error 5. flink run -s
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document|[https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html]] carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with checkpoint and then resume from checkpoint manually* *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is in the attachment the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk 3.flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b 4.input the following conents in nc -lk before error error error error 5. flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document|[https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html]] carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with checkpoint and then resume from checkpoint manually* *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is: http://maven.apache.org/POM/4.0.0; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance; xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 [http://maven.apache.org/xsd/maven-4.0.0.xsd];> 4.0.0 example datastream_api 1.0-SNAPSHOT org.apache.maven.plugins maven-compiler-plugin 3.1 1.8 1.8 org.scala-tools maven-scala-plugin 2.15.2 compile testCompile org.apache.flink flink-streaming-scala_2.11 1.11.1 org.apache.flink flink-clients_2.11 1.11.1 org.apache.flink flink-statebackend-rocksdb_2.11 1.11.2 org.apache.hadoop hadoop-client 3.3.0 org.apache.flink flink-core 1.11.1 org.apache.flink flink-cep_2.11 1.11.1 org.apache.flink flink-cep-scala_2.11 1.11.1 org.apache.flink flink-scala_2.11 1.11.1 org.projectlombok lombok 1.18.4 the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Attachment: pom.xml > expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but > found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle > -- > > Key: FLINK-19486 > URL: https://issues.apache.org/jira/browse/FLINK-19486 > Project: Flink > Issue Type: Bug >Affects Versions: 1.11.1 >Reporter: appleyuchi >Priority: Minor > Attachments: pom.xml > > > *The reason why I reopen it is that:* > > *I can {color:#de350b}not{color} {color:#de350b}resume from checkpoint > manually{color} when I stop the following program by force(maybe NOT > supported officially?).* > *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson > haved answered this several days ago.* > *But he did not tell me how I can fix it.* > *I guess he's very busy.* > *②My {color:#de350b}stackoverflow account{color} is forbidden to ask > questions.* > *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer > this question.* > *④I have read > [document|[https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html]] > carefully,but I can NOT succeed in it.* > *⑤{color:#de350b}apologise for post it here{color},but how can I make sure > this question is a support instead of a bug or due to my own mistake?* > *no successful example in Google/Baidu,am I right?* > *⑥{color:#de350b}about less than 1000 people visit my [failure record > |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this > experiment.* > *Please help,thanks~* > > *The keypoint is:* > *①Of course I know checkpoint is for automatically resuming,it's basics.* > *②I want to stop the program by force(simulate Product Environment)with > checkpoint and then resume from checkpoint manually* > > > *I want to do an experiment of"incremental checkpoint"* > my code is: > [https://paste.ubuntu.com/p/DpTyQKq6Vk/] > > pom.xml is: > > http://maven.apache.org/POM/4.0.0; > xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance; > xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 > [http://maven.apache.org/xsd/maven-4.0.0.xsd];> > 4.0.0 > example > datastream_api > 1.0-SNAPSHOT > > > > org.apache.maven.plugins > maven-compiler-plugin > 3.1 > > 1.8 > 1.8 > > > > org.scala-tools > maven-scala-plugin > 2.15.2 > > > > compile > testCompile > > > > > > > > > > > org.apache.flink > flink-streaming-scala_2.11 > 1.11.1 > > > > > > > > > > org.apache.flink > flink-clients_2.11 > 1.11.1 > > > > org.apache.flink > flink-statebackend-rocksdb_2.11 > 1.11.2 > > > > org.apache.hadoop > hadoop-client > 3.3.0 > > > org.apache.flink > flink-core > 1.11.1 > > > > > > > > > > org.apache.flink > flink-cep_2.11 > 1.11.1 > > > org.apache.flink > flink-cep-scala_2.11 > 1.11.1 > > > org.apache.flink > flink-scala_2.11 > 1.11.1 > > > > org.projectlombok > lombok > 1.18.4 > > > > > > the error I got is: > [https://paste.ubuntu.com/p/49HRYXFzR2/] > > *some of the above error is:* > *Caused by: java.lang.IllegalStateException: Unexpected state handle type, > expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but > found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* > > > The steps are: > 1.mvn clean scala:compile compile package > 2.nc -lk > 3.flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar > Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b > 4.input the following conents in nc -lk > before > error > error > error > error > 5. > flink run -s > hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c > StateWordCount datastream_api-1.0-SNAPSHOT.jar > Then the above error happens. > > Please help,Thanks~! > -- This message was sent by Atlassian Jira (v8.3.4#803005)
[GitHub] [flink-statefun] tzulitai commented on pull request #131: [FLINK-18968] Translate README.md to Chinese
tzulitai commented on pull request #131: URL: https://github.com/apache/flink-statefun/pull/131#issuecomment-703998886 @carp84 @klion26 Thanks for drafting up the translation specifications and driving this! I took a look, and overall it looks good to me. Yes, I think we are ready to propose a discussion to the ML lists now. This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document|[https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html]] carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post it here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *⑥{color:#de350b}about less than 1000 people visit my [failure record |[https://yuchi.blog.csdn.net/article/details/108896441]]{color}about this experiment.* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with checkpoint and then resume from checkpoint manually* *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is: http://maven.apache.org/POM/4.0.0; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance; xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 [http://maven.apache.org/xsd/maven-4.0.0.xsd];> 4.0.0 example datastream_api 1.0-SNAPSHOT org.apache.maven.plugins maven-compiler-plugin 3.1 1.8 1.8 org.scala-tools maven-scala-plugin 2.15.2 compile testCompile org.apache.flink flink-streaming-scala_2.11 1.11.1 org.apache.flink flink-clients_2.11 1.11.1 org.apache.flink flink-statebackend-rocksdb_2.11 1.11.2 org.apache.hadoop hadoop-client 3.3.0 org.apache.flink flink-core 1.11.1 org.apache.flink flink-cep_2.11 1.11.1 org.apache.flink flink-cep-scala_2.11 1.11.1 org.apache.flink flink-scala_2.11 1.11.1 org.projectlombok lombok 1.18.4 the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk 3.flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b 4.input the following conents in nc -lk before error error error error 5. flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document|[https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html]] carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with checkpoint and then resume from checkpoint manually* *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is: http://maven.apache.org/POM/4.0.0; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance; xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 [http://maven.apache.org/xsd/maven-4.0.0.xsd];> 4.0.0 example datastream_api 1.0-SNAPSHOT
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document|[https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html]] carefully,but I can NOT succeed in it.* *⑤{color:#de350b}apologise for post here{color},but how can I make sure this question is a support instead of a bug or due to my own mistake?* *no successful example in Google/Baidu,am I right?* *Please help,thanks~* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with checkpoint and then resume from checkpoint manually* *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is: http://maven.apache.org/POM/4.0.0; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance; xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 [http://maven.apache.org/xsd/maven-4.0.0.xsd];> 4.0.0 example datastream_api 1.0-SNAPSHOT org.apache.maven.plugins maven-compiler-plugin 3.1 1.8 1.8 org.scala-tools maven-scala-plugin 2.15.2 compile testCompile org.apache.flink flink-streaming-scala_2.11 1.11.1 org.apache.flink flink-clients_2.11 1.11.1 org.apache.flink flink-statebackend-rocksdb_2.11 1.11.2 org.apache.hadoop hadoop-client 3.3.0 org.apache.flink flink-core 1.11.1 org.apache.flink flink-cep_2.11 1.11.1 org.apache.flink flink-cep-scala_2.11 1.11.1 org.apache.flink flink-scala_2.11 1.11.1 org.projectlombok lombok 1.18.4 the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk 3.flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b 4.input the following conents in nc -lk before error error error error 5. flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document|[https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html]] carefully,but I can NOT succeed in it.* *Please help.* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with checkpoint and then resume from checkpoint manually* *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is: http://maven.apache.org/POM/4.0.0; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance; xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 [http://maven.apache.org/xsd/maven-4.0.0.xsd];> 4.0.0 example datastream_api 1.0-SNAPSHOT org.apache.maven.plugins maven-compiler-plugin 3.1 1.8 1.8 org.scala-tools maven-scala-plugin 2.15.2 compile testCompile org.apache.flink flink-streaming-scala_2.11 1.11.1 org.apache.flink flink-clients_2.11 1.11.1 org.apache.flink flink-statebackend-rocksdb_2.11 1.11.2 org.apache.hadoop hadoop-client
[GitHub] [flink] flinkbot edited a comment on pull request #13542: [FLINK-19501][runtime-web] Missing state in enum type in rest_api_v1.…
flinkbot edited a comment on pull request #13542: URL: https://github.com/apache/flink/pull/13542#issuecomment-703993218 ## CI report: * 8666e10ce4b127d2a81c437c2540b5338f818a8b Azure: [PENDING](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7218) Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] flinkbot edited a comment on pull request #13514: [FLINK-19468][DataStream API] Metrics return empty when data stream / operator name contains "+"
flinkbot edited a comment on pull request #13514: URL: https://github.com/apache/flink/pull/13514#issuecomment-701038173 ## CI report: * 8a72f73417ceb18536d282679e8ce5c60f634acb Azure: [FAILURE](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7100) Azure: [FAILURE](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7211) * 8c642f8e743621fbcad59ebe2827043319d0fc3c Azure: [PENDING](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7217) Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can {color:#de350b}not{color} {color:#de350b}resume from checkpoint manually{color} when I stop the following program by force(maybe NOT supported officially?).* *①I {color:#de350b}hava subscribed to mailing list{color} and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My {color:#de350b}stackoverflow account{color} is forbidden to ask questions.* *③No one in alibaba {color:#de350b}Flink dingtalk group{color} can answer this question.* *④I have read [document|[https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html]] carefully,but I can NOT succeed in it.* *Please help.* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with checkpoint and then resume from checkpoint manually* *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is: http://maven.apache.org/POM/4.0.0; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance; xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 [http://maven.apache.org/xsd/maven-4.0.0.xsd];> 4.0.0 example datastream_api 1.0-SNAPSHOT org.apache.maven.plugins maven-compiler-plugin 3.1 1.8 1.8 org.scala-tools maven-scala-plugin 2.15.2 compile testCompile org.apache.flink flink-streaming-scala_2.11 1.11.1 org.apache.flink flink-clients_2.11 1.11.1 org.apache.flink flink-statebackend-rocksdb_2.11 1.11.2 org.apache.hadoop hadoop-client 3.3.0 org.apache.flink flink-core 1.11.1 org.apache.flink flink-cep_2.11 1.11.1 org.apache.flink flink-cep-scala_2.11 1.11.1 org.apache.flink flink-scala_2.11 1.11.1 org.projectlombok lombok 1.18.4 the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk 3.flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b 4.input the following conents in nc -lk before error error error error 5. flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can not resume from checkpoint manually when I stop the following program by force(maybe NOT supported officially?).* *①I hava subscribed to mailing list and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My stackoverflow account is forbidden to ask questions.* *③No one in alibaba Flink dingtalk can answer this question.* *④I have read [document|[https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html]] carefully,but I can NOT succeed in it.* *Please help.* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with checkpoint and then resume from checkpoint manually* *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is: http://maven.apache.org/POM/4.0.0; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance; xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 [http://maven.apache.org/xsd/maven-4.0.0.xsd];> 4.0.0 example datastream_api 1.0-SNAPSHOT org.apache.maven.plugins maven-compiler-plugin 3.1 1.8 1.8 org.scala-tools maven-scala-plugin 2.15.2 compile testCompile org.apache.flink flink-streaming-scala_2.11 1.11.1 org.apache.flink flink-clients_2.11 1.11.1 org.apache.flink flink-statebackend-rocksdb_2.11 1.11.2 org.apache.hadoop hadoop-client 3.3.0 org.apache.flink flink-core 1.11.1 org.apache.flink flink-cep_2.11 1.11.1 org.apache.flink flink-cep-scala_2.11 1.11.1 org.apache.flink flink-scala_2.11 1.11.1 org.projectlombok lombok 1.18.4 the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/]
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can not resume from checkpoint manually when I stop the following program by force(maybe NOT supported officially?).* *①I hava subscribed to mailing list and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My stackoverflow account is forbidden to ask questions.* *③No one in alibaba Flink dingtalk can answer this question.* *④I have read [document|[https://ci.apache.org/projects/flink/flink-docs-stable/ops/state/checkpoints.html]] carefully,but I can NOT succeed in it.* *Please help.* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with checkpoint and then resume from checkpoint manually* *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is: http://maven.apache.org/POM/4.0.0; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance; xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 [http://maven.apache.org/xsd/maven-4.0.0.xsd];> 4.0.0 example datastream_api 1.0-SNAPSHOT org.apache.maven.plugins maven-compiler-plugin 3.1 1.8 1.8 org.scala-tools maven-scala-plugin 2.15.2 compile testCompile org.apache.flink flink-streaming-scala_2.11 1.11.1 org.apache.flink flink-clients_2.11 1.11.1 org.apache.flink flink-statebackend-rocksdb_2.11 1.11.2 org.apache.hadoop hadoop-client 3.3.0 org.apache.flink flink-core 1.11.1 org.apache.flink flink-cep_2.11 1.11.1 org.apache.flink flink-cep-scala_2.11 1.11.1 org.apache.flink flink-scala_2.11 1.11.1 org.projectlombok lombok 1.18.4 the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk 3.flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b 4.input the following conents in nc -lk before error error error error 5. flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can not resume from checkpoint manually when I stop the following program by force(maybe NOT supported officially?).* *①I hava subscribed to mailing list and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My stackoverflow account is forbidden to ask questions.* *Please help.* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with checkpoint and then resume from checkpoint manually* *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is: http://maven.apache.org/POM/4.0.0; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance; xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 [http://maven.apache.org/xsd/maven-4.0.0.xsd];> 4.0.0 example datastream_api 1.0-SNAPSHOT org.apache.maven.plugins maven-compiler-plugin 3.1 1.8 1.8 org.scala-tools maven-scala-plugin 2.15.2 compile testCompile org.apache.flink flink-streaming-scala_2.11 1.11.1 org.apache.flink flink-clients_2.11 1.11.1 org.apache.flink flink-statebackend-rocksdb_2.11 1.11.2 org.apache.hadoop hadoop-client 3.3.0 org.apache.flink flink-core 1.11.1 org.apache.flink flink-cep_2.11 1.11.1 org.apache.flink flink-cep-scala_2.11 1.11.1 org.apache.flink flink-scala_2.11 1.11.1 org.projectlombok lombok 1.18.4 the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can not resume from checkpoint manually when I stop the following program by force(maybe NOT supported officially?).* *①I hava subscribed to mailing list and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *I guess he's very busy.* *②My stackoverflow account is forbidden to ask questions.* *Please help.* *The keypoint is:* *①Of course I know checkpoint is for automatically resuming,it's basics.* *②I want to stop the program by force(simulate Product Environment)with checkpoint and then resume from checkpoint manually* *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is: http://maven.apache.org/POM/4.0.0; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance; xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 [http://maven.apache.org/xsd/maven-4.0.0.xsd];> 4.0.0 example datastream_api 1.0-SNAPSHOT org.apache.maven.plugins maven-compiler-plugin 3.1 1.8 1.8 org.scala-tools maven-scala-plugin 2.15.2 compile testCompile org.apache.flink flink-streaming-scala_2.11 1.11.1 org.apache.flink flink-clients_2.11 1.11.1 org.apache.flink flink-statebackend-rocksdb_2.11 1.11.2 org.apache.hadoop hadoop-client 3.3.0 org.apache.flink flink-core 1.11.1 org.apache.flink flink-cep_2.11 1.11.1 org.apache.flink flink-cep-scala_2.11 1.11.1 org.apache.flink flink-scala_2.11 1.11.1 org.projectlombok lombok 1.18.4 the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk 3.flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b 4.input the following conents in nc -lk before error error error error 5. flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *The reason why I reopen it is that:* *I can not resume from checkpoint manually when I stop the following program by force(maybe NOT supported officially?).* *①I hava subscribed to mailing list and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *②My stackoverflow account is forbidden to ask questions.* *Please help.* *The keypoint is:* *①* *②* *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is: http://maven.apache.org/POM/4.0.0; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance; xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 [http://maven.apache.org/xsd/maven-4.0.0.xsd];> 4.0.0 example datastream_api 1.0-SNAPSHOT org.apache.maven.plugins maven-compiler-plugin 3.1 1.8 1.8 org.scala-tools maven-scala-plugin 2.15.2 compile testCompile org.apache.flink flink-streaming-scala_2.11 1.11.1 org.apache.flink flink-clients_2.11 1.11.1 org.apache.flink flink-statebackend-rocksdb_2.11 1.11.2 org.apache.hadoop hadoop-client 3.3.0 org.apache.flink flink-core 1.11.1 org.apache.flink flink-cep_2.11 1.11.1 org.apache.flink flink-cep-scala_2.11 1.11.1 org.apache.flink flink-scala_2.11 1.11.1 org.projectlombok lombok 1.18.4 the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk 3.flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b 4.input the following conents in nc -lk before error error error error 5. flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! >
[jira] [Updated] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi updated FLINK-19486: --- Description: *The reason why I reopen it is that:* *I can not resume from checkpoint manually when I stop the following program by force(maybe NOT supported officially?).* *①I hava subscribed to mailing list and David Anderson haved answered this several days ago.* *But he did not tell me how I can fix it.* *②My stackoverflow account is forbidden to ask questions.* *Please help.* *The keypoint is:* *①* *②* *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is: http://maven.apache.org/POM/4.0.0; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance; xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 [http://maven.apache.org/xsd/maven-4.0.0.xsd];> 4.0.0 example datastream_api 1.0-SNAPSHOT org.apache.maven.plugins maven-compiler-plugin 3.1 1.8 1.8 org.scala-tools maven-scala-plugin 2.15.2 compile testCompile org.apache.flink flink-streaming-scala_2.11 1.11.1 org.apache.flink flink-clients_2.11 1.11.1 org.apache.flink flink-statebackend-rocksdb_2.11 1.11.2 org.apache.hadoop hadoop-client 3.3.0 org.apache.flink flink-core 1.11.1 org.apache.flink flink-cep_2.11 1.11.1 org.apache.flink flink-cep-scala_2.11 1.11.1 org.apache.flink flink-scala_2.11 1.11.1 org.projectlombok lombok 1.18.4 the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk 3.flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b 4.input the following conents in nc -lk before error error error error 5. flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! was: *I want to do an experiment of"incremental checkpoint"* my code is: [https://paste.ubuntu.com/p/DpTyQKq6Vk/] pom.xml is: http://maven.apache.org/POM/4.0.0; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance; xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 [http://maven.apache.org/xsd/maven-4.0.0.xsd];> 4.0.0 example datastream_api 1.0-SNAPSHOT org.apache.maven.plugins maven-compiler-plugin 3.1 1.8 1.8 org.scala-tools maven-scala-plugin 2.15.2 compile testCompile org.apache.flink flink-streaming-scala_2.11 1.11.1 org.apache.flink flink-clients_2.11 1.11.1 org.apache.flink flink-statebackend-rocksdb_2.11 1.11.2 org.apache.hadoop hadoop-client 3.3.0 org.apache.flink flink-core 1.11.1 org.apache.flink flink-cep_2.11 1.11.1 org.apache.flink flink-cep-scala_2.11 1.11.1 org.apache.flink flink-scala_2.11 1.11.1 org.projectlombok lombok 1.18.4 the error I got is: [https://paste.ubuntu.com/p/49HRYXFzR2/] *some of the above error is:* *Caused by: java.lang.IllegalStateException: Unexpected state handle type, expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* The steps are: 1.mvn clean scala:compile compile package 2.nc -lk 3.flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b 4.input the following conents in nc -lk before error error error error 5. flink run -s hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c StateWordCount datastream_api-1.0-SNAPSHOT.jar Then the above error happens. Please help,Thanks~! > expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but > found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle > -- > > Key: FLINK-19486 > URL: https://issues.apache.org/jira/browse/FLINK-19486 > Project: Flink > Issue Type: Bug >Affects Versions: 1.11.1 >Reporter: appleyuchi >Priority: Minor > > *The reason why I reopen it is that:* > > *I can not resume from
[jira] [Reopened] (FLINK-19486) expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle
[ https://issues.apache.org/jira/browse/FLINK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] appleyuchi reopened FLINK-19486: > expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but > found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle > -- > > Key: FLINK-19486 > URL: https://issues.apache.org/jira/browse/FLINK-19486 > Project: Flink > Issue Type: Bug >Affects Versions: 1.11.1 >Reporter: appleyuchi >Priority: Minor > > *I want to do an experiment of"incremental checkpoint"* > my code is: > [https://paste.ubuntu.com/p/DpTyQKq6Vk/] > > pom.xml is: > > http://maven.apache.org/POM/4.0.0; > xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance; > xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 > [http://maven.apache.org/xsd/maven-4.0.0.xsd];> > 4.0.0 > example > datastream_api > 1.0-SNAPSHOT > > > > org.apache.maven.plugins > maven-compiler-plugin > 3.1 > > 1.8 > 1.8 > > > > org.scala-tools > maven-scala-plugin > 2.15.2 > > > > compile > testCompile > > > > > > > > > > > org.apache.flink > flink-streaming-scala_2.11 > 1.11.1 > > > > > > > > > > org.apache.flink > flink-clients_2.11 > 1.11.1 > > > > org.apache.flink > flink-statebackend-rocksdb_2.11 > 1.11.2 > > > > org.apache.hadoop > hadoop-client > 3.3.0 > > > org.apache.flink > flink-core > 1.11.1 > > > > > > > > > > org.apache.flink > flink-cep_2.11 > 1.11.1 > > > org.apache.flink > flink-cep-scala_2.11 > 1.11.1 > > > org.apache.flink > flink-scala_2.11 > 1.11.1 > > > > org.projectlombok > lombok > 1.18.4 > > > > > > the error I got is: > [https://paste.ubuntu.com/p/49HRYXFzR2/] > > *some of the above error is:* > *Caused by: java.lang.IllegalStateException: Unexpected state handle type, > expected: class org.apache.flink.runtime.state.KeyGroupsStateHandle, but > found: class org.apache.flink.runtime.state.IncrementalRemoteKeyedStateHandle* > > > The steps are: > 1.mvn clean scala:compile compile package > 2.nc -lk > 3.flink run -c wordcount_increstate datastream_api-1.0-SNAPSHOT.jar > Job has been submitted with JobID df6d62a43620f258155b8538ef0ddf1b > 4.input the following conents in nc -lk > before > error > error > error > error > 5. > flink run -s > hdfs://Desktop:9000/tmp/flinkck/df6d62a43620f258155b8538ef0ddf1b/chk-22 -c > StateWordCount datastream_api-1.0-SNAPSHOT.jar > Then the above error happens. > > Please help,Thanks~! > -- This message was sent by Atlassian Jira (v8.3.4#803005)
[GitHub] [flink] flinkbot commented on pull request #13542: [FLINK-19501][runtime-web] Missing state in enum type in rest_api_v1.…
flinkbot commented on pull request #13542: URL: https://github.com/apache/flink/pull/13542#issuecomment-703993218 ## CI report: * 8666e10ce4b127d2a81c437c2540b5338f818a8b UNKNOWN Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Commented] (FLINK-14197) Increasing trend for state size of keyed stream using ProcessWindowFunction with ProcessingTimeSessionWindows
[ https://issues.apache.org/jira/browse/FLINK-14197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17208458#comment-17208458 ] Ravi Itha commented on FLINK-14197: --- I am impacted with this issue. The checkpoint size increases continuously to a point where application crashes. After a complete restart the Checkpoint size resets back to 0. I have been running 6 different application with steady data load and I can see the same trend in every single application. Flink Version = 1.8.2 Window used = SessionWindow State backend = RocksDB > Increasing trend for state size of keyed stream using ProcessWindowFunction > with ProcessingTimeSessionWindows > - > > Key: FLINK-14197 > URL: https://issues.apache.org/jira/browse/FLINK-14197 > Project: Flink > Issue Type: Bug > Components: Runtime / Checkpointing, Runtime / State Backends >Affects Versions: 1.9.0 > Environment: Tested with: > * Local Flink Mini Cluster running from IDE > * Flink standalone cluster run in docker >Reporter: Oliver Kostera >Priority: Major > > I'm using *ProcessWindowFunction* in a keyed stream with the following > definition: > {code:java} > final SingleOutputStreamOperator processWindowFunctionStream > = > > keyedStream.window(ProcessingTimeSessionWindows.withGap(Time.milliseconds(100))) > .process(new > CustomProcessWindowFunction()).uid(PROCESS_WINDOW_FUNCTION_OPERATOR_ID) > .name("Process window function"); > {code} > My checkpointing configuration is set to use RocksDB state backend with > incremental checkpointing and EXACTLY_ONCE mode. > In a runtime I noticed that even though data ingestion is static - same keys > and frequency of messages the size of the process window operator keeps > increasing. I tried to reproduce it with minimal similar setup here: > https://github.com/loliver1234/flink-process-window-function and was > successful to do so. > Testing conditions: > - RabbitMQ source with Exactly-once guarantee and 65k prefetch count > - RabbitMQ sink to collect messages > - Simple ProcessWindowFunction that only pass messages through > - Stream time characteristic set to TimeCharacteristic.ProcessingTime > Testing scenario: > - Start flink job and check initial state size - State Size: 127 KB > - Start sending messages, 1000 same unique keys every 1s (they are not > falling into defined time window gap set to 100ms, each message should create > new window) > - State of the process window operator keeps increasing - after 1mln messages > state ended up to be around 2mb > - Stop sending messages and wait till rabbit queue is fully consumed and few > checkpoints go by > - Was expected to see state size to decrease to base value but it stayed at > 2mb > - Continue to send messages with the same keys and state kept increasing > trend. > What I checked: > - Registration and deregistration of timestamps set for time windows - each > registration matched its deregistration > - Checked that in fact there are no window merges > - Tried custom Trigger disabling window merges and setting onProcessingTime > trigger to TriggerResult.FIRE_AND_PURGE - same state behavior > On staging environment, we noticed that state for that operator keeps > increasing indefinitely, after some months reaching even 1,5gb for 100k > unique keys > Flink commit id: 9c32ed9 > -- This message was sent by Atlassian Jira (v8.3.4#803005)
[GitHub] [flink] flinkbot edited a comment on pull request #13514: [FLINK-19468][DataStream API] Metrics return empty when data stream / operator name contains "+"
flinkbot edited a comment on pull request #13514: URL: https://github.com/apache/flink/pull/13514#issuecomment-701038173 ## CI report: * 8a72f73417ceb18536d282679e8ce5c60f634acb Azure: [FAILURE](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7100) Azure: [FAILURE](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7211) * 8c642f8e743621fbcad59ebe2827043319d0fc3c UNKNOWN Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Reopened] (FLINK-19501) Missing state in enum type in rest_api_v1.snapshot
[ https://issues.apache.org/jira/browse/FLINK-19501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] goutham reopened FLINK-19501: - this enum type is still missing in the snapshot file > Missing state in enum type in rest_api_v1.snapshot > -- > > Key: FLINK-19501 > URL: https://issues.apache.org/jira/browse/FLINK-19501 > Project: Flink > Issue Type: Bug > Components: Documentation, Runtime / REST >Affects Versions: 1.11.2 >Reporter: goutham >Priority: Minor > Labels: pull-request-available > > INITIALIZING state is missing for one of enums api snapshot document > > "enum" : [ "INITIALIZING", "CREATED", "RUNNING", "FAILING", "FAILED", > "CANCELLING", "CANCELED", "FINISHED", "RESTARTING", "SUSPENDED", > "RECONCILING" ] -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (FLINK-19468) Metrics return empty when data stream / operator name contains "+"
[ https://issues.apache.org/jira/browse/FLINK-19468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Boyang Jerry Peng updated FLINK-19468: -- Description: If I submit a Flink job in which a stream has a name that contains a "+" character, the metrics returned for the stream is always empty. For example: ``` env.addSource(new TestSource()).name("testing + plus"); ``` If I try to get the metrics ``` $ curl [http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut] [] ``` The http request is also made from the UI if you view the metric "[0.Source__testing_+_plus.numRecordsOut|http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut];. The metrics will always return empty. However if I remove the "+" from the name of the stream. Metrics are returned non-empty. Maybe it has something to do with this method: [https://github.com/apache/flink/blob/master/flink-runtime/src/main/java/org/apache/flink/runtime/metrics/dump/MetricQueryService.java#L208] It does not consider "+" character? I have attached the full example to reproduce the issue. was: If I submit a Flink job in which a stream has a name that contains a "+" character, the metrics returned for the stream is always empty. For example: ``` env.addSource(new TestSource()).name("testing + plus"); ``` If I try to get the metrics ``` $ curl [http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut] [] ``` The metrics will always return empty. However if I remove the "+" from the name of the stream. Metrics are returned non-empty. Maybe it has something to do with this method: [https://github.com/apache/flink/blob/master/flink-runtime/src/main/java/org/apache/flink/runtime/metrics/dump/MetricQueryService.java#L208] It does not consider "+" character? I have attached the full example to reproduce the issue. > Metrics return empty when data stream / operator name contains "+" > -- > > Key: FLINK-19468 > URL: https://issues.apache.org/jira/browse/FLINK-19468 > Project: Flink > Issue Type: Bug > Components: API / DataStream >Affects Versions: 1.9.0, 1.9.2, 1.9.3, 2.0.0 >Reporter: Boyang Jerry Peng >Assignee: Boyang Jerry Peng >Priority: Major > Labels: pull-request-available > Attachments: StreamingJob.java > > > If I submit a Flink job in which a stream has a name that contains a "+" > character, the metrics returned for the stream is always empty. For example: > ``` > env.addSource(new TestSource()).name("testing + plus"); > ``` > > If I try to get the metrics > ``` > $ curl > [http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut] > [] > ``` > > The http request is also made from the UI if you view the metric > "[0.Source__testing_+_plus.numRecordsOut|http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut];. > The metrics will always return empty. However if I remove the "+" from the > name of the stream. Metrics are returned non-empty. > Maybe it has something to do with this method: > [https://github.com/apache/flink/blob/master/flink-runtime/src/main/java/org/apache/flink/runtime/metrics/dump/MetricQueryService.java#L208] > It does not consider "+" character? > > I have attached the full example to reproduce the issue. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[GitHub] [flink] flinkbot edited a comment on pull request #13458: FLINK-18851 [runtime] Add checkpoint type to checkpoint history entries in Web UI
flinkbot edited a comment on pull request #13458: URL: https://github.com/apache/flink/pull/13458#issuecomment-697041219 ## CI report: * 1b59c5c3c61d9ea667f3e722bb54788d1d687cec Azure: [SUCCESS](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7040) * 3a9c127acf7d5615f5a32c7af35f8611b93cd6ab Azure: [PENDING](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7216) Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] flinkbot commented on pull request #13542: [FLINK-19501][runtime-web] Missing state in enum type in rest_api_v1.…
flinkbot commented on pull request #13542: URL: https://github.com/apache/flink/pull/13542#issuecomment-703987128 Thanks a lot for your contribution to the Apache Flink project. I'm the @flinkbot. I help the community to review your pull request. We will use this comment to track the progress of the review. ## Automated Checks Last check on commit 8666e10ce4b127d2a81c437c2540b5338f818a8b (Tue Oct 06 02:13:49 UTC 2020) **Warnings:** * No documentation files were touched! Remember to keep the Flink docs up to date! * **This pull request references an unassigned [Jira ticket](https://issues.apache.org/jira/browse/FLINK-19501).** According to the [code contribution guide](https://flink.apache.org/contributing/contribute-code.html), tickets need to be assigned before starting with the implementation work. Mention the bot in a comment to re-run the automated checks. ## Review Progress * ❓ 1. The [description] looks good. * ❓ 2. There is [consensus] that the contribution should go into to Flink. * ❓ 3. Needs [attention] from. * ❓ 4. The change fits into the overall [architecture]. * ❓ 5. Overall code [quality] is good. Please see the [Pull Request Review Guide](https://flink.apache.org/contributing/reviewing-prs.html) for a full explanation of the review process. The Bot is tracking the review progress through labels. Labels are applied according to the order of the review items. For consensus, approval by a Flink committer of PMC member is required Bot commands The @flinkbot bot supports the following commands: - `@flinkbot approve description` to approve one or more aspects (aspects: `description`, `consensus`, `architecture` and `quality`) - `@flinkbot approve all` to approve all aspects - `@flinkbot approve-until architecture` to approve everything until `architecture` - `@flinkbot attention @username1 [@username2 ..]` to require somebody's attention - `@flinkbot disapprove architecture` to remove an approval you gave earlier This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] gm7y8 edited a comment on pull request #13458: FLINK-18851 [runtime] Add checkpoint type to checkpoint history entries in Web UI
gm7y8 edited a comment on pull request #13458: URL: https://github.com/apache/flink/pull/13458#issuecomment-703980529 @XComp @azagrebin created to fix bug FLINK-19501 to fix the missing enum.. other PR just addresses only the html part not the snapshot document. created this PR to fix the issue https://github.com/apache/flink/pull/13542 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] gm7y8 opened a new pull request #13542: [FLINK-19501][runtime-web] Missing state in enum type in rest_api_v1.…
gm7y8 opened a new pull request #13542: URL: https://github.com/apache/flink/pull/13542 …snapshot ## What is the purpose of the change *(For example: This pull request makes task deployment go through the blob server, rather than through RPC. That way we avoid re-transferring them on each deployment (during recovery).)* ## Brief change log *(for example:)* - *The TaskInfo is stored in the blob store on job creation time as a persistent artifact* - *Deployments RPC transmits only the blob storage reference* - *TaskManagers retrieve the TaskInfo from the blob cache* ## Verifying this change *(Please pick either of the following options)* This change is a trivial rework / code cleanup without any test coverage. *(or)* This change is already covered by existing tests, such as *(please describe tests)*. *(or)* This change added tests and can be verified as follows: *(example:)* - *Added integration tests for end-to-end deployment with large payloads (100MB)* - *Extended integration test for recovery after master (JobManager) failure* - *Added test that validates that TaskInfo is transferred only once across recoveries* - *Manually verified the change by running a 4 node cluser with 2 JobManagers and 4 TaskManagers, a stateful streaming program, and killing one JobManager and two TaskManagers during the execution, verifying that recovery happens correctly.* ## Does this pull request potentially affect one of the following parts: - Dependencies (does it add or upgrade a dependency): (yes / no) - The public API, i.e., is any changed class annotated with `@Public(Evolving)`: (yes / no) - The serializers: (yes / no / don't know) - The runtime per-record code paths (performance sensitive): (yes / no / don't know) - Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Kubernetes/Yarn/Mesos, ZooKeeper: (yes / no / don't know) - The S3 file system connector: (yes / no / don't know) ## Documentation - Does this pull request introduce a new feature? (yes / no) - If yes, how is the feature documented? (not applicable / docs / JavaDocs / not documented) This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Updated] (FLINK-19501) Missing state in enum type in rest_api_v1.snapshot
[ https://issues.apache.org/jira/browse/FLINK-19501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated FLINK-19501: --- Labels: pull-request-available (was: ) > Missing state in enum type in rest_api_v1.snapshot > -- > > Key: FLINK-19501 > URL: https://issues.apache.org/jira/browse/FLINK-19501 > Project: Flink > Issue Type: Bug > Components: Documentation, Runtime / REST >Affects Versions: 1.11.2 >Reporter: goutham >Priority: Minor > Labels: pull-request-available > > INITIALIZING state is missing for one of enums api snapshot document > > "enum" : [ "INITIALIZING", "CREATED", "RUNNING", "FAILING", "FAILED", > "CANCELLING", "CANCELED", "FINISHED", "RESTARTING", "SUSPENDED", > "RECONCILING" ] -- This message was sent by Atlassian Jira (v8.3.4#803005)
[GitHub] [flink] jerrypeng edited a comment on pull request #13514: [FLINK-19468][DataStream API] Metrics return empty when data stream / operator name contains "+"
jerrypeng edited a comment on pull request #13514: URL: https://github.com/apache/flink/pull/13514#issuecomment-703985054 "+" is also a special character in http requests. For HTTP requests especially via GET where parameters are in the URL, special characters like "+" are typically encoded: https://www.w3schools.com/tags/ref_urlencode.ASP This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] jerrypeng edited a comment on pull request #13514: [FLINK-19468][DataStream API] Metrics return empty when data stream / operator name contains "+"
jerrypeng edited a comment on pull request #13514: URL: https://github.com/apache/flink/pull/13514#issuecomment-703985054 "+" is also a special character in http requests. For HTTP requests especially via GET where parameters are in the URL, special characters like "+" are encoded: https://www.w3schools.com/tags/ref_urlencode.ASP This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] jerrypeng commented on pull request #13514: [FLINK-19468][DataStream API] Metrics return empty when data stream / operator name contains "+"
jerrypeng commented on pull request #13514: URL: https://github.com/apache/flink/pull/13514#issuecomment-703985054 "+" is also a special character in http requests. Technically all the following characters are "reserved" characters for URIs delimiters — :/?#[]@ subdelimiters — !$&'()*+,;= This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Updated] (FLINK-19468) Metrics return empty when data stream / operator name contains "+"
[ https://issues.apache.org/jira/browse/FLINK-19468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Boyang Jerry Peng updated FLINK-19468: -- Description: If I submit a Flink job in which a stream has a name that contains a "+" character, the metrics returned for the stream is always empty. For example: ``` env.addSource(new TestSource()).name("testing + plus"); ``` If I try to get the metrics ``` $ curl [http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut] [] ``` The metrics will always return empty. However if I remove the "+" from the name of the stream. Metrics are returned non-empty. Maybe it has something to do with this method: [https://github.com/apache/flink/blob/master/flink-runtime/src/main/java/org/apache/flink/runtime/metrics/dump/MetricQueryService.java#L208] It does not consider "+" character? I have attached the full example to reproduce the issue. was: If I submit a Flink job in which the stream name has a "+" character, the metrics returned for the stream is always empty. Consider the following example: ``` env.addSource(new TestSource()).name("testing + plus"); ``` If I try to get the metrics ``` $ curl [http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut] [] ``` The metrics will always return empty. However if I remove the "+" from the name of the stream. Metrics are returned non-empty. Maybe it has something to do with this method: [https://github.com/apache/flink/blob/master/flink-runtime/src/main/java/org/apache/flink/runtime/metrics/dump/MetricQueryService.java#L208] It does not consider "+" character? I have attached the full example to reproduce the issue. > Metrics return empty when data stream / operator name contains "+" > -- > > Key: FLINK-19468 > URL: https://issues.apache.org/jira/browse/FLINK-19468 > Project: Flink > Issue Type: Bug > Components: API / DataStream >Affects Versions: 1.9.0, 1.9.2, 1.9.3, 2.0.0 >Reporter: Boyang Jerry Peng >Assignee: Boyang Jerry Peng >Priority: Major > Labels: pull-request-available > Attachments: StreamingJob.java > > > If I submit a Flink job in which a stream has a name that contains a "+" > character, the metrics returned for the stream is always empty. For example: > ``` > env.addSource(new TestSource()).name("testing + plus"); > ``` > > If I try to get the metrics > ``` > $ curl > [http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut] > [] > ``` > The metrics will always return empty. However if I remove the "+" from the > name of the stream. Metrics are returned non-empty. > Maybe it has something to do with this method: > [https://github.com/apache/flink/blob/master/flink-runtime/src/main/java/org/apache/flink/runtime/metrics/dump/MetricQueryService.java#L208] > It does not consider "+" character? > > I have attached the full example to reproduce the issue. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[GitHub] [flink] jerrypeng commented on pull request #13514: [FLINK-19468][DataStream API] Metrics return empty when data stream / operator name contains "+"
jerrypeng commented on pull request #13514: URL: https://github.com/apache/flink/pull/13514#issuecomment-703982356 @jgrier thanks for taking a look at my PR. I have added more details to the jira and example code to reproduce. I have verified my fix with a local standalone cluster with the example code in the JIRA This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] flinkbot edited a comment on pull request #13458: FLINK-18851 [runtime] Add checkpoint type to checkpoint history entries in Web UI
flinkbot edited a comment on pull request #13458: URL: https://github.com/apache/flink/pull/13458#issuecomment-697041219 ## CI report: * 1b59c5c3c61d9ea667f3e722bb54788d1d687cec Azure: [SUCCESS](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7040) * 3a9c127acf7d5615f5a32c7af35f8611b93cd6ab UNKNOWN Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Commented] (FLINK-19491) AvroSerializerSnapshot cannot handle large schema
[ https://issues.apache.org/jira/browse/FLINK-19491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17208453#comment-17208453 ] Nicholas Jiang commented on FLINK-19491: [~AHeise], the maximum length of the allowed string literal is 64KB, therefore encoded string of UTF is within 64KB. IMO, I couldn't get the bug of this issue. > AvroSerializerSnapshot cannot handle large schema > - > > Key: FLINK-19491 > URL: https://issues.apache.org/jira/browse/FLINK-19491 > Project: Flink > Issue Type: Bug > Components: Formats (JSON, Avro, Parquet, ORC, SequenceFile) >Affects Versions: 1.10.2, 1.12.0, 1.11.2 >Reporter: Arvid Heise >Priority: Major > > Flink can only handle schemas up to a size of 64kb. > > {noformat} > Caused by: java.io.UTFDataFormatException: encoded string too long: 223502 > bytes > at java.io.DataOutputStream.writeUTF(DataOutputStream.java:364) > at java.io.DataOutputStream.writeUTF(DataOutputStream.java:323) > at > org.apache.flink.formats.avro.typeutils.AvroSerializerSnapshot.writeSnapshot(AvroSerializerSnapshot.java:75) > at > org.apache.flink.api.common.typeutils.TypeSerializerSnapshot.writeVersionedSnapshot(TypeSerializerSnapshot.java:153) > at > org.apache.flink.api.common.typeutils.NestedSerializersSnapshotDelegate.writeNestedSerializerSnapshots(NestedSerializersSnapshotDelegate.java:159) > at > org.apache.flink.api.common.typeutils.CompositeTypeSerializerSnapshot.writeSnapshot(CompositeTypeSerializerSnapshot.java:148) > at > org.apache.flink.api.common.typeutils.TypeSerializerSnapshot.writeVersionedSnapshot(TypeSerializerSnapshot.java:153) > at > org.apache.flink.api.common.typeutils.TypeSerializerSnapshotSerializationUtil$TypeSerializerSnapshotSerializationProxy.write(TypeSerializerSnapshotSerializationUtil.java:138) > at > org.apache.flink.api.common.typeutils.TypeSerializerSnapshotSerializationUtil.writeSerializerSnapshot(TypeSerializerSnapshotSerializationUtil.java:55) > at > org.apache.flink.runtime.state.metainfo.StateMetaInfoSnapshotReadersWriters$CurrentWriterImpl.writeStateMetaInfoSnapshot(StateMetaInfoSnapshotReadersWriters.java:183) > at > org.apache.flink.runtime.state.KeyedBackendSerializationProxy.write(KeyedBackendSerializationProxy.java:126) > at > org.apache.flink.runtime.state.heap.HeapSnapshotStrategy$1.callInternal(HeapSnapshotStrategy.java:171) > at > org.apache.flink.runtime.state.heap.HeapSnapshotStrategy$1.callInternal(HeapSnapshotStrategy.java:158) > at > org.apache.flink.runtime.state.AsyncSnapshotCallable.call(AsyncSnapshotCallable.java:75) > at java.util.concurrent.FutureTask.run(FutureTask.java:266) > at > org.apache.flink.runtime.concurrent.FutureUtils.runIfNotDoneAndGet(FutureUtils.java:510) > ... 5 common frames omitted{noformat} -- This message was sent by Atlassian Jira (v8.3.4#803005)
[GitHub] [flink] gm7y8 commented on pull request #13458: FLINK-18851 [runtime] Add checkpoint type to checkpoint history entries in Web UI
gm7y8 commented on pull request #13458: URL: https://github.com/apache/flink/pull/13458#issuecomment-703980529 @XComp @azagrebin created to fix bug FLINK-19501 to fix the missing enum.. other PR just addresses only the html part not the snapshot document This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Updated] (FLINK-19501) Missing state in enum type in
[ https://issues.apache.org/jira/browse/FLINK-19501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] goutham updated FLINK-19501: Summary: Missing state in enum type in (was: Missing state in enum type) > Missing state in enum type in > -- > > Key: FLINK-19501 > URL: https://issues.apache.org/jira/browse/FLINK-19501 > Project: Flink > Issue Type: Bug > Components: Documentation, Runtime / REST >Affects Versions: 1.11.2 >Reporter: goutham >Priority: Minor > > INITIALIZING state is missing for one of enums api snapshot document > > "enum" : [ "INITIALIZING", "CREATED", "RUNNING", "FAILING", "FAILED", > "CANCELLING", "CANCELED", "FINISHED", "RESTARTING", "SUSPENDED", > "RECONCILING" ] -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (FLINK-19501) Missing state in enum type in rest_api_v1.snapshot
[ https://issues.apache.org/jira/browse/FLINK-19501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] goutham updated FLINK-19501: Summary: Missing state in enum type in rest_api_v1.snapshot (was: Missing state in enum type in ) > Missing state in enum type in rest_api_v1.snapshot > -- > > Key: FLINK-19501 > URL: https://issues.apache.org/jira/browse/FLINK-19501 > Project: Flink > Issue Type: Bug > Components: Documentation, Runtime / REST >Affects Versions: 1.11.2 >Reporter: goutham >Priority: Minor > > INITIALIZING state is missing for one of enums api snapshot document > > "enum" : [ "INITIALIZING", "CREATED", "RUNNING", "FAILING", "FAILED", > "CANCELLING", "CANCELED", "FINISHED", "RESTARTING", "SUSPENDED", > "RECONCILING" ] -- This message was sent by Atlassian Jira (v8.3.4#803005)
[GitHub] [flink] gm7y8 commented on a change in pull request #13458: FLINK-18851 [runtime] Add checkpoint type to checkpoint history entries in Web UI
gm7y8 commented on a change in pull request #13458: URL: https://github.com/apache/flink/pull/13458#discussion_r499961398 ## File path: flink-runtime/src/main/java/org/apache/flink/runtime/rest/messages/checkpoints/CheckpointStatistics.java ## @@ -229,8 +242,8 @@ public static CheckpointStatistics generateCheckpointStatistics(AbstractCheckpoi checkpointStatisticsPerTask.put( taskStateStat.getJobVertexId(), new TaskCheckpointStatistics( - checkpointStats.getCheckpointId(), - checkpointStats.getStatus(), + checkpointStats.getCheckpointId(), + checkpointStats.getStatus(), Review comment: reverted back. This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Updated] (FLINK-19468) Metrics return empty when data stream / operator name contains "+"
[ https://issues.apache.org/jira/browse/FLINK-19468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Boyang Jerry Peng updated FLINK-19468: -- Attachment: StreamingJob.java > Metrics return empty when data stream / operator name contains "+" > -- > > Key: FLINK-19468 > URL: https://issues.apache.org/jira/browse/FLINK-19468 > Project: Flink > Issue Type: Bug > Components: API / DataStream >Affects Versions: 1.9.0, 1.9.2, 1.9.3, 2.0.0 >Reporter: Boyang Jerry Peng >Assignee: Boyang Jerry Peng >Priority: Major > Labels: pull-request-available > Attachments: StreamingJob.java > > > If I submit a Flink job in which the stream name has a "+" character, the > metrics returned for the stream is always empty. Consider the following > example: > ``` > env.addSource(new TestSource()).name("testing + plus"); > ``` > > If I try to get the metrics > ``` > $ curl > [http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut] > [] > ``` > The metrics will always return empty. However if I remove the "+" from the > name of the stream. Metrics are returned non-empty. > Maybe it has something to do with this method: > [https://github.com/apache/flink/blob/master/flink-runtime/src/main/java/org/apache/flink/runtime/metrics/dump/MetricQueryService.java#L208] > It does not consider "+" character? -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (FLINK-19468) Metrics return empty when data stream / operator name contains "+"
[ https://issues.apache.org/jira/browse/FLINK-19468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Boyang Jerry Peng updated FLINK-19468: -- Description: If I submit a Flink job in which the stream name has a "+" character, the metrics returned for the stream is always empty. Consider the following example: ``` env.addSource(new TestSource()).name("testing + plus"); ``` If I try to get the metrics ``` $ curl [http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut] [] ``` The metrics will always return empty. However if I remove the "+" from the name of the stream. Metrics are returned non-empty. Maybe it has something to do with this method: [https://github.com/apache/flink/blob/master/flink-runtime/src/main/java/org/apache/flink/runtime/metrics/dump/MetricQueryService.java#L208] It does not consider "+" character? I have attached the full example to reproduce the issue. was: If I submit a Flink job in which the stream name has a "+" character, the metrics returned for the stream is always empty. Consider the following example: ``` env.addSource(new TestSource()).name("testing + plus"); ``` If I try to get the metrics ``` $ curl [http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut] [] ``` The metrics will always return empty. However if I remove the "+" from the name of the stream. Metrics are returned non-empty. Maybe it has something to do with this method: [https://github.com/apache/flink/blob/master/flink-runtime/src/main/java/org/apache/flink/runtime/metrics/dump/MetricQueryService.java#L208] It does not consider "+" character? > Metrics return empty when data stream / operator name contains "+" > -- > > Key: FLINK-19468 > URL: https://issues.apache.org/jira/browse/FLINK-19468 > Project: Flink > Issue Type: Bug > Components: API / DataStream >Affects Versions: 1.9.0, 1.9.2, 1.9.3, 2.0.0 >Reporter: Boyang Jerry Peng >Assignee: Boyang Jerry Peng >Priority: Major > Labels: pull-request-available > Attachments: StreamingJob.java > > > If I submit a Flink job in which the stream name has a "+" character, the > metrics returned for the stream is always empty. Consider the following > example: > ``` > env.addSource(new TestSource()).name("testing + plus"); > ``` > > If I try to get the metrics > ``` > $ curl > [http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut] > [] > ``` > The metrics will always return empty. However if I remove the "+" from the > name of the stream. Metrics are returned non-empty. > Maybe it has something to do with this method: > [https://github.com/apache/flink/blob/master/flink-runtime/src/main/java/org/apache/flink/runtime/metrics/dump/MetricQueryService.java#L208] > It does not consider "+" character? > > I have attached the full example to reproduce the issue. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (FLINK-19468) Metrics return empty when data stream / operator name contains "+"
[ https://issues.apache.org/jira/browse/FLINK-19468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Boyang Jerry Peng updated FLINK-19468: -- Attachment: (was: StreamingJob.java) > Metrics return empty when data stream / operator name contains "+" > -- > > Key: FLINK-19468 > URL: https://issues.apache.org/jira/browse/FLINK-19468 > Project: Flink > Issue Type: Bug > Components: API / DataStream >Affects Versions: 1.9.0, 1.9.2, 1.9.3, 2.0.0 >Reporter: Boyang Jerry Peng >Assignee: Boyang Jerry Peng >Priority: Major > Labels: pull-request-available > > If I submit a Flink job in which the stream name has a "+" character, the > metrics returned for the stream is always empty. Consider the following > example: > ``` > env.addSource(new TestSource()).name("testing + plus"); > ``` > > If I try to get the metrics > ``` > $ curl > [http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut] > [] > ``` > The metrics will always return empty. However if I remove the "+" from the > name of the stream. Metrics are returned non-empty. > Maybe it has something to do with this method: > [https://github.com/apache/flink/blob/master/flink-runtime/src/main/java/org/apache/flink/runtime/metrics/dump/MetricQueryService.java#L208] > It does not consider "+" character? -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (FLINK-19468) Metrics return empty when data stream / operator name contains "+"
[ https://issues.apache.org/jira/browse/FLINK-19468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Boyang Jerry Peng updated FLINK-19468: -- Attachment: StreamingJob.java > Metrics return empty when data stream / operator name contains "+" > -- > > Key: FLINK-19468 > URL: https://issues.apache.org/jira/browse/FLINK-19468 > Project: Flink > Issue Type: Bug > Components: API / DataStream >Affects Versions: 1.9.0, 1.9.2, 1.9.3, 2.0.0 >Reporter: Boyang Jerry Peng >Assignee: Boyang Jerry Peng >Priority: Major > Labels: pull-request-available > Attachments: StreamingJob.java > > > If I submit a Flink job in which the stream name has a "+" character, the > metrics returned for the stream is always empty. Consider the following > example: > ``` > env.addSource(new TestSource()).name("testing + plus"); > ``` > > If I try to get the metrics > ``` > $ curl > [http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut] > [] > ``` > The metrics will always return empty. However if I remove the "+" from the > name of the stream. Metrics are returned non-empty. > Maybe it has something to do with this method: > [https://github.com/apache/flink/blob/master/flink-runtime/src/main/java/org/apache/flink/runtime/metrics/dump/MetricQueryService.java#L208] > It does not consider "+" character? -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Comment Edited] (FLINK-19401) Job stuck in restart loop due to excessive checkpoint recoveries which block the JobMaster
[ https://issues.apache.org/jira/browse/FLINK-19401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17208437#comment-17208437 ] Steven Zhen Wu edited comment on FLINK-19401 at 10/6/20, 12:42 AM: --- [~roman_khachatryan] I don't know if repeated checkpoint recovery is the root/main cause or not. [~trohrmann] identified two problems during his investigation. This is one of the identified problem. Regarding the logs related to Titus, please ignore them. They are just noise. We haven't cleaned up our logs yet. was (Author: stevenz3wu): [~roman_khachatryan] I don't know if repeated checkpoint recovery is the root/main cause or not. [~trohrmann] identified two problems during his investigation. This is one of the identified problem. Regarding the logs related to Titusm please ignore them. They are just noise. We haven't cleaned up our logs yet. > Job stuck in restart loop due to excessive checkpoint recoveries which block > the JobMaster > -- > > Key: FLINK-19401 > URL: https://issues.apache.org/jira/browse/FLINK-19401 > Project: Flink > Issue Type: Bug > Components: Runtime / Checkpointing >Affects Versions: 1.10.1, 1.11.2 >Reporter: Steven Zhen Wu >Priority: Critical > Fix For: 1.12.0, 1.10.3, 1.11.3 > > > Flink job sometimes got into a restart loop for many hours and can't recover > until redeployed. We had some issue with Kafka that initially caused the job > to restart. > Below is the first of the many exceptions for "ResourceManagerException: > Could not find registered job manager" error. > {code} > 2020-09-19 00:03:31,614 INFO > org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl > [flink-akka.actor.default-dispatcher-35973] - Requesting new slot > [SlotRequestId{171f1df017dab3a42c032abd07908b9b}] and profile ResourceP > rofile{UNKNOWN} from resource manager. > 2020-09-19 00:03:31,615 INFO > org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl > [flink-akka.actor.default-dispatcher-35973] - Requesting new slot > [SlotRequestId{cc7d136c4ce1f32285edd4928e3ab2e2}] and profile > ResourceProfile{UNKNOWN} from resource manager. > 2020-09-19 00:03:31,615 INFO > org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl > [flink-akka.actor.default-dispatcher-35973] - Requesting new slot > [SlotRequestId{024c8a48dafaf8f07c49dd4320d5cc94}] and profile > ResourceProfile{UNKNOWN} from resource manager. > 2020-09-19 00:03:31,615 INFO > org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl > [flink-akka.actor.default-dispatcher-35973] - Requesting new slot > [SlotRequestId{a591eda805b3081ad2767f5641d0db06}] and profile > ResourceProfile{UNKNOWN} from resource manager. > 2020-09-19 00:03:31,620 INFO > org.apache.flink.runtime.executiongraph.ExecutionGraph > [flink-akka.actor.default-dispatcher-35973] - Source: k2-csevpc -> > k2-csevpcRaw -> (vhsPlaybackEvents -> Flat Map, merchImpressionsClientLog -> > Flat Map) (56/640) (1b0d3dd1f19890886ff373a3f08809e8) switched from SCHEDULED > to FAILED. > java.util.concurrent.CompletionException: > java.util.concurrent.CompletionException: > org.apache.flink.runtime.jobmanager.scheduler.NoResourceAvailableException: > No pooled slot available and request to ResourceManager for new slot failed > at > org.apache.flink.runtime.jobmaster.slotpool.SlotSharingManager$MultiTaskSlot.lambda$new$0(SlotSharingManager.java:433) > at > java.util.concurrent.CompletableFuture.uniHandle(CompletableFuture.java:836) > at > java.util.concurrent.CompletableFuture$UniHandle.tryFire(CompletableFuture.java:811) > at > java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:488) > at > java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:1990) > at > org.apache.flink.runtime.concurrent.FutureUtils.lambda$forward$21(FutureUtils.java:1065) > at > java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:774) > at > java.util.concurrent.CompletableFuture.uniWhenCompleteStage(CompletableFuture.java:792) > at > java.util.concurrent.CompletableFuture.whenComplete(CompletableFuture.java:2153) > at > org.apache.flink.runtime.concurrent.FutureUtils.forward(FutureUtils.java:1063) > at > org.apache.flink.runtime.jobmaster.slotpool.SlotSharingManager.createRootSlot(SlotSharingManager.java:155) > at > org.apache.flink.runtime.jobmaster.slotpool.SchedulerImpl.allocateMultiTaskSlot(SchedulerImpl.java:511) > at > org.apache.flink.runtime.jobmaster.slotpool.SchedulerImpl.allocateSharedSlot(SchedulerImpl.java:311) > at >
[jira] [Commented] (FLINK-19401) Job stuck in restart loop due to excessive checkpoint recoveries which block the JobMaster
[ https://issues.apache.org/jira/browse/FLINK-19401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17208437#comment-17208437 ] Steven Zhen Wu commented on FLINK-19401: [~roman_khachatryan] I don't know if repeated checkpoint recovery is the root/main cause or not. [~trohrmann] identified two problems during his investigation. This is one of the identified problem. Regarding the logs related to Titusm please ignore them. They are just noise. We haven't cleaned up our logs yet. > Job stuck in restart loop due to excessive checkpoint recoveries which block > the JobMaster > -- > > Key: FLINK-19401 > URL: https://issues.apache.org/jira/browse/FLINK-19401 > Project: Flink > Issue Type: Bug > Components: Runtime / Checkpointing >Affects Versions: 1.10.1, 1.11.2 >Reporter: Steven Zhen Wu >Priority: Critical > Fix For: 1.12.0, 1.10.3, 1.11.3 > > > Flink job sometimes got into a restart loop for many hours and can't recover > until redeployed. We had some issue with Kafka that initially caused the job > to restart. > Below is the first of the many exceptions for "ResourceManagerException: > Could not find registered job manager" error. > {code} > 2020-09-19 00:03:31,614 INFO > org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl > [flink-akka.actor.default-dispatcher-35973] - Requesting new slot > [SlotRequestId{171f1df017dab3a42c032abd07908b9b}] and profile ResourceP > rofile{UNKNOWN} from resource manager. > 2020-09-19 00:03:31,615 INFO > org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl > [flink-akka.actor.default-dispatcher-35973] - Requesting new slot > [SlotRequestId{cc7d136c4ce1f32285edd4928e3ab2e2}] and profile > ResourceProfile{UNKNOWN} from resource manager. > 2020-09-19 00:03:31,615 INFO > org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl > [flink-akka.actor.default-dispatcher-35973] - Requesting new slot > [SlotRequestId{024c8a48dafaf8f07c49dd4320d5cc94}] and profile > ResourceProfile{UNKNOWN} from resource manager. > 2020-09-19 00:03:31,615 INFO > org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl > [flink-akka.actor.default-dispatcher-35973] - Requesting new slot > [SlotRequestId{a591eda805b3081ad2767f5641d0db06}] and profile > ResourceProfile{UNKNOWN} from resource manager. > 2020-09-19 00:03:31,620 INFO > org.apache.flink.runtime.executiongraph.ExecutionGraph > [flink-akka.actor.default-dispatcher-35973] - Source: k2-csevpc -> > k2-csevpcRaw -> (vhsPlaybackEvents -> Flat Map, merchImpressionsClientLog -> > Flat Map) (56/640) (1b0d3dd1f19890886ff373a3f08809e8) switched from SCHEDULED > to FAILED. > java.util.concurrent.CompletionException: > java.util.concurrent.CompletionException: > org.apache.flink.runtime.jobmanager.scheduler.NoResourceAvailableException: > No pooled slot available and request to ResourceManager for new slot failed > at > org.apache.flink.runtime.jobmaster.slotpool.SlotSharingManager$MultiTaskSlot.lambda$new$0(SlotSharingManager.java:433) > at > java.util.concurrent.CompletableFuture.uniHandle(CompletableFuture.java:836) > at > java.util.concurrent.CompletableFuture$UniHandle.tryFire(CompletableFuture.java:811) > at > java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:488) > at > java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:1990) > at > org.apache.flink.runtime.concurrent.FutureUtils.lambda$forward$21(FutureUtils.java:1065) > at > java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:774) > at > java.util.concurrent.CompletableFuture.uniWhenCompleteStage(CompletableFuture.java:792) > at > java.util.concurrent.CompletableFuture.whenComplete(CompletableFuture.java:2153) > at > org.apache.flink.runtime.concurrent.FutureUtils.forward(FutureUtils.java:1063) > at > org.apache.flink.runtime.jobmaster.slotpool.SlotSharingManager.createRootSlot(SlotSharingManager.java:155) > at > org.apache.flink.runtime.jobmaster.slotpool.SchedulerImpl.allocateMultiTaskSlot(SchedulerImpl.java:511) > at > org.apache.flink.runtime.jobmaster.slotpool.SchedulerImpl.allocateSharedSlot(SchedulerImpl.java:311) > at > org.apache.flink.runtime.jobmaster.slotpool.SchedulerImpl.internalAllocateSlot(SchedulerImpl.java:160) > at > org.apache.flink.runtime.jobmaster.slotpool.SchedulerImpl.allocateSlotInternal(SchedulerImpl.java:143) > at > org.apache.flink.runtime.jobmaster.slotpool.SchedulerImpl.allocateSlot(SchedulerImpl.java:113) > at >
[jira] [Updated] (FLINK-19468) Metrics return empty when data stream / operator name contains "+"
[ https://issues.apache.org/jira/browse/FLINK-19468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Boyang Jerry Peng updated FLINK-19468: -- Description: If I submit a Flink job in which the stream name has a "+" character, the metrics returned for the stream is always empty. Consider the following example: ``` env.addSource(new TestSource()).name("testing + plus"); ``` If I try to get the metrics ``` $ curl [http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut] [] ``` The metrics will always return empty. However if I remove the "+" from the name of the stream. Metrics are returned non-empty. Maybe it has something to do with this method: [https://github.com/apache/flink/blob/master/flink-runtime/src/main/java/org/apache/flink/runtime/metrics/dump/MetricQueryService.java#L208] It does not consider "+" character? was: If I submit a Flink job in which the stream name has a "+" character, the metrics returned for the stream is always empty. Consider the following example: ``` env.addSource(new TestSource()).name("testing + plus"); ``` If I try to get the metrics ``` $ curl http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut [] ``` The metrics will always return empty. However if I remove the "+" from the name of the stream. Metrics are returned non-empty > Metrics return empty when data stream / operator name contains "+" > -- > > Key: FLINK-19468 > URL: https://issues.apache.org/jira/browse/FLINK-19468 > Project: Flink > Issue Type: Bug > Components: API / DataStream >Affects Versions: 1.9.0, 1.9.2, 1.9.3, 2.0.0 >Reporter: Boyang Jerry Peng >Assignee: Boyang Jerry Peng >Priority: Major > Labels: pull-request-available > > If I submit a Flink job in which the stream name has a "+" character, the > metrics returned for the stream is always empty. Consider the following > example: > ``` > env.addSource(new TestSource()).name("testing + plus"); > ``` > > If I try to get the metrics > ``` > $ curl > [http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut] > [] > ``` > The metrics will always return empty. However if I remove the "+" from the > name of the stream. Metrics are returned non-empty. > Maybe it has something to do with this method: > [https://github.com/apache/flink/blob/master/flink-runtime/src/main/java/org/apache/flink/runtime/metrics/dump/MetricQueryService.java#L208] > It does not consider "+" character? -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (FLINK-19468) Metrics return empty when data stream / operator name contains "+"
[ https://issues.apache.org/jira/browse/FLINK-19468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Boyang Jerry Peng updated FLINK-19468: -- Description: If I submit a Flink job in which the stream name has a "+" character, the metrics returned for the stream is always empty. Consider the following example: ``` env.addSource(new TestSource()).name("testing + plus"); ``` If I try to get the metrics ``` $ curl http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut [] ``` The metrics will always return empty. However if I remove the "+" from the name of the stream. Metrics are returned non-empty was: If I submit a Flink job in which the stream name has a "+" character, the metrics returned for the stream is always empty. Consider the following example: ``` env.addSource(new TestSource()).name("testing + plus"); ``` For example if the operator name is: pulsar(url: pulsar+ssl://192.168.1.198:56014) Metrics for an operator with the above name will always return empty. > Metrics return empty when data stream / operator name contains "+" > -- > > Key: FLINK-19468 > URL: https://issues.apache.org/jira/browse/FLINK-19468 > Project: Flink > Issue Type: Bug > Components: API / DataStream >Affects Versions: 1.9.0, 1.9.2, 1.9.3, 2.0.0 >Reporter: Boyang Jerry Peng >Assignee: Boyang Jerry Peng >Priority: Major > Labels: pull-request-available > > If I submit a Flink job in which the stream name has a "+" character, the > metrics returned for the stream is always empty. Consider the following > example: > ``` > env.addSource(new TestSource()).name("testing + plus"); > ``` > > If I try to get the metrics > ``` > $ curl > http://localhost:8081/jobs/5d4f28a221b2e5762f8404398a462eb0/vertices/cbc357ccb763df2852fee8c4fc7d55f2/metrics?get=0.Source__testing_+_plus.numRecordsOut > [] > ``` > The metrics will always return empty. However if I remove the "+" from the > name of the stream. Metrics are returned non-empty -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (FLINK-19468) Metrics return empty when data stream / operator name contains "+"
[ https://issues.apache.org/jira/browse/FLINK-19468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Boyang Jerry Peng updated FLINK-19468: -- Description: If I submit a Flink job in which the stream name has a "+" character, the metrics returned for the stream is always empty. Consider the following example: ``` env.addSource(new TestSource()).name("testing + plus"); ``` For example if the operator name is: pulsar(url: pulsar+ssl://192.168.1.198:56014) Metrics for an operator with the above name will always return empty. was: If I submit a Flink job in which the stream name has a "+" character, the metrics returned for the stream is always empty. Consider the following example: ` env.addSource(new TestSource()).name("testing + plus"); ` For example if the operator name is: pulsar(url: pulsar+ssl://192.168.1.198:56014) Metrics for an operator with the above name will always return empty. > Metrics return empty when data stream / operator name contains "+" > -- > > Key: FLINK-19468 > URL: https://issues.apache.org/jira/browse/FLINK-19468 > Project: Flink > Issue Type: Bug > Components: API / DataStream >Affects Versions: 1.9.0, 1.9.2, 1.9.3, 2.0.0 >Reporter: Boyang Jerry Peng >Assignee: Boyang Jerry Peng >Priority: Major > Labels: pull-request-available > > If I submit a Flink job in which the stream name has a "+" character, the > metrics returned for the stream is always empty. Consider the following > example: > ``` > env.addSource(new TestSource()).name("testing + plus"); > ``` > > For example if the operator name is: > pulsar(url: pulsar+ssl://192.168.1.198:56014) > Metrics for an operator with the above name will always return empty. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (FLINK-19468) Metrics return empty when data stream / operator name contains "+"
[ https://issues.apache.org/jira/browse/FLINK-19468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Boyang Jerry Peng updated FLINK-19468: -- Description: If I submit a Flink job in which the stream name has a "+" character, the metrics returned for the stream is always empty. Consider the following example: ` env.addSource(new TestSource()).name("testing + plus"); ` For example if the operator name is: pulsar(url: pulsar+ssl://192.168.1.198:56014) Metrics for an operator with the above name will always return empty. was: There is an issue in which the special character "+" is not removed from the data stream / operator name which causes metrics for the operator to not be properly returned. Code Reference: [https://github.com/apache/flink/blob/master/flink-runtime/src/main/java/org/apache/flink/runtime/metrics/dump/MetricQueryService.java#L208] For example if the operator name is: pulsar(url: pulsar+ssl://192.168.1.198:56014) Metrics for an operator with the above name will always return empty. > Metrics return empty when data stream / operator name contains "+" > -- > > Key: FLINK-19468 > URL: https://issues.apache.org/jira/browse/FLINK-19468 > Project: Flink > Issue Type: Bug > Components: API / DataStream >Affects Versions: 1.9.0, 1.9.2, 1.9.3, 2.0.0 >Reporter: Boyang Jerry Peng >Assignee: Boyang Jerry Peng >Priority: Major > Labels: pull-request-available > > If I submit a Flink job in which the stream name has a "+" character, the > metrics returned for the stream is always empty. Consider the following > example: > ` > env.addSource(new TestSource()).name("testing + plus"); > ` > > For example if the operator name is: > pulsar(url: pulsar+ssl://192.168.1.198:56014) > Metrics for an operator with the above name will always return empty. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[GitHub] [flink] flinkbot edited a comment on pull request #13351: [FLINK-18990][task] Read channel state sequentially
flinkbot edited a comment on pull request #13351: URL: https://github.com/apache/flink/pull/13351#issuecomment-688741492 ## CI report: * 3e38dfe739fa34ed301dc2c0303086568f138d57 Azure: [FAILURE](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7213) Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Updated] (FLINK-3655) Allow comma-separated or multiple directories to be specified for FileInputFormat
[ https://issues.apache.org/jira/browse/FLINK-3655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated FLINK-3655: -- Labels: pull-request-available starter (was: starter) > Allow comma-separated or multiple directories to be specified for > FileInputFormat > - > > Key: FLINK-3655 > URL: https://issues.apache.org/jira/browse/FLINK-3655 > Project: Flink > Issue Type: Improvement >Affects Versions: 1.0.0 >Reporter: Gna Phetsarath >Assignee: Fabian Hueske >Priority: Major > Labels: pull-request-available, starter > Fix For: 1.5.0 > > > Allow comma-separated or multiple directories to be specified for > FileInputFormat so that a DataSource will process the directories > sequentially. > > env.readFile("/data/2016/01/01/*/*,/data/2016/01/02/*/*,/data/2016/01/03/*/*") > in Scala >env.readFile(paths: Seq[String]) > or > env.readFile(path: String, otherPaths: String*) > Wildcard support would be a bonus. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[GitHub] [flink] Jason-liujc commented on pull request #5415: [FLINK-3655] [core] Support multiple paths in FileInputFormat
Jason-liujc commented on pull request #5415: URL: https://github.com/apache/flink/pull/5415#issuecomment-703941810 Question, how do one use this with StreamExecutionEnvironment? It seems in the readFile call, the file path just gets overwritten here: https://github.com/apache/flink/blob/6c7b195d57c3bad5bc1f2251de75ac744dbbe4a7/flink-streaming-java/src/main/java/org/apache/flink/streaming/api/environment/StreamExecutionEnvironment.java#L1322 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Commented] (FLINK-15578) Implement exactly-once JDBC sink
[ https://issues.apache.org/jira/browse/FLINK-15578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17208377#comment-17208377 ] Roman Khachatryan commented on FLINK-15578: --- Hello [~klden], I think you can extend [GenericWriteAheadSink|https://ci.apache.org/projects/flink/flink-docs-release-1.11/api/java/index.html?org/apache/flink/streaming/runtime/operators/GenericWriteAheadSink.html]. It's currently used by Cassandra sinks. Probably [~chesnay] knows more about it. > Implement exactly-once JDBC sink > > > Key: FLINK-15578 > URL: https://issues.apache.org/jira/browse/FLINK-15578 > Project: Flink > Issue Type: New Feature > Components: Connectors / JDBC >Reporter: Roman Khachatryan >Priority: Major > Labels: pull-request-available > Fix For: 1.12.0 > > Time Spent: 10m > Remaining Estimate: 0h > > As per discussion in the dev mailing list, there are two options: > # Write-ahead log > # Two-phase commit (XA) > the latter being preferable. > -- This message was sent by Atlassian Jira (v8.3.4#803005)
[GitHub] [flink] flinkbot edited a comment on pull request #13536: [FLINK-19497] [metrics] implement mutator methods for FlinkCounterWrapper
flinkbot edited a comment on pull request #13536: URL: https://github.com/apache/flink/pull/13536#issuecomment-702954233 ## CI report: * 409f484f4df45e52ae812a9a66757b8cdcf18e2a Azure: [SUCCESS](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7212) Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Comment Edited] (FLINK-19401) Job stuck in restart loop due to excessive checkpoint recoveries which block the JobMaster
[ https://issues.apache.org/jira/browse/FLINK-19401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17208335#comment-17208335 ] Roman Khachatryan edited comment on FLINK-19401 at 10/5/20, 9:42 PM: - I do see that the same data is downloaded over and over, but I'm not sure whether it's the root cause. From the logs I see that heartbeats *on JM* from TMs are expired first, and heartbeats *on TMs* from JM expire two minutes later: {code:java} flink-19401 $ grep -ah -B1 'Heartbeat of Task' *.log | grep '^2020' | cut -c-16 | sort | uniq -c 66 2020-09-18 23:57 11 2020-09-19 01:06 flink-19401 $ grep -ah -B1 'heartbeat of Job' *.log | grep '^2020' | cut -c-16 | sort | uniq -c 509 2020-09-18 23:59 2 2020-09-19 00:03 2 2020-09-19 00:32 2 2020-09-19 00:37 2 2020-09-19 01:06 {code} (77 vs 517 is due to parallelism level I think) Furthermore, the first 8 recoveries start after failures in Kafka: NPE in NetworkClient and "SSLProtocolException: Handshake message sequence violation, 2". Then they happen after Kafka failures and the aforementioned TM heartbeat timeouts on JM. So it looks like that the root cause is misconfiguration and/or network. WDYT [~trohrmann], [~stevenz3wu] ? Steven, can you please also explain what Titus is doing here? (looks like it'is adding and then removing the task every minute - does it ring any bell?) {code:java} $ grep 72cc604b-dcd5-4c75-a466-0321d7c51c3e *.log 2020-09-18 23:55:43,804 - Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e is being removed from TM termination pool after 60067 ms. 2020-09-18 23:56:11,256 - Deploying Source: ee_clevent_presentation -> ee_clevent_presentationRaw -> cleventPresentation -> Flat Map (510/640) (attempt #1) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:56:11,256 - Deploying Source: k2-logtracevpc -> k2-logtracevpcRaw -> (Filter -> cybertronLogReplicated, Filter -> cybertronLogReplicatedCrossRegion, gpsRequestPivotAudit, gpsRequestPivotAuditCrossRegion) (510/640) (attempt #1) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:56:11,261 - Deploying Source: k2-ee_clevent -> k2-ee_cleventRaw -> (cleventAddToPlaylistCommand -> Flat Map, cleventThumbRating -> Flat Map) (510/640) (attempt #1) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:56:11,261 - Deploying Source: k2-defaultvpc -> k2-defaultvpcRaw -> (PssiPlaybackEvents -> Flat Map, Filter -> cybertronLogUnreplicated) (510/640) (attempt #1) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:56:11,261 - Deploying Filter -> Process -> (Sink: cybertron_joiner_client_input_0, Sink: cybertron_joiner_client_input_1) (510/640) (attempt #1) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:56:11,261 - Deploying Filter -> Process -> (Sink: cybertron_joiner_server_input_0, Sink: cybertron_joiner_server_input_1) (510/640) (attempt #1) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:56:43,872 - Adding Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e to TM termination pool because TM is not registered. 2020-09-18 23:57:43,940 - Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e is being removed from TM termination pool after 60068 ms. 2020-09-18 23:58:44,008 - Adding Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e to TM termination pool because TM is not registered. 2020-09-18 23:59:11,794 - Deploying Source: k2-defaultvpc -> k2-defaultvpcRaw -> (PssiPlaybackEvents -> Flat Map, Filter -> cybertronLogUnreplicated) (488/640) (attempt #3) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:59:27,101 - Deploying Source: k2-defaultvpc -> k2-defaultvpcRaw -> (PssiPlaybackEvents -> Flat Map, Filter -> cybertronLogUnreplicated) (478/640) (attempt #2) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:59:44,075 - Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e is being removed from TM termination pool after 60066 ms. 2020-09-19 00:00:44,144 - Adding Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e to TM termination pool because TM is not registered. 2020-09-19 00:01:44,634 - Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e is being removed from TM termination pool after 60489 ms. 2020-09-19 00:02:44,710 - Adding Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e to TM termination pool because TM is not registered. 2020-09-19 00:03:44,799 - Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e is being removed from TM termination pool after 60088 ms. 2020-09-19 00:04:44,871 - Adding Titus task
[jira] [Comment Edited] (FLINK-19401) Job stuck in restart loop due to excessive checkpoint recoveries which block the JobMaster
[ https://issues.apache.org/jira/browse/FLINK-19401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17208335#comment-17208335 ] Roman Khachatryan edited comment on FLINK-19401 at 10/5/20, 9:31 PM: - I do see that the same data is downloaded over and over, but I'm not sure whether it's the root cause. From the logs I see that heartbeat from *on JM* from TMs are expired first, and heartbeat *on TMs* expire two minutes later: {code:java} flink-19401 $ grep -ah -B1 'Heartbeat of Task' *.log | grep '^2020' | cut -c-16 | sort | uniq -c 66 2020-09-18 23:57 11 2020-09-19 01:06 flink-19401 $ grep -ah -B1 'heartbeat of Job' *.log | grep '^2020' | cut -c-16 | sort | uniq -c 509 2020-09-18 23:59 2 2020-09-19 00:03 2 2020-09-19 00:32 2 2020-09-19 00:37 2 2020-09-19 01:06 {code} (77 vs 517 is due to parallelism level I think) Furthermore, the first 8 recoveries start after failures in Kafka: NPE in NetworkClient and "SSLProtocolException: Handshake message sequence violation, 2". Then they happen after Kafka failures and the aforementioned TM heartbeat timeouts on JM. So it looks like that the root cause is misconfiguration and/or network. WDYT [~trohrmann], [~stevenz3wu] ? Steven, can you please also explain what Titus is doing here? (looks like it'is adding and then removing the task every minute - does it ring any bell?) {code:java} $ grep 72cc604b-dcd5-4c75-a466-0321d7c51c3e *.log 2020-09-18 23:55:43,804 - Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e is being removed from TM termination pool after 60067 ms. 2020-09-18 23:56:11,256 - Deploying Source: ee_clevent_presentation -> ee_clevent_presentationRaw -> cleventPresentation -> Flat Map (510/640) (attempt #1) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:56:11,256 - Deploying Source: k2-logtracevpc -> k2-logtracevpcRaw -> (Filter -> cybertronLogReplicated, Filter -> cybertronLogReplicatedCrossRegion, gpsRequestPivotAudit, gpsRequestPivotAuditCrossRegion) (510/640) (attempt #1) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:56:11,261 - Deploying Source: k2-ee_clevent -> k2-ee_cleventRaw -> (cleventAddToPlaylistCommand -> Flat Map, cleventThumbRating -> Flat Map) (510/640) (attempt #1) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:56:11,261 - Deploying Source: k2-defaultvpc -> k2-defaultvpcRaw -> (PssiPlaybackEvents -> Flat Map, Filter -> cybertronLogUnreplicated) (510/640) (attempt #1) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:56:11,261 - Deploying Filter -> Process -> (Sink: cybertron_joiner_client_input_0, Sink: cybertron_joiner_client_input_1) (510/640) (attempt #1) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:56:11,261 - Deploying Filter -> Process -> (Sink: cybertron_joiner_server_input_0, Sink: cybertron_joiner_server_input_1) (510/640) (attempt #1) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:56:43,872 - Adding Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e to TM termination pool because TM is not registered. 2020-09-18 23:57:43,940 - Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e is being removed from TM termination pool after 60068 ms. 2020-09-18 23:58:44,008 - Adding Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e to TM termination pool because TM is not registered. 2020-09-18 23:59:11,794 - Deploying Source: k2-defaultvpc -> k2-defaultvpcRaw -> (PssiPlaybackEvents -> Flat Map, Filter -> cybertronLogUnreplicated) (488/640) (attempt #3) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:59:27,101 - Deploying Source: k2-defaultvpc -> k2-defaultvpcRaw -> (PssiPlaybackEvents -> Flat Map, Filter -> cybertronLogUnreplicated) (478/640) (attempt #2) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:59:44,075 - Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e is being removed from TM termination pool after 60066 ms. 2020-09-19 00:00:44,144 - Adding Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e to TM termination pool because TM is not registered. 2020-09-19 00:01:44,634 - Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e is being removed from TM termination pool after 60489 ms. 2020-09-19 00:02:44,710 - Adding Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e to TM termination pool because TM is not registered. 2020-09-19 00:03:44,799 - Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e is being removed from TM termination pool after 60088 ms. 2020-09-19 00:04:44,871 - Adding Titus task
[jira] [Commented] (FLINK-19401) Job stuck in restart loop due to excessive checkpoint recoveries which block the JobMaster
[ https://issues.apache.org/jira/browse/FLINK-19401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17208335#comment-17208335 ] Roman Khachatryan commented on FLINK-19401: --- I do see that the same data is downloaded over and over, but I'm not sure whether it's the root cause. From the logs I see that heartbeat from *on JM* from TMs are expired first, and heartbeat *on TMs* expire two minutes later: {code:java} flink-19401 $ grep -ah -B1 'Heartbeat of Task' *.log | grep '^2020' | cut -c-16 | sort | uniq -c 66 2020-09-18 23:57 11 2020-09-19 01:06 flink-19401 $ grep -ah -B1 'heartbeat of Job' *.log | grep '^2020' | cut -c-16 | sort | uniq -c 509 2020-09-18 23:59 2 2020-09-19 00:03 2 2020-09-19 00:32 2 2020-09-19 00:37 2 2020-09-19 01:06 {code} (77 vs 517 is due to parallelism level I think) Furthermore, the first 8 recoveries start after failures in Kafka: NPE in NetworkClient and "SSLProtocolException: Handshake message sequence violation, 2". Then they happen after Kafka failures and the aforementioned TM heartbeat timeouts on JM. So it looks like that the root cause is misconfiguration and/or network. WDYT [~trohrmann], [~stevenz3wu] ? Steven, can you please also explain what does this mean: {code:java} $ grep 72cc604b-dcd5-4c75-a466-0321d7c51c3e 2020-09-18 23:55:43,804 Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e is being removed from TM termination pool after 60067 ms. 2020-09-18 23:56:11,256 Deploying Source: ee_clevent_presentation -> ee_clevent_presentationRaw -> cleventPresentation -> Flat Map (510/640) (attempt #1) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:56:11,256 Deploying Source: k2-logtracevpc -> k2-logtracevpcRaw -> (Filter -> cybertronLogReplicated, Filter -> cybertronLogReplicatedCrossRegion, gpsRequestPivotAudit, gpsRequestPivotAuditCrossRegion) (510/640) (attempt #1) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:56:11,261 Deploying Source: k2-ee_clevent -> k2-ee_cleventRaw -> (cleventAddToPlaylistCommand -> Flat Map, cleventThumbRating -> Flat Map) (510/640) (attempt #1) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:56:11,261 Deploying Source: k2-defaultvpc -> k2-defaultvpcRaw -> (PssiPlaybackEvents -> Flat Map, Filter -> cybertronLogUnreplicated) (510/640) (attempt #1) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:56:11,261 Deploying Filter -> Process -> (Sink: cybertron_joiner_client_input_0, Sink: cybertron_joiner_client_input_1) (510/640) (attempt #1) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:56:11,261 Deploying Filter -> Process -> (Sink: cybertron_joiner_server_input_0, Sink: cybertron_joiner_server_input_1) (510/640) (attempt #1) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:56:43,872 Adding Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e to TM termination pool because TM is not registered. 2020-09-18 23:57:43,940 Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e is being removed from TM termination pool after 60068 ms. 2020-09-18 23:58:44,008 Adding Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e to TM termination pool because TM is not registered. 2020-09-18 23:59:11,794 Deploying Source: k2-defaultvpc -> k2-defaultvpcRaw -> (PssiPlaybackEvents -> Flat Map, Filter -> cybertronLogUnreplicated) (488/640) (attempt #3) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:59:27,101 Deploying Source: k2-defaultvpc -> k2-defaultvpcRaw -> (PssiPlaybackEvents -> Flat Map, Filter -> cybertronLogUnreplicated) (478/640) (attempt #2) to 1273fbc6edbab45548807742b2db6c4e @ 72cc604b-dcd5-4c75-a466-0321d7c51c3e (dataPort=39833) 2020-09-18 23:59:44,075 Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e is being removed from TM termination pool after 60066 ms. 2020-09-19 00:00:44,144 Adding Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e to TM termination pool because TM is not registered. 2020-09-19 00:01:44,634 Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e is being removed from TM termination pool after 60489 ms. 2020-09-19 00:02:44,710 Adding Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e to TM termination pool because TM is not registered. 2020-09-19 00:03:44,799 Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e is being removed from TM termination pool after 60088 ms. 2020-09-19 00:04:44,871 Adding Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e to TM termination pool because TM is not registered. 2020-09-19 00:05:44,938 Titus task 72cc604b-dcd5-4c75-a466-0321d7c51c3e is being removed from TM termination pool
[GitHub] [flink] flinkbot edited a comment on pull request #13351: [FLINK-18990][task] Read channel state sequentially
flinkbot edited a comment on pull request #13351: URL: https://github.com/apache/flink/pull/13351#issuecomment-688741492 ## CI report: * e8c03361de3a583a8da862bcf3839de214c5cb83 Azure: [SUCCESS](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7209) * 3e38dfe739fa34ed301dc2c0303086568f138d57 Azure: [PENDING](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7213) Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] flinkbot edited a comment on pull request #13521: [FLINK-19472] Implement a one input sorting DataInput
flinkbot edited a comment on pull request #13521: URL: https://github.com/apache/flink/pull/13521#issuecomment-701282459 ## CI report: * c5164bceb05c9e11c652fdb2a0cefe289af0ec1b UNKNOWN * 6202374f19277f6ed8a7d4e6ec2ab0985e0f054a Azure: [FAILURE](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7206) Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] flinkbot edited a comment on pull request #13530: [FLINK-19123] Use shared MiniCluster for executeAsync() in TestStreamEnvironment
flinkbot edited a comment on pull request #13530: URL: https://github.com/apache/flink/pull/13530#issuecomment-702582807 ## CI report: * c0c3a012813e6bf76ee374073218ab647f1aa1be UNKNOWN * 47ca19a74e11c72842124852875262959477c459 Azure: [SUCCESS](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7207) Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] flinkbot edited a comment on pull request #13351: [FLINK-18990][task] Read channel state sequentially
flinkbot edited a comment on pull request #13351: URL: https://github.com/apache/flink/pull/13351#issuecomment-688741492 ## CI report: * e8c03361de3a583a8da862bcf3839de214c5cb83 Azure: [SUCCESS](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7209) * 3e38dfe739fa34ed301dc2c0303086568f138d57 UNKNOWN Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] rkhachatryan commented on a change in pull request #13351: [FLINK-18990][task] Read channel state sequentially
rkhachatryan commented on a change in pull request #13351: URL: https://github.com/apache/flink/pull/13351#discussion_r499823224 ## File path: flink-streaming-java/src/test/java/org/apache/flink/streaming/runtime/tasks/StreamTaskMultipleInputSelectiveReadingTest.java ## @@ -183,6 +183,9 @@ public void testInputStarvation() throws Exception { .setupOutputForSingletonOperatorChain(new TestInputStarvationMultipleInputOperatorFactory()) .build()) { + for (int i = 0; i < testHarness.inputGates.length; i++) { + testHarness.processSingleStep(); + } Review comment: Yes! :smile: This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] flinkbot edited a comment on pull request #13539: [FLINK-19027][network] Assign exclusive buffers to LocalRecoveredInputChannel.
flinkbot edited a comment on pull request #13539: URL: https://github.com/apache/flink/pull/13539#issuecomment-703227980 ## CI report: * 693a3c828e7c7b8782ad978ff7fb63288ef2f4d9 UNKNOWN * 36f3c775a53ddae8951b3ab9bccc295de694c9de UNKNOWN * 52830dff69c113aeb1a53a1cbe3fd859c7863a60 Azure: [SUCCESS](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7208) Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] jgrier commented on pull request #13514: [FLINK-19468][DataStream API] Metrics return empty when data stream / operator name contains "+"
jgrier commented on pull request #13514: URL: https://github.com/apache/flink/pull/13514#issuecomment-703802523 @jerrypeng I'm not 100% this is the correct place to make this change. Let's add more detail to the JIRA about exactly what the issue is. I believe at this layer the '+' character should actually be just fine. What is the symptom exactly? This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] flinkbot edited a comment on pull request #13536: [FLINK-19497] [metrics] implement mutator methods for FlinkCounterWrapper
flinkbot edited a comment on pull request #13536: URL: https://github.com/apache/flink/pull/13536#issuecomment-702954233 ## CI report: * 74e35eda48683e5879285ce57ec68cfd27bbdf00 Azure: [FAILURE](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7210) * 409f484f4df45e52ae812a9a66757b8cdcf18e2a Azure: [PENDING](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7212) Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] flinkbot edited a comment on pull request #13514: [FLINK-19468][DataStream API] Metrics return empty when data stream / operator name contains "+"
flinkbot edited a comment on pull request #13514: URL: https://github.com/apache/flink/pull/13514#issuecomment-701038173 ## CI report: * 8a72f73417ceb18536d282679e8ce5c60f634acb Azure: [FAILURE](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7100) Azure: [FAILURE](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7211) Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] jgrier removed a comment on pull request #13514: [FLINK-19468][DataStream API] Metrics return empty when data stream / operator name contains "+"
jgrier removed a comment on pull request #13514: URL: https://github.com/apache/flink/pull/13514#issuecomment-703774722 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] flinkbot edited a comment on pull request #13536: [FLINK-19497] [metrics] implement mutator methods for FlinkCounterWrapper
flinkbot edited a comment on pull request #13536: URL: https://github.com/apache/flink/pull/13536#issuecomment-702954233 ## CI report: * 74e35eda48683e5879285ce57ec68cfd27bbdf00 Azure: [FAILURE](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7210) * 409f484f4df45e52ae812a9a66757b8cdcf18e2a UNKNOWN Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] flinkbot edited a comment on pull request #13514: [FLINK-19468][DataStream API] Metrics return empty when data stream / operator name contains "+"
flinkbot edited a comment on pull request #13514: URL: https://github.com/apache/flink/pull/13514#issuecomment-701038173 ## CI report: * 8a72f73417ceb18536d282679e8ce5c60f634acb Azure: [FAILURE](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7100) Azure: [PENDING](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7211) Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] pnowojski commented on a change in pull request #13351: [FLINK-18990][task] Read channel state sequentially
pnowojski commented on a change in pull request #13351: URL: https://github.com/apache/flink/pull/13351#discussion_r499760494 ## File path: flink-streaming-java/src/test/java/org/apache/flink/streaming/runtime/tasks/StreamTaskMultipleInputSelectiveReadingTest.java ## @@ -197,22 +198,21 @@ public void testInputStarvation() throws Exception { testHarness.processElement(new StreamRecord<>("3"), 1); testHarness.processElement(new StreamRecord<>("4"), 1); - testHarness.processSingleStep(); expectedOutput.add(new StreamRecord<>("[2]: 1")); - testHarness.processSingleStep(); expectedOutput.add(new StreamRecord<>("[2]: 2")); - assertThat(testHarness.getOutput(), contains(expectedOutput.toArray())); + testHarness.processAll(); + assertEquals(expectedOutput, new ArrayList<>(testHarness.getOutput()).subList(0, expectedOutput.size())); Review comment: Hmmm, maybe replace it with more future proof condition? sth like: ``` boolean noElementFromInputGate3 = true; steps = 0; while (noElementFromInputGate3 && steps++ < 100 && testHarness.procesSingleStep()) { noElementFromInputGate3 = ... // check for presence of `new StreamRecord<>("[3]: 1"))` } ``` ? Or maybe https://github.com/apache/flink/pull/13351#discussion_r499762014 ? This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] pnowojski commented on a change in pull request #13351: [FLINK-18990][task] Read channel state sequentially
pnowojski commented on a change in pull request #13351: URL: https://github.com/apache/flink/pull/13351#discussion_r499762014 ## File path: flink-streaming-java/src/test/java/org/apache/flink/streaming/runtime/tasks/StreamTaskMultipleInputSelectiveReadingTest.java ## @@ -183,6 +183,9 @@ public void testInputStarvation() throws Exception { .setupOutputForSingletonOperatorChain(new TestInputStarvationMultipleInputOperatorFactory()) .build()) { + for (int i = 0; i < testHarness.inputGates.length; i++) { + testHarness.processSingleStep(); + } Review comment: Would it work if replaced with `testHarness.processAll` ? This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] jgrier commented on pull request #13514: [FLINK-19468][DataStream API] Metrics return empty when data stream / operator name contains "+"
jgrier commented on pull request #13514: URL: https://github.com/apache/flink/pull/13514#issuecomment-703778459 @flinkbot run azure This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] pnowojski commented on a change in pull request #13351: [FLINK-18990][task] Read channel state sequentially
pnowojski commented on a change in pull request #13351: URL: https://github.com/apache/flink/pull/13351#discussion_r499760494 ## File path: flink-streaming-java/src/test/java/org/apache/flink/streaming/runtime/tasks/StreamTaskMultipleInputSelectiveReadingTest.java ## @@ -197,22 +198,21 @@ public void testInputStarvation() throws Exception { testHarness.processElement(new StreamRecord<>("3"), 1); testHarness.processElement(new StreamRecord<>("4"), 1); - testHarness.processSingleStep(); expectedOutput.add(new StreamRecord<>("[2]: 1")); - testHarness.processSingleStep(); expectedOutput.add(new StreamRecord<>("[2]: 2")); - assertThat(testHarness.getOutput(), contains(expectedOutput.toArray())); + testHarness.processAll(); + assertEquals(expectedOutput, new ArrayList<>(testHarness.getOutput()).subList(0, expectedOutput.size())); Review comment: Hmmm, maybe replace it with more future proof condition? sth like: ``` boolean noElementFromInputGate3 = true; steps = 0; while (noElementFromInputGate3 && steps++ < 100 && testHarness.procesSingleStep()) { noElementFromInputGate3 = ... // check for presence of `new StreamRecord<>("[3]: 1"))` } ``` ? This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] pnowojski commented on a change in pull request #13351: [FLINK-18990][task] Read channel state sequentially
pnowojski commented on a change in pull request #13351: URL: https://github.com/apache/flink/pull/13351#discussion_r499760494 ## File path: flink-streaming-java/src/test/java/org/apache/flink/streaming/runtime/tasks/StreamTaskMultipleInputSelectiveReadingTest.java ## @@ -197,22 +198,21 @@ public void testInputStarvation() throws Exception { testHarness.processElement(new StreamRecord<>("3"), 1); testHarness.processElement(new StreamRecord<>("4"), 1); - testHarness.processSingleStep(); expectedOutput.add(new StreamRecord<>("[2]: 1")); - testHarness.processSingleStep(); expectedOutput.add(new StreamRecord<>("[2]: 2")); - assertThat(testHarness.getOutput(), contains(expectedOutput.toArray())); + testHarness.processAll(); + assertEquals(expectedOutput, new ArrayList<>(testHarness.getOutput()).subList(0, expectedOutput.size())); Review comment: Hmmm, maybe replace it with more reliable condition, sth like: ``` boolean noElementFromInputGate3 = true; steps = 0; while (noElementFromInputGate3 && steps++ < 100 && testHarness.procesSingleStep()) { noElementFromInputGate3 = ... // check for presence of `new StreamRecord<>("[3]: 1"))` } ``` ? This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] jgrier commented on pull request #13514: [FLINK-19468][DataStream API] Metrics return empty when data stream / operator name contains "+"
jgrier commented on pull request #13514: URL: https://github.com/apache/flink/pull/13514#issuecomment-703774722 @flinkbot approve all This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [flink] flinkbot edited a comment on pull request #13541: [FLINK-19422] Upgrade Kafka and schema registry versions in the avro registry e2e test
flinkbot edited a comment on pull request #13541: URL: https://github.com/apache/flink/pull/13541#issuecomment-703617870 ## CI report: * 098133ce65783b3a119851d3011c7b88dce76318 Azure: [SUCCESS](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7205) Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Assigned] (FLINK-19468) Metrics return empty when data stream / operator name contains "+"
[ https://issues.apache.org/jira/browse/FLINK-19468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jamie Grier reassigned FLINK-19468: --- Assignee: Boyang Jerry Peng > Metrics return empty when data stream / operator name contains "+" > -- > > Key: FLINK-19468 > URL: https://issues.apache.org/jira/browse/FLINK-19468 > Project: Flink > Issue Type: Bug > Components: API / DataStream >Affects Versions: 1.9.0, 1.9.2, 1.9.3, 2.0.0 >Reporter: Boyang Jerry Peng >Assignee: Boyang Jerry Peng >Priority: Major > Labels: pull-request-available > > There is an issue in which the special character "+" is not removed from the > data stream / operator name which causes metrics for the operator to not be > properly returned. Code Reference: > [https://github.com/apache/flink/blob/master/flink-runtime/src/main/java/org/apache/flink/runtime/metrics/dump/MetricQueryService.java#L208] > > For example if the operator name is: > pulsar(url: pulsar+ssl://192.168.1.198:56014) > Metrics for an operator with the above name will always return empty. > -- This message was sent by Atlassian Jira (v8.3.4#803005)
[GitHub] [flink] flinkbot edited a comment on pull request #13531: [FLINK-19318] Deprecate timeWindow() operations in DataStream API
flinkbot edited a comment on pull request #13531: URL: https://github.com/apache/flink/pull/13531#issuecomment-702591517 ## CI report: * 4609bf890fe953a697b15f26280729abc36ea913 Azure: [SUCCESS](https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_build/results?buildId=7204) Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run travis` re-run the last Travis build - `@flinkbot run azure` re-run the last Azure build This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org