xintongsong commented on issue #11284: [FLINK-15911][runtime] Make Flink work with NAT. URL: https://github.com/apache/flink/pull/11284#issuecomment-595001635 @tillrohrmann I checked the test failures. The failures on Azure are all unrelated. The NAT e2e test failure on Travis is cause by error message in logs. - The job is executed successfully, with correct result outputted. - The error message is about the RM RPC service not started yet when one of the TMs tries to connect to it. - The following JM log shows that the message is successfully received before discarded, indicating the TM has no problem resolving the correct RM address and RPC port. `The rpc endpoint org.apache.flink.runtime.resourcemanager.StandaloneResourceManager has not been started yet. Discarding message org.apache.flink.runtime.rpc.messages.RemoteFencedMessage until processing is started.` - I think this is not a real problem, because TM will retry connecting to RM later. I've set `skip_check_exceptions` for this test case. I think relying on the result hash check should be enough.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services
