[
https://issues.apache.org/jira/browse/FLINK-23196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17372498#comment-17372498
]
Yangyang ZHANG edited comment on FLINK-23196 at 7/1/21, 8:00 AM:
-----------------------------------------------------------------
[~xintongsong] I met the same problem recently. The reason may be concurrent
MiniCluster with the default port in different concurrent tests and the port
conflict causes the error.
I explicitly set the rest and rpc port to 0 in JobMasterITCase and passed the
tests as is in MiniClusterResource where both rest and RPC port are set to 0 to
avoid clashes with concurrent MiniClusters.
There are already some tests where the port is set to 0 when using MiniCluster
directly.
But there are other cases that use MiniCluster without setting the port to 0
and may cause this issue when running concurrently. Maybe we can find all these
cases and correct them.
See:
MiniClusterResource
[https://github.com/apache/flink/blob/a40abc7f834888a5f42efeefa662ad6ad5d7c222/flink-runtime/src/test/java/org/apache/flink/runtime/testutils/MiniClusterResource.java#L185]
My Fix:
https://github.com/zhangyy91/flink/blob/170d40507599618f471e46b3b2843fb83234100f/flink-tests/src/test/java/org/apache/flink/runtime/jobmaster/JobMasterITCase.java#L53
was (Author: zhangyy91):
[~xintongsong] I met the same problem recently. The reason may be concurrent
MiniCluster with the default port in different concurrent tests and the port
conflict causes the error.
I explicitly set the rest and rpc port to 0 in JobMasterITCase and passed the
tests as is in MiniClusterResource where both rest and RPC port are set to 0 to
avoid clashes with concurrent MiniClusters.
There are already some tests where the port is set to 0 when using MiniCluster
directly.
But there are other cases that use MiniCluster without setting the port to 0
and may cause this issue when running concurrently. Maybe we can find all these
cases and correct them.
See:
https://github.com/apache/flink/blob/a40abc7f834888a5f42efeefa662ad6ad5d7c222/flink-runtime/src/test/java/org/apache/flink/runtime/testutils/MiniClusterResource.java#L185
> JobMasterITCase fail on azure due to BindException
> --------------------------------------------------
>
> Key: FLINK-23196
> URL: https://issues.apache.org/jira/browse/FLINK-23196
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Coordination
> Affects Versions: 1.14.0
> Reporter: Xintong Song
> Assignee: Chesnay Schepler
> Priority: Major
> Labels: pull-request-available, test-stability
> Fix For: 1.14.0
>
>
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=19753&view=logs&j=39d5b1d5-3b41-54dc-6458-1e2ddd1cdcf3&t=a99e99c7-21cd-5a1f-7274-585e62b72f56&l=4251
> {code}
> Jul 01 00:00:27 [ERROR] Tests run: 2, Failures: 0, Errors: 1, Skipped: 0,
> Time elapsed: 4.272 s <<< FAILURE! - in
> org.apache.flink.runtime.jobmaster.JobMasterITCase
> Jul 01 00:00:27 [ERROR]
> testRejectionOfEmptyJobGraphs(org.apache.flink.runtime.jobmaster.JobMasterITCase)
> Time elapsed: 3.009 s <<< ERROR!
> Jul 01 00:00:27 org.apache.flink.util.FlinkException: Could not create the
> DispatcherResourceManagerComponent.
> Jul 01 00:00:27 at
> org.apache.flink.runtime.entrypoint.component.DefaultDispatcherResourceManagerComponentFactory.create(DefaultDispatcherResourceManagerComponentFactory.java:275)
> Jul 01 00:00:27 at
> org.apache.flink.runtime.minicluster.MiniCluster.createDispatcherResourceManagerComponents(MiniCluster.java:470)
> Jul 01 00:00:27 at
> org.apache.flink.runtime.minicluster.MiniCluster.setupDispatcherResourceManagerComponents(MiniCluster.java:429)
> Jul 01 00:00:27 at
> org.apache.flink.runtime.minicluster.MiniCluster.start(MiniCluster.java:373)
> Jul 01 00:00:27 at
> org.apache.flink.runtime.jobmaster.JobMasterITCase.testRejectionOfEmptyJobGraphs(JobMasterITCase.java:56)
> Jul 01 00:00:27 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native
> Method)
> Jul 01 00:00:27 at
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> Jul 01 00:00:27 at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> Jul 01 00:00:27 at java.lang.reflect.Method.invoke(Method.java:498)
> Jul 01 00:00:27 at
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:59)
> Jul 01 00:00:27 at
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
> Jul 01 00:00:27 at
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:56)
> Jul 01 00:00:27 at
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
> Jul 01 00:00:27 at
> org.apache.flink.util.TestNameProvider$1.evaluate(TestNameProvider.java:45)
> Jul 01 00:00:27 at
> org.junit.rules.TestWatcher$1.evaluate(TestWatcher.java:61)
> Jul 01 00:00:27 at
> org.junit.runners.ParentRunner$3.evaluate(ParentRunner.java:306)
> Jul 01 00:00:27 at
> org.junit.runners.BlockJUnit4ClassRunner$1.evaluate(BlockJUnit4ClassRunner.java:100)
> Jul 01 00:00:27 at
> org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:366)
> Jul 01 00:00:27 at
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:103)
> Jul 01 00:00:27 at
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:63)
> Jul 01 00:00:27 at
> org.junit.runners.ParentRunner$4.run(ParentRunner.java:331)
> Jul 01 00:00:27 at
> org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:79)
> Jul 01 00:00:27 at
> org.junit.runners.ParentRunner.runChildren(ParentRunner.java:329)
> Jul 01 00:00:27 at
> org.junit.runners.ParentRunner.access$100(ParentRunner.java:66)
> Jul 01 00:00:27 at
> org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:293)
> Jul 01 00:00:27 at
> org.junit.runners.ParentRunner$3.evaluate(ParentRunner.java:306)
> Jul 01 00:00:27 at
> org.junit.runners.ParentRunner.run(ParentRunner.java:413)
> Jul 01 00:00:27 at
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365)
> Jul 01 00:00:27 at
> org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:273)
> Jul 01 00:00:27 at
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:238)
> Jul 01 00:00:27 at
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159)
> Jul 01 00:00:27 at
> org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:384)
> Jul 01 00:00:27 at
> org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:345)
> Jul 01 00:00:27 at
> org.apache.maven.surefire.booter.ForkedBooter.execute(ForkedBooter.java:126)
> Jul 01 00:00:27 at
> org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:418)
> Jul 01 00:00:27 Caused by: java.net.BindException: Could not start rest
> endpoint on any port in port range 8081
> Jul 01 00:00:27 at
> org.apache.flink.runtime.rest.RestServerEndpoint.start(RestServerEndpoint.java:234)
> Jul 01 00:00:27 at
> org.apache.flink.runtime.entrypoint.component.DefaultDispatcherResourceManagerComponentFactory.create(DefaultDispatcherResourceManagerComponentFactory.java:172)
> Jul 01 00:00:27 ... 34 more
> {code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)