github-actions[bot] commented on issue #13714:
URL: 
https://github.com/apache/dolphinscheduler/issues/13714#issuecomment-1462223070

   ### Search before asking
   
   - [X] I had searched in the [issues](https://github.com/apache/dolphinscheduler/issues?q=is%3Aissue) and found no similar issues.
   
   
   ### What happened
   
   With a cluster deployment, tasks can never be dispatched.
   ```
   [WARN] 2023-03-09 15:02:30.594 +0000 org.apache.dolphinscheduler.remote.NettyRemotingClient:[315] - [WorkflowInstance-0][TaskInstance-0] - connect to Host{hostWeight=100, workerGroup='release'} error
   java.net.UnknownHostException: dolphinscheduler-worker-0
           at java.net.InetAddress$CachedAddresses.get(InetAddress.java:764)
           at java.net.InetAddress.getAllByName0(InetAddress.java:1282)
           at java.net.InetAddress.getAllByName(InetAddress.java:1140)
           at java.net.InetAddress.getAllByName(InetAddress.java:1064)
           at java.net.InetAddress.getByName(InetAddress.java:1014)
           at io.netty.util.internal.SocketUtils$8.run(SocketUtils.java:156)
           at io.netty.util.internal.SocketUtils$8.run(SocketUtils.java:153)
           at java.security.AccessController.doPrivileged(Native Method)
           at io.netty.util.internal.SocketUtils.addressByName(SocketUtils.java:153)
           at io.netty.resolver.DefaultNameResolver.doResolve(DefaultNameResolver.java:41)
           at io.netty.resolver.SimpleNameResolver.resolve(SimpleNameResolver.java:61)
           at io.netty.resolver.SimpleNameResolver.resolve(SimpleNameResolver.java:53)
           at io.netty.resolver.InetSocketAddressResolver.doResolve(InetSocketAddressResolver.java:55)
           at io.netty.resolver.InetSocketAddressResolver.doResolve(InetSocketAddressResolver.java:31)
           at io.netty.resolver.AbstractAddressResolver.resolve(AbstractAddressResolver.java:106)
           at io.netty.bootstrap.Bootstrap.doResolveAndConnect0(Bootstrap.java:206)
           at io.netty.bootstrap.Bootstrap.access$000(Bootstrap.java:46)
           at io.netty.bootstrap.Bootstrap$1.operationComplete(Bootstrap.java:180)
           at io.netty.bootstrap.Bootstrap$1.operationComplete(Bootstrap.java:166)
           at io.netty.util.concurrent.DefaultPromise.notifyListener0(DefaultPromise.java:577)
           at io.netty.util.concurrent.DefaultPromise.notifyListenersNow(DefaultPromise.java:551)
           at io.netty.util.concurrent.DefaultPromise.notifyListeners(DefaultPromise.java:490)
           at io.netty.util.concurrent.DefaultPromise.setValue0(DefaultPromise.java:615)
           at io.netty.util.concurrent.DefaultPromise.setSuccess0(DefaultPromise.java:604)
           at io.netty.util.concurrent.DefaultPromise.trySuccess(DefaultPromise.java:104)
           at io.netty.channel.DefaultChannelPromise.trySuccess(DefaultChannelPromise.java:84)
           at io.netty.channel.AbstractChannel$AbstractUnsafe.safeSetSuccess(AbstractChannel.java:989)
           at io.netty.channel.AbstractChannel$AbstractUnsafe.register0(AbstractChannel.java:504)
           at io.netty.channel.AbstractChannel$AbstractUnsafe.access$200(AbstractChannel.java:417)
           at io.netty.channel.AbstractChannel$AbstractUnsafe$1.run(AbstractChannel.java:474)
           at io.netty.util.concurrent.AbstractEventExecutor.safeExecute(AbstractEventExecutor.java:164)
           at io.netty.util.concurrent.SingleThreadEventExecutor.runAllTasks(SingleThreadEventExecutor.java:472)
           at io.netty.channel.epoll.EpollEventLoop.run(EpollEventLoop.java:384)
           at io.netty.util.concurrent.SingleThreadEventExecutor$4.run(SingleThreadEventExecutor.java:989)
           at io.netty.util.internal.ThreadExecutorMap$2.run(ThreadExecutorMap.java:74)
           at java.lang.Thread.run(Thread.java:750)
   ```
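
The trace shows the master's Netty client failing at name resolution for `dolphinscheduler-worker-0`, before any connection is attempted. A minimal sketch for checking this from inside the master container, assuming `getent` is available in the image (it ships with glibc). The helper name `check_resolves` is hypothetical, and `getent hosts` only approximates the lookup the JVM performs:

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeeds only if the name resolves through the
# system resolver (/etc/hosts, DNS, ... as configured in nsswitch.conf).
check_resolves() {
    getent hosts "$1" > /dev/null
}

# Example (run inside the master container):
#   check_resolves dolphinscheduler-worker-0 || echo "cannot resolve worker"
```

If the check fails in the master container but succeeds in the worker container, the name is probably only known locally to the worker.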
   The worker group is:
   <img width="910" alt="image" src="https://user-images.githubusercontent.com/25928906/224064334-03a24dcc-52b7-44c1-a468-8bddada34a70.png">
   
   The ZooKeeper registration info is:
   ```
   [zk: localhost:2181(CONNECTED) 3] ls /dolphinscheduler/nodes/master
   [dolphinscheduler-master-0:5678]
   [zk: localhost:2181(CONNECTED) 4] ls /dolphinscheduler/nodes/worker
   [dolphinscheduler-worker-0:1234, dolphinscheduler-worker-1:1234, dolphinscheduler-worker-2:1234]
   ```
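
The workers are registered under host names rather than IP addresses, so the master has to be able to resolve each of them. A small sketch that turns one `ls` result line into a list of host names to test. The function name `registered_hosts` is hypothetical, and the input format is assumed to be exactly the bracketed `host:port` list shown above:

```shell
#!/usr/bin/env bash
# Hypothetical helper: extract host names from a zkCli `ls` result such as
# "[host-a:1234, host-b:1234]" and print them one per line.
registered_hosts() {
    # Drop brackets and spaces, split on commas, then strip the ":port" suffix.
    printf '%s\n' "$1" | tr -d '[] ' | tr ',' '\n' | sed 's/:[0-9]*$//'
}

# Example:
#   registered_hosts '[dolphinscheduler-worker-0:1234, dolphinscheduler-worker-1:1234]'
```

Each printed name can then be looked up from the master container, for instance with `getent hosts <name>`, to see which registrations the master cannot reach by name.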
   However, with a Standalone deployment, tasks can be dispatched.
   The worker group is:
   <img width="940" alt="image" src="https://user-images.githubusercontent.com/25928906/224065301-ebc9bdb8-247d-4d05-9349-f1a656077834.png">
   
   
   
   
   
   ### What you expected to happen
   
   The master can dispatch tasks to the workers.
   
   ### How to reproduce
   
   My Dockerfile:
   ```
   FROM csighub.tencentyun.com/tjdk/tencentkona8-tlinux
   RUN yum install -y  vim tar zip unzip procps psmisc pbzip2 lsof pigz
   RUN yum install -y centos-release-scl
   RUN yum install -y rh-python38
   RUN yum install -y hostname
   RUN rm /bin/python
   RUN ln -s /opt/rh/rh-python38/root/bin/python3 /bin/python
   RUN ln -s /opt/rh/rh-python38/root/bin/pip /bin/pip
   RUN ln -s /opt/jdk/bin/java /bin/java
   COPY ./requirements.txt /root/requirements.txt
   RUN pip install --ignore-installed PyYAML==6.0 \
        && pip install apollo-python-client==2.1.2 --extra-index-url http://pypi.woa.com/ --trusted-host pypi.woa.com \
        && python -m pip install -r /root/requirements.txt
   
   RUN mkdir  /app
   COPY . /app/
   WORKDIR /app
   ENV TZ=Asia/Shanghai
   CMD ["bash","/app/run.sh"] 
   ```
   
   app/run.sh
   ```
   if [ -z "$TYPE" ]; then
       echo "[run.sh] environment variable TYPE not exist"
       exit 255
   fi
   
   if [ "$TYPE" == "zookeeper" ];then
       bash /app/apache-zookeeper-3.8.0-bin/bin/zkServer.sh start
   fi
   prefix="/app/apache-dolphinscheduler-3.1.4-bin"
   if [ "$TYPE" == "master" ];then
    bash $prefix/bin/dolphinscheduler-daemon.sh start master-server
   fi
   if [ "$TYPE" == "worker" ];then
     bash $prefix/bin/dolphinscheduler-daemon.sh start worker-server
   fi
   if [ "$TYPE" == "alert" ];then
     bash $prefix/bin/dolphinscheduler-daemon.sh start alert-server
   fi
   if [ "$TYPE" == "api" ];then
     bash $prefix/bin/dolphinscheduler-daemon.sh start api-server
   fi
   sleep 99999999999
   ```
   
   
   ### Anything else
   
   I changed the following line in /apache-dolphinscheduler-3.1.4-bin/bin/dolphinscheduler-daemon.sh:
   
   export HOSTNAME=`hostname`  
   to
   
   export HOSTNAME=`hostname -I`  
   but it still does not work.
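
One detail about that change: `hostname -I` prints every configured address of the host, separated by spaces and followed by a trailing space, so the exported value is not necessarily a single clean IP. A hedged sketch that keeps only the first address follows. The helper name `first_address` is hypothetical, and whether the server actually uses `HOSTNAME` for the address it registers is not established in this report:

```shell
#!/usr/bin/env bash
# Hypothetical helper: return the first whitespace-separated token of its
# argument, e.g. the first address in the output of `hostname -I`.
first_address() {
    printf '%s\n' "$1" | awk '{ print $1 }'
}

# Possible use in dolphinscheduler-daemon.sh (untested assumption):
#   export HOSTNAME="$(first_address "$(hostname -I)")"
```

On a container with several interfaces the first address is not guaranteed to be the one other nodes can reach, so the chosen value should still be checked against the ZooKeeper registration.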
   
   ### Version
   
   3.1.x
   
   ### Are you willing to submit PR?
   
   - [X] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct)


