[ 
https://issues.apache.org/jira/browse/FLINK-30941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17685310#comment-17685310
 ] 

Matthias Pohl edited comment on FLINK-30941 at 2/7/23 1:42 PM:
---------------------------------------------------------------

There's also a "address already in use" error in between:
{code}
Feb 06 08:52:45 Step 5/14 : ENV FLINK_TGZ_URL=localhost:9999/flink.tgz     
FLINK_ASC_URL=     GPG_KEY=     CHECK_GPG=false
Feb 06 08:52:45  ---> Running in 562dc136a900
Traceback (most recent call last):
  File 
"/home/vsts/work/1/s/flink-end-to-end-tests/test-scripts/python3_fileserver.py",
 line 26, in <module>
    httpd = socketserver.TCPServer(("", 9999), handler)
  File "/usr/lib/python3.8/socketserver.py", line 452, in __init__
    self.server_bind()
  File "/usr/lib/python3.8/socketserver.py", line 466, in server_bind
    self.socket.bind(self.server_address)
OSError: [Errno 98] Address already in use
{code}
https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=45748&view=logs&j=bea52777-eaf8-5663-8482-18fbc3630e81&t=b2642e3a-5b86-574d-4c8a-f7e2842bfb14&l=4971

And a test_docker_embedded_job test failure
{code}
Feb 06 08:52:12 + gpg --batch --keyserver hkp://p80.pool.sks-keyservers.net:80 
--recv-keys B42F6819007F00F88E364FD4036A9C25BF357DD4
Feb 06 08:52:18 The command 'docker build --no-cache --network=host -t 
test_docker_embedded_job dev/test_docker_embedded_job-ubuntu' (pid: 230499) did 
not finish after 600 seconds.
Terminated
{code}
https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=45748&view=logs&j=bea52777-eaf8-5663-8482-18fbc3630e81&t=b2642e3a-5b86-574d-4c8a-f7e2842bfb14&l=4922

{{git clone}} also fails because a previous run wasn't properly cleaned up
{code}
Feb 06 08:42:03 Executing command: git clone 
https://github.com/apache/flink-docker.git --branch dev-1.16 --single-branch
fatal: destination path 'flink-docker' already exists and is not an empty 
directory.
Feb 06 08:42:03 Retry 1/5 exited 128, retrying in 1 seconds...
fatal: destination path 'flink-docker' already exists and is not an empty 
directory.
Feb 06 08:42:04 Retry 2/5 exited 128, retrying in 2 seconds...
fatal: destination path 'flink-docker' already exists and is not an empty 
directory.
Feb 06 08:42:06 Retry 3/5 exited 128, retrying in 4 seconds...
fatal: destination path 'flink-docker' already exists and is not an empty 
directory.
Feb 06 08:42:10 Retry 4/5 exited 128, retrying in 8 seconds...
fatal: destination path 'flink-docker' already exists and is not an empty 
directory.
Feb 06 08:42:18 Retry 5/5 exited 128, no more retries left.
{code}
https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=45748&view=logs&j=bea52777-eaf8-5663-8482-18fbc3630e81&t=b2642e3a-5b86-574d-4c8a-f7e2842bfb14&l=4709

It feels like this build have generally quite some inconsistencies


was (Author: mapohl):
There's also a "address already in use" error in between:
{code}
Feb 06 08:52:45 Step 5/14 : ENV FLINK_TGZ_URL=localhost:9999/flink.tgz     
FLINK_ASC_URL=     GPG_KEY=     CHECK_GPG=false
Feb 06 08:52:45  ---> Running in 562dc136a900
Traceback (most recent call last):
  File 
"/home/vsts/work/1/s/flink-end-to-end-tests/test-scripts/python3_fileserver.py",
 line 26, in <module>
    httpd = socketserver.TCPServer(("", 9999), handler)
  File "/usr/lib/python3.8/socketserver.py", line 452, in __init__
    self.server_bind()
  File "/usr/lib/python3.8/socketserver.py", line 466, in server_bind
    self.socket.bind(self.server_address)
OSError: [Errno 98] Address already in use
{code}
https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=45748&view=logs&j=bea52777-eaf8-5663-8482-18fbc3630e81&t=b2642e3a-5b86-574d-4c8a-f7e2842bfb14&l=4971

And a test_docker_embedded_job test failure
{code}
Feb 06 08:52:12 + gpg --batch --keyserver hkp://p80.pool.sks-keyservers.net:80 
--recv-keys B42F6819007F00F88E364FD4036A9C25BF357DD4
Feb 06 08:52:18 The command 'docker build --no-cache --network=host -t 
test_docker_embedded_job dev/test_docker_embedded_job-ubuntu' (pid: 230499) did 
not finish after 600 seconds.
Terminated
{code}
https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=45748&view=logs&j=bea52777-eaf8-5663-8482-18fbc3630e81&t=b2642e3a-5b86-574d-4c8a-f7e2842bfb14&l=4922

{{git clone}} also fails because a previous run wasn't properly cleaned up
{code}
Feb 06 08:42:03 Executing command: git clone 
https://github.com/apache/flink-docker.git --branch dev-1.16 --single-branch
fatal: destination path 'flink-docker' already exists and is not an empty 
directory.
Feb 06 08:42:03 Retry 1/5 exited 128, retrying in 1 seconds...
fatal: destination path 'flink-docker' already exists and is not an empty 
directory.
Feb 06 08:42:04 Retry 2/5 exited 128, retrying in 2 seconds...
fatal: destination path 'flink-docker' already exists and is not an empty 
directory.
Feb 06 08:42:06 Retry 3/5 exited 128, retrying in 4 seconds...
fatal: destination path 'flink-docker' already exists and is not an empty 
directory.
Feb 06 08:42:10 Retry 4/5 exited 128, retrying in 8 seconds...
fatal: destination path 'flink-docker' already exists and is not an empty 
directory.
Feb 06 08:42:18 Retry 5/5 exited 128, no more retries left.
{code}
https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=45748&view=logs&j=bea52777-eaf8-5663-8482-18fbc3630e81&t=b2642e3a-5b86-574d-4c8a-f7e2842bfb14&l=4709

> Error when downloading the flink binaries into the Docker container
> -------------------------------------------------------------------
>
>                 Key: FLINK-30941
>                 URL: https://issues.apache.org/jira/browse/FLINK-30941
>             Project: Flink
>          Issue Type: Bug
>          Components: Test Infrastructure
>    Affects Versions: 1.16.1
>            Reporter: Matthias Pohl
>            Priority: Major
>              Labels: test-stability
>
> Creating the Docker container failed because it failed to download the Flink 
> binaries:
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=45748&view=logs&j=bea52777-eaf8-5663-8482-18fbc3630e81&t=b2642e3a-5b86-574d-4c8a-f7e2842bfb14&l=5253
> {code}
> Feb 06 09:00:46 Step 10/14 : RUN set -ex;   wget -nv -O flink.tgz 
> "$FLINK_TGZ_URL";     if [ "$CHECK_GPG" = "true" ]; then     wget -nv -O 
> flink.tgz.asc "$FLINK_ASC_URL";     export GNUPGHOME="$(mktemp -d)";     for 
> server in ha.pool.sks-keyservers.net $(shuf -e                             
> hkp://p80.pool.sks-keyservers.net:80                             
> keyserver.ubuntu.com                             
> hkp://keyserver.ubuntu.com:80                             pgp.mit.edu) ; do   
>       gpg --batch --keyserver "$server" --recv-keys "$GPG_KEY" && break || : 
> ;     done &&     gpg --batch --verify flink.tgz.asc flink.tgz;     gpgconf 
> --kill all;     rm -rf "$GNUPGHOME" flink.tgz.asc;   fi;     tar -xf 
> flink.tgz --strip-components=1;   rm flink.tgz;     chown -R flink:flink .;   
>   sed -i 's/rest.address: localhost/rest.address: 0.0.0.0/g' 
> $FLINK_HOME/conf/flink-conf.yaml;   sed -i 's/rest.bind-address: 
> localhost/rest.bind-address: 0.0.0.0/g' $FLINK_HOME/conf/flink-conf.yaml;   
> sed -i 's/jobmanager.bind-host: localhost/jobmanager.bind-host: 0.0.0.0/g' 
> $FLINK_HOME/conf/flink-conf.yaml;   sed -i 's/taskmanager.bind-host: 
> localhost/taskmanager.bind-host: 0.0.0.0/g' $FLINK_HOME/conf/flink-conf.yaml; 
>   sed -i '/taskmanager.host: localhost/d' $FLINK_HOME/conf/flink-conf.yaml;
> Feb 06 09:00:46  ---> Running in f4bc5a29798a
> Feb 06 09:00:46 + wget -nv -O flink.tgz localhost:9999/flink.tgz
> Feb 06 09:00:46 failed: Connection refused.
> Feb 06 09:00:46 failed: Connection refused.
> The command '/bin/sh -c set -ex;   wget -nv -O flink.tgz "$FLINK_TGZ_URL";    
>  if [ "$CHECK_GPG" = "true" ]; then     wget -nv -O flink.tgz.asc 
> "$FLINK_ASC_URL";     export GNUPGHOME="$(mktemp -d)";     for server in 
> ha.pool.sks-keyservers.net $(shuf -e                             
> hkp://p80.pool.sks-keyservers.net:80                             
> keyserver.ubuntu.com                             
> hkp://keyserver.ubuntu.com:80                             pgp.mit.edu) ; do   
>       gpg --batch --keyserver "$server" --recv-keys "$GPG_KEY" && break || : 
> ;     done &&     gpg --batch --verify flink.tgz.asc flink.tgz;     gpgconf 
> --kill all;     rm -rf "$GNUPGHOME" flink.tgz.asc;   fi;     tar -xf 
> flink.tgz --strip-components=1;   rm flink.tgz;     chown -R flink:flink .;   
>   sed -i 's/rest.address: localhost/rest.address: 0.0.0.0/g' 
> $FLINK_HOME/conf/flink-conf.yaml;   sed -i 's/rest.bind-address: 
> localhost/rest.bind-address: 0.0.0.0/g' $FLINK_HOME/conf/flink-conf.yaml;   
> sed -i 's/jobmanager.bind-host: localhost/jobmanager.bind-host: 0.0.0.0/g' 
> $FLINK_HOME/conf/flink-conf.yaml;   sed -i 's/taskmanager.bind-host: 
> localhost/taskmanager.bind-host: 0.0.0.0/g' $FLINK_HOME/conf/flink-conf.yaml; 
>   sed -i '/taskmanager.host: localhost/d' $FLINK_HOME/conf/flink-conf.yaml;' 
> returned a non-zero code: 4
> {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to