Anne Rutten created ZEPPELIN-2621: ------------------------------------- Summary: Spark standalone Dockerfile downloads corrupt spark.tgz archive Key: ZEPPELIN-2621 URL: https://issues.apache.org/jira/browse/ZEPPELIN-2621 Project: Zeppelin Issue Type: Bug Affects Versions: 0.7.0 Reporter: Anne Rutten Priority: Minor
trying to build a docker image as per [these instructions|https://zeppelin.apache.org/docs/0.7.0/install/spark_cluster_mode.html#1-build-docker-file] generates the following error: {quote} > docker build -t "spark_standalone" . {quote} {quote} (...) Step 12 : RUN curl -s http://apache.mirror.cdnetworks.com/spark/spark-$SPARK_VERSION/spark-$SPARK_VERSION-bin-hadoop$HADOOP_PROFILE.tgz | tar -xz -C /usr/local/ ---> Running in adf4bd575e37 gzip: stdin: unexpected end of file tar: Unexpected EOF in archive tar: Unexpected EOF in archive tar: Error is not recoverable: exiting now The command '/bin/sh -c curl -s http://apache.mirror.cdnetworks.com/spark/spark-$SPARK_VERSION/spark-$SPARK_VERSION-bin-hadoop$HADOOP_PROFILE.tgz | tar -xz -C /usr/local/' returned a non-zero code: 2 {quote} I've tracked this down to the .tgz file being downloaded from http://apache.mirror.cdnetworks.com/ being corrupted. If i substitute with a cloudfront URL, e.g. https://d3kbcqa49mib13.cloudfront.net/, the image gets built successfully. -- This message was sent by Atlassian JIRA (v6.3.15#6346)