[
https://issues.apache.org/jira/browse/ARROW-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16412969#comment-16412969
]
ASF GitHub Bot commented on ARROW-2350:
---------------------------------------
xhochy closed pull request #1787: ARROW-2350: Consolidated RUN step in
spark_integration Dockerfile
URL: https://github.com/apache/arrow/pull/1787
This is a PR merged from a forked repository.
As GitHub hides the original diff on merge, it is displayed below for
the sake of provenance:
diff --git a/dev/spark_integration/Dockerfile b/dev/spark_integration/Dockerfile
index d1b3cf89f..f96ee3578 100644
--- a/dev/spark_integration/Dockerfile
+++ b/dev/spark_integration/Dockerfile
@@ -17,24 +17,23 @@
FROM maven:3.5.2-jdk-8-slim
# Basic OS utilities
-RUN apt-get update && apt-get install -y \
+RUN apt-get update && \
+ apt-get install -y \
wget \
- git build-essential \
- software-properties-common
-
-# This will install conda in /home/ubuntu/miniconda
-RUN wget -O /tmp/miniconda.sh \
+ git \
+ build-essential \
+ software-properties-common && \
+ apt-get clean && \
+ # install conda in /home/ubuntu/miniconda
+ wget -O /tmp/miniconda.sh \
https://repo.continuum.io/miniconda/Miniconda3-latest-Linux-x86_64.sh && \
bash /tmp/miniconda.sh -b -p /home/ubuntu/miniconda && \
- rm /tmp/miniconda.sh
-
-# Python dependencies
-RUN apt-get install -y \
- pkg-config
-
-# Create Conda environment
-ENV PATH="/home/ubuntu/miniconda/bin:${PATH}"
-RUN conda create -y -q -n pyarrow-dev \
+ rm /tmp/miniconda.sh && \
+ # Python dependencies
+ apt-get install -y \
+ pkg-config && \
+ # Create Conda environment
+ /home/ubuntu/miniconda/bin/conda create -y -q -n pyarrow-dev \
# Python
python=2.7 \
numpy \
@@ -59,7 +58,10 @@ RUN conda create -y -q -n pyarrow-dev \
jemalloc \
lz4-c \
zstd \
- -c conda-forge
+ -c conda-forge && \
+ /home/ubuntu/miniconda/bin/conda clean --all
+
+ENV PATH="/home/ubuntu/miniconda/bin:${PATH}"
ADD . /apache-arrow
WORKDIR /apache-arrow
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
> Shrink size of spark_integration Docker container
> -------------------------------------------------
>
> Key: ARROW-2350
> URL: https://issues.apache.org/jira/browse/ARROW-2350
> Project: Apache Arrow
> Issue Type: Improvement
> Reporter: James Lamb
> Priority: Minor
> Labels: docker, pull-request-available, spark
> Fix For: 0.10.0
>
> Original Estimate: 10m
> Remaining Estimate: 10m
>
> I would like to propose a few changes to the spark_integration Dockerfile:
> [https://github.com/apache/arrow/tree/master/dev/spark_integration]
> The size of the resulting image can be reduced by making the following
> changes:
> * consolidating all RUN commands into a single RUN layer (reducing the
> number of layers)
> * running apt-get clean to clear out the package cache
> * running conda clean --all to clear out cached package tarballs,
> abandoned package versions, and other build artifacts from all the
> libraries that are conda installed
> I will be submitting a PR on GitHub shortly. Generating this issue first so I
> can tag my PR to it.
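
The three changes listed in the description can be sketched as one RUN instruction. This is a trimmed-down illustration, not the merged Dockerfile: the base image, paths, and download URL are taken from the diff above, while the environment name "example-env" and the shortened package list are hypothetical. The point it shows is that each RUN creates a layer, so a cache only stops counting toward the image size if it is removed in the same RUN that created it.

```dockerfile
FROM maven:3.5.2-jdk-8-slim

# A single RUN instruction yields a single layer, so files deleted at the
# end of the chain are never committed to the image.
RUN apt-get update && \
    apt-get install -y \
        wget \
        git \
        build-essential && \
    # Drop the downloaded package archives from the apt cache
    apt-get clean && \
    wget -O /tmp/miniconda.sh \
        https://repo.continuum.io/miniconda/Miniconda3-latest-Linux-x86_64.sh && \
    bash /tmp/miniconda.sh -b -p /home/ubuntu/miniconda && \
    rm /tmp/miniconda.sh && \
    # "example-env" is a hypothetical name used only for this sketch
    /home/ubuntu/miniconda/bin/conda create -y -q -n example-env \
        python=2.7 \
        numpy && \
    # Remove cached tarballs and unused packages within the same layer
    /home/ubuntu/miniconda/bin/conda clean --all

ENV PATH="/home/ubuntu/miniconda/bin:${PATH}"
```

One way to check the effect would be to build the image before and after such a change and compare the sizes reported by "docker images", or inspect the per-layer sizes with "docker history".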
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)