James Lamb created ARROW-2350: --------------------------------- Summary: Shrink size of spark_integration Docker container Key: ARROW-2350 URL: https://issues.apache.org/jira/browse/ARROW-2350 Project: Apache Arrow Issue Type: Improvement Reporter: James Lamb
I would like to propose a few changes to the spark_integration Dockerfile: [https://github.com/apache/arrow/tree/master/dev/spark_integration] The size of the resulting image can be reduced by making the following changes: * consolidating all RUN commands into a single RUN layer (reducing the number of layers) * running {color:#14892c}apt-get clean{color} to clear out the package cache * running {color:#14892c}conda clean --all{color} to clear out cached package tarballs, abandoned package versions, and other build artifacts from all the libraries that are conda installed I will be submitting a PR on GitHub shortly. Generating this issue first so I can tag my PR to it. -- This message was sent by Atlassian JIRA (v7.6.3#76005)