[ 
https://issues.apache.org/jira/browse/ARROW-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16417632#comment-16417632
 ] 

ASF GitHub Bot commented on ARROW-2350:
---------------------------------------

xhochy commented on issue #1787: ARROW-2350: Consolidated RUN step in 
spark_integration Dockerfile
URL: https://github.com/apache/arrow/pull/1787#issuecomment-376941028
 
 
   @jameslamb yes, that would be great. It looks like we could trim down all of 
them. In some cases, we install Python packages with `pip install git+…`. These 
steps should probably stay separate, so that we can delete the Docker cache for 
just those steps and still get a fast rebuild.
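
A rough sketch of that layout, under stated assumptions: the base image, the 
package names (`git`, `numpy`), the `example.com` URL and the `CACHE_BUST` 
build argument are all hypothetical placeholders, not what the 
spark_integration Dockerfile actually uses. The cleanup commands are the ones 
proposed in the issue description.

```dockerfile
# Hypothetical base image that already ships conda and pip.
FROM continuumio/miniconda3

# One consolidated layer: install and clean up in the same RUN, so the
# package caches are never stored in any image layer.
RUN apt-get update && \
    apt-get install -y git && \
    apt-get clean && \
    conda install -y numpy && \
    conda clean --all -y

# Hypothetical cache-busting argument: passing a new value invalidates the
# build cache for the RUN steps after this line, not for the layer above.
ARG CACHE_BUST=0

# The git-based install stays in its own layer so it can be rebuilt alone.
# The URL is a placeholder, not a real package.
RUN pip install --no-cache-dir git+https://example.com/hypothetical/project.git
```

With something like this, `docker build --build-arg CACHE_BUST=$(date +%s) .` 
should rerun only the `pip install` step while reusing the cached consolidated 
layer.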

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


> Shrink size of spark_integration Docker container
> -------------------------------------------------
>
>                 Key: ARROW-2350
>                 URL: https://issues.apache.org/jira/browse/ARROW-2350
>             Project: Apache Arrow
>          Issue Type: Improvement
>            Reporter: James Lamb
>            Assignee: James Lamb
>            Priority: Minor
>              Labels: docker, pull-request-available, spark
>             Fix For: 0.10.0
>
>   Original Estimate: 10m
>  Remaining Estimate: 10m
>
> I would like to propose a few changes to the spark_integration Dockerfile:
> [https://github.com/apache/arrow/tree/master/dev/spark_integration]
> The size of the resulting image can be reduced by making the following 
> changes:
>  * consolidating all RUN commands into a single RUN layer (reducing the 
> number of layers)
>  * running `apt-get clean` to clear out the package cache
>  * running `conda clean --all` to clear out cached 
> package tarballs, abandoned package versions, and other build artifacts from 
> all the libraries that are conda installed
> I will be submitting a PR on GitHub shortly. Generating this issue first so I 
> can tag my PR to it.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
