Yikun Jiang created SPARK-43365:
-----------------------------------
Summary: Refactor Dockerfile and workflow based on base image
Key: SPARK-43365
URL: https://issues.apache.org/jira/browse/SPARK-43365
Project: Spark
Issue Type: Sub-task
Components: Spark Docker
Affects Versions: 3.5.0
Reporter: Yikun Jiang
https://github.com/docker-library/official-images/pull/13089?notification_referrer_id=NT_kwDOABp-orI0MzIwMzMwNzY5OjE3MzYzNTQ#issuecomment-1533540388
Would it be useful to save space by sharing layers by having one image from
another? 🤔 Something like the *java11-ubuntu as the "base" with r and python
variants FROM that and the r-python being FROM, probably, the larger one of
those?
Rough example Dockerfiles
{code:java}
FROM eclipse-temurin:11-jre-focal
# user stuff, install common deps, etc
...
# download/extract spark (maybe keeping python and R files too? they seem
relatively small compared to the rest)
{code}
{code:java}
# other images in separate Dockerfiles
FROM spark:3.3.0-scala2.12-java11-ubuntu
# get "/opt/spark/{python,R}/" contents if not kept in base
# install python or R (and things like R_HOME)
{code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]