[ 
https://issues.apache.org/jira/browse/SPARK-56818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stan Buren updated SPARK-56818:
-------------------------------
    Description: 
The interactive Binder environments for all PySpark Quickstart tutorials are 
currently failing to build. This prevents users from utilizing the "Live 
Notebook" features directly from the official Spark documentation.
h3. Steps to Reproduce

1. Open the [Getting 
Started|https://spark.apache.org/docs/latest/api/python/getting_started/index.html]
 documentation.
2. Click on ANY of the following "Live Notebook" (Binder) buttons:
 * ["Live Notebook: 
DataFrame"|https://mybinder.org/v2/gh/apache/spark/c0690c763ba?filepath=python%2Fdocs%2Fsource%2Fgetting_started%2Fquickstart_df.ipynb]
 * ["Live Notebook: Spark 
Connect"|https://mybinder.org/v2/gh/apache/spark/c0690c763ba?filepath=python%2Fdocs%2Fsource%2Fgetting_started%2Fquickstart_connect.ipynb]
 * ["Live Notebook: pandas API on 
Spark"|https://mybinder.org/v2/gh/apache/spark/c0690c763ba?filepath=python%2Fdocs%2Fsource%2Fgetting_started%2Fquickstart_ps.ipynb]
3. Wait for the build process on mybinder.org.

h3. Actual Result

All three notebooks fail at step 39 of the Dockerfile with the exact same error.
{code:java|title=Build Log Snippet}
 > [7/8] RUN apt-get update && apt-get install -y openjdk-17-jre git coreutils:
 ...
 5.008 Package openjdk-17-jre is not available, but is referred to by another 
package.
 5.008 This may mean that the package is missing, has been obsoleted, or
 5.008 is only available from another source
 5.008 
 5.016 E: Package 'openjdk-17-jre' has no installation candidate
 ...
 ERROR: failed to solve: process "/bin/sh -c apt-get update && apt-get install 
-y openjdk-17-jre git coreutils" did not complete successfully: exit code: 100
{code}
{code:java|title=Affected Dockerfile Lines}
  37 |     USER root
  38 |     RUN chown -R ${NB_UID} ${HOME}
  39 | >>> RUN apt-get update && apt-get install -y openjdk-17-jre git coreutils
  40 |     USER ${NB_USER}
{code}
h3. Expected Result

The environment should build successfully for all Quickstart notebooks, 
allowing the user to interact with Jupyter.
h3. Root Cause Analysis

The build logs indicate that the environment is pulling packages from 
*{*}Debian Trixie (Testing){*}* repositories:
{code:java}
Hit:1 http://deb.debian.org/debian trixie InRelease
{code}
The current {{binder/Dockerfile}} (at commit {{{}c0690c763ba{}}}) attempts to 
install {{{}openjdk-17-jre{}}}, but this package appears to be missing or 
unavailable in the current Trixie package index. Since this build is associated 
with the *{*}v4.1.1-rc2{*}* release preparation, it blocks the interactive 
documentation for the upcoming version.

  was:
h3. Description
The interactive Binder environments for all PySpark Quickstart tutorials are 
currently failing to build. This prevents users from utilizing the "Live 
Notebook" features directly from the official Spark documentation.

h3. Steps to Reproduce
1. Open the [Getting 
Started|https://spark.apache.org/docs/latest/api/python/getting_started/index.html]
 documentation.
2. Click on ANY of the following "Live Notebook" (Binder) buttons:
* ["Live Notebook: 
DataFrame"|https://mybinder.org/v2/gh/apache/spark/c0690c763ba?filepath=python%2Fdocs%2Fsource%2Fgetting_started%2Fquickstart_df.ipynb]
* ["Live Notebook: Spark 
Connect"|https://mybinder.org/v2/gh/apache/spark/c0690c763ba?filepath=python%2Fdocs%2Fsource%2Fgetting_started%2Fquickstart_connect.ipynb]
* ["Live Notebook: pandas API on 
Spark"|https://mybinder.org/v2/gh/apache/spark/c0690c763ba?filepath=python%2Fdocs%2Fsource%2Fgetting_started%2Fquickstart_ps.ipynb]
3. Wait for the build process on mybinder.org.


h3. Actual Result
All three notebooks fail at step 39 of the Dockerfile with the exact same 
error. 

{code:title=Build Log Snippet}
 > [7/8] RUN apt-get update && apt-get install -y openjdk-17-jre git coreutils:
 ...
 5.008 Package openjdk-17-jre is not available, but is referred to by another 
package.
 5.008 This may mean that the package is missing, has been obsoleted, or
 5.008 is only available from another source
 5.008 
 5.016 E: Package 'openjdk-17-jre' has no installation candidate
 ...
 ERROR: failed to solve: process "/bin/sh -c apt-get update && apt-get install 
-y openjdk-17-jre git coreutils" did not complete successfully: exit code: 100
{code}

{code:title=Affected Dockerfile Lines}
  37 |     USER root
  38 |     RUN chown -R ${NB_UID} ${HOME}
  39 | >>> RUN apt-get update && apt-get install -y openjdk-17-jre git coreutils
  40 |     USER ${NB_USER}
{code}

h3. Expected Result
The environment should build successfully for all Quickstart notebooks, 
allowing the user to interact with Jupyter.

h3. Root Cause Analysis
The build logs indicate that the environment is pulling packages from **Debian 
Trixie (Testing)** repositories:
{code}
Hit:1 http://deb.debian.org/debian trixie InRelease
{code}
The current {{binder/Dockerfile}} (at commit {{c0690c763ba}}) attempts to 
install {{openjdk-17-jre}}, but this package appears to be missing or 
unavailable in the current Trixie package index. Since this build is associated 
with the **v4.1.1-rc2** release preparation, it blocks the interactive 
documentation for the upcoming version.


> [DOCS] Binder build failure: openjdk-17-jre package not found in Ubuntu 22.04 
> base image
> ----------------------------------------------------------------------------------------
>
>                 Key: SPARK-56818
>                 URL: https://issues.apache.org/jira/browse/SPARK-56818
>             Project: Spark
>          Issue Type: Bug
>          Components: Documentation
>    Affects Versions: 4.1.1
>         Environment: mybinder.org (Ubuntu 22.04 base image)
>            Reporter: Stan Buren
>            Priority: Major
>
> The interactive Binder environments for all PySpark Quickstart tutorials are 
> currently failing to build. This prevents users from utilizing the "Live 
> Notebook" features directly from the official Spark documentation.
> h3. Steps to Reproduce
> 1. Open the [Getting 
> Started|https://spark.apache.org/docs/latest/api/python/getting_started/index.html]
>  documentation.
> 2. Click on ANY of the following "Live Notebook" (Binder) buttons:
>  * ["Live Notebook: 
> DataFrame"|https://mybinder.org/v2/gh/apache/spark/c0690c763ba?filepath=python%2Fdocs%2Fsource%2Fgetting_started%2Fquickstart_df.ipynb]
>  * ["Live Notebook: Spark 
> Connect"|https://mybinder.org/v2/gh/apache/spark/c0690c763ba?filepath=python%2Fdocs%2Fsource%2Fgetting_started%2Fquickstart_connect.ipynb]
>  * ["Live Notebook: pandas API on 
> Spark"|https://mybinder.org/v2/gh/apache/spark/c0690c763ba?filepath=python%2Fdocs%2Fsource%2Fgetting_started%2Fquickstart_ps.ipynb]
> 3. Wait for the build process on mybinder.org.
> h3. Actual Result
> All three notebooks fail at step 39 of the Dockerfile with the exact same 
> error.
> {code:java|title=Build Log Snippet}
>  > [7/8] RUN apt-get update && apt-get install -y openjdk-17-jre git 
> coreutils:
>  ...
>  5.008 Package openjdk-17-jre is not available, but is referred to by another 
> package.
>  5.008 This may mean that the package is missing, has been obsoleted, or
>  5.008 is only available from another source
>  5.008 
>  5.016 E: Package 'openjdk-17-jre' has no installation candidate
>  ...
>  ERROR: failed to solve: process "/bin/sh -c apt-get update && apt-get 
> install -y openjdk-17-jre git coreutils" did not complete successfully: exit 
> code: 100
> {code}
> {code:java|title=Affected Dockerfile Lines}
>   37 |     USER root
>   38 |     RUN chown -R ${NB_UID} ${HOME}
>   39 | >>> RUN apt-get update && apt-get install -y openjdk-17-jre git 
> coreutils
>   40 |     USER ${NB_USER}
> {code}
> h3. Expected Result
> The environment should build successfully for all Quickstart notebooks, 
> allowing the user to interact with Jupyter.
> h3. Root Cause Analysis
> The build logs indicate that the environment is pulling packages from 
> *{*}Debian Trixie (Testing){*}* repositories:
> {code:java}
> Hit:1 http://deb.debian.org/debian trixie InRelease
> {code}
> The current {{binder/Dockerfile}} (at commit {{{}c0690c763ba{}}}) attempts to 
> install {{{}openjdk-17-jre{}}}, but this package appears to be missing or 
> unavailable in the current Trixie package index. Since this build is 
> associated with the *{*}v4.1.1-rc2{*}* release preparation, it blocks the 
> interactive documentation for the upcoming version.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to