This is an automated email from the ASF dual-hosted git repository.

ruifengz pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git


The following commit(s) were added to refs/heads/master by this push:
     new 5edd0ec696d6 [SPARK-46317][INFRA][FOLLOWUP] Install `lxml` in PyPy3.8
5edd0ec696d6 is described below

commit 5edd0ec696d6eb02a4bd8bbce086f0ad2d4b38d6
Author: Dongjoon Hyun <dh...@apple.com>
AuthorDate: Mon Dec 18 10:02:34 2023 +0800

    [SPARK-46317][INFRA][FOLLOWUP] Install `lxml` in PyPy3.8
    
    ### What changes were proposed in this pull request?
    
    This PR aims to install `lxml` in PyPy3.8.
    
    ### Why are the changes needed?
    
    #44247 seems to break the `PyPy3.8` daily CI.
    - https://github.com/apache/spark/actions/runs/7239278196/job/19721018105
    ```
    Traceback (most recent call last):
      File "/usr/local/pypy/pypy3.8/lib/pypy3.8/runpy.py", line 197, in 
_run_module_as_main
        return _run_code(code, main_globals, None,
      File "/usr/local/pypy/pypy3.8/lib/pypy3.8/runpy.py", line 87, in _run_code
        exec(code, run_globals)
      File "/__w/spark/spark/python/pyspark/sql/tests/test_session.py", line 
22, in <module>
        from lxml import etree
    ModuleNotFoundError: No module named 'lxml'
    ```
    
    ### Does this PR introduce _any_ user-facing change?
    
    No.
    
    ### How was this patch tested?
    
    Pass the CIs.
    
    ### Was this patch authored or co-authored using generative AI tooling?
    
    No.
    
    Closes #44391 from dongjoon-hyun/SPARK-46317.
    
    Authored-by: Dongjoon Hyun <dh...@apple.com>
    Signed-off-by: Ruifeng Zheng <ruife...@apache.org>
---
 dev/infra/Dockerfile | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/dev/infra/Dockerfile b/dev/infra/Dockerfile
index cade845d911b..9e81accbd3b7 100644
--- a/dev/infra/Dockerfile
+++ b/dev/infra/Dockerfile
@@ -92,7 +92,7 @@ RUN mkdir -p /usr/local/pypy/pypy3.8 && \
     ln -sf /usr/local/pypy/pypy3.8/bin/pypy /usr/local/bin/pypy3.8 && \
     ln -sf /usr/local/pypy/pypy3.8/bin/pypy /usr/local/bin/pypy3
 RUN curl -sS https://bootstrap.pypa.io/get-pip.py | pypy3
-RUN pypy3 -m pip install numpy 'six==1.16.0' 'pandas<=2.1.4' scipy coverage 
matplotlib
+RUN pypy3 -m pip install numpy 'six==1.16.0' 'pandas<=2.1.4' scipy coverage 
matplotlib lxml
 
 
 ARG BASIC_PIP_PKGS="numpy pyarrow>=14.0.0 six==1.16.0 pandas<=2.1.4 scipy 
unittest-xml-reporting plotly>=4.8 mlflow>=2.8.1 coverage matplotlib openpyxl 
memory-profiler>=0.61.0 scikit-learn>=1.3.2"


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org
For additional commands, e-mail: commits-h...@spark.apache.org

Reply via email to