Hyukjin Kwon created SPARK-44957:
------------------------------------
Summary: Make PySpark (pyspark-sql module) tests passing without
any dependency
Key: SPARK-44957
URL: https://issues.apache.org/jira/browse/SPARK-44957
Project: Spark
Issue Type: Improvement
Components: Tests
Affects Versions: 4.0.0
Reporter: Hyukjin Kwon
{code}
./python/run-tests --python-executables=python3 --modules=pyspark-sql
Running PySpark tests. Output is in /.../spark/python/unit-tests.log
Will test against the following Python executables: ['python3']
Will test the following Python modules: ['pyspark-sql']
python3 python_implementation is CPython
python3 version is: Python 3.10.12
Starting test(python3):
pyspark.sql.tests.pandas.test_pandas_grouped_map_with_state (temp output:
/.../spark/python/target/8e530108-4d5e-46e4-88fb-8f0dfb7b47e2/python3__pyspark.sql.tests.pandas.test_pandas_grouped_map_with_state__jggatex7.log)
Starting test(python3): pyspark.sql.tests.pandas.test_pandas_grouped_map (temp
output:
/.../spark/python/target/3b6e9e5a-c479-408c-9365-8286330e8e7c/python3__pyspark.sql.tests.pandas.test_pandas_grouped_map__1lrovmur.log)
Starting test(python3): pyspark.sql.tests.pandas.test_pandas_cogrouped_map
(temp output:
/.../spark/python/target/68c7cf56-ed7a-453e-8d6d-3a0eb519d997/python3__pyspark.sql.tests.pandas.test_pandas_cogrouped_map__sw2875dr.log)
Starting test(python3): pyspark.sql.tests.pandas.test_pandas_map (temp output:
/.../spark/python/target/90712186-a104-4491-ae0d-2b5ab973991b/python3__pyspark.sql.tests.pandas.test_pandas_map__ysp4911q.log)
Traceback (most recent call last):
File "/.../miniconda3/envs/vanilla-3.10/lib/python3.10/runpy.py", line 196,
in _run_module_as_main
return _run_code(code, main_globals, None,
File "/.../miniconda3/envs/vanilla-3.10/lib/python3.10/runpy.py", line 86, in
_run_code
exec(code, run_globals)
File
"/.../workspace/forked/spark/python/pyspark/sql/tests/pandas/test_pandas_map.py",
line 27, in <module>
from pyspark.testing.sqlutils import (
File "/.../workspace/forked/spark/python/pyspark/testing/__init__.py", line
19, in <module>
from pyspark.testing.pandasutils import assertPandasOnSparkEqual
File "/.../workspace/forked/spark/python/pyspark/testing/pandasutils.py",
line 22, in <module>
import pandas as pd
ModuleNotFoundError: No module named 'pandas'
{code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]