[ 
https://issues.apache.org/jira/browse/SPARK-32572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17175376#comment-17175376
 ] 

Hyukjin Kwon commented on SPARK-32572:
--------------------------------------

[~fokko] the problem is that PySpark runs the tests in parallel via our own 
testing script, https://github.com/apache/spark/blob/master/python/run-tests.py, 
which runs each test in a separate subprocess.
The root cause is probably related to how we invoke the Python tests. 
Yeah, it would be nice if we could fix it with a minimal change.

> Run all the tests at once, instead of having separate entrypoints.
> ------------------------------------------------------------------
>
>                 Key: SPARK-32572
>                 URL: https://issues.apache.org/jira/browse/SPARK-32572
>             Project: Spark
>          Issue Type: Improvement
>          Components: PySpark
>    Affects Versions: 3.0.0
>            Reporter: Fokko Driesprong
>            Priority: Major
>
> Started with this comment thread: 
> https://github.com/apache/spark/pull/29121/files#r456683561
> Each file is invoked separately and has a separate entry point: 
> [https://github.com/apache/spark/blob/master/python/pyspark/ml/tests/test_wrapper.py#L120]
> We would replace the subprocess function call at 
> [https://github.com/apache/spark/blob/master/dev/run-tests.py#L470] 
> with something that would invoke the Python tests.



