zhengruifeng opened a new pull request, #41330:
URL: https://github.com/apache/spark/pull/41330

   ### What changes were proposed in this pull request?
   Reorganize the `ps.DataFrame` unit tests to follow the categories used in
   https://spark.apache.org/docs/latest/api/python/reference/pyspark.pandas/frame.html
   and https://pandas.pydata.org/docs/reference/frame.html
   
   
   ### Why are the changes needed?
   1. Even after
   https://github.com/apache/spark/commit/58b6535cbf76f2b26fc08b94905a57dcc4d955f6,
   `pyspark.pandas.tests.test_dataframe` and
   `pyspark.pandas.tests.test_dataframe_slow` are still too slow: each one
   normally takes >5 minutes, and sometimes >10 minutes.
   
   2. The tests were not well organized, whereas
   [tests in Pandas](https://github.com/pandas-dev/pandas/tree/main/pandas/tests)
   are grouped by category. We should follow that layout to make the PS tests
   more maintainable.
   
   ### Does this PR introduce _any_ user-facing change?
   No, test-only.
   
   ### How was this patch tested?
   Updated CI.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

