Yikun commented on a change in pull request #34717:
URL: https://github.com/apache/spark/pull/34717#discussion_r758898358
##########
File path: python/docs/source/user_guide/sql/arrow_pandas.rst
##########
@@ -387,7 +387,7 @@ working with timestamps in ``pandas_udf``\s to get the best
performance, see
Recommended Pandas and PyArrow Versions
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
-For usage with pyspark.sql, the minimum supported versions of Pandas is 0.23.2
and PyArrow is 1.0.0.
+For usage with pyspark.sql, the minimum supported versions of Pandas is 1.0.5
and PyArrow is 1.0.0.
Review comment:
How about:
For usage with pyspark.sql, the minimum supported versions of Pandas is
**1.0.5** and PyArrow is 1.0.0. **Lower versions (such as there are some known
issues under with v1.0.0, v1.0.1, see more in
[link](https://github.com/apache/spark/pull/34717)) or** higher versions may be
used, however, compatibility and data correctness can not be guaranteed and
should be verified by the user.
Maybe need more suggestion from native speaker. T_T, and if it's necessary
we could do it in next commits in this PR or followup.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]