[ https://issues.apache.org/jira/browse/SPARK-36143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17384999#comment-17384999 ]
Apache Spark commented on SPARK-36143:
--------------------------------------

User 'xinrong-databricks' has created a pull request for this issue:
https://github.com/apache/spark/pull/33466

> Adjust astype of Series with missing values to follow pandas
> ------------------------------------------------------------
>
>                 Key: SPARK-36143
>                 URL: https://issues.apache.org/jira/browse/SPARK-36143
>             Project: Spark
>          Issue Type: Sub-task
>          Components: PySpark
>    Affects Versions: 3.2.0
>            Reporter: Xinrong Meng
>            Priority: Major
>
> {code:java}
> >>> pser = pd.Series([1, 2, np.nan], dtype=float)
> >>> psser = ps.from_pandas(pser)
> >>> pser.astype(int)
> ...
> ValueError: Cannot convert non-finite values (NA or inf) to integer
> >>> psser.astype(int)
> 0    1.0
> 1    2.0
> 2    NaN
> dtype: float64
> {code}
> As shown above, astype of Series with missing values doesn't behave the same
> as pandas, we ought to adjust that.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
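
For reference, the pandas side of the comparison in the issue can be reproduced without a Spark session. The snippet below is only a minimal sketch of the behavior the issue says pandas-on-Spark should follow. It does not exercise `ps.from_pandas` or the change in the linked pull request, and the use of the nullable "Int64" dtype at the end is an assumption about what a caller might do instead, not something the issue proposes.

```python
import numpy as np
import pandas as pd

# Same Series as in the issue description.
pser = pd.Series([1, 2, np.nan], dtype=float)

# pandas refuses to cast a float Series that contains NaN to a plain
# integer dtype; the error is a ValueError (or a subclass of it).
try:
    pser.astype(int)
    raised = False
except ValueError as exc:
    raised = True
    print(f"ValueError: {exc}")

# Assumption for illustration: a caller who wants integers while keeping
# the missing value can use pandas' nullable extension dtype instead.
nullable = pser.astype("Int64")
print(nullable)
```

The first cast raises, which is the pandas behavior quoted in the issue; the second keeps the missing entry as `<NA>` rather than silently staying float64.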