Xinrong Meng created SPARK-36000:
------------------------------------
Summary: Support creating a ps.Series/Index with `Decimal('NaN')`
with Arrow disabled
Key: SPARK-36000
URL: https://issues.apache.org/jira/browse/SPARK-36000
Project: Spark
Issue Type: Story
Components: PySpark
Affects Versions: 3.2.0
Reporter: Xinrong Meng
{code:java}
>>> import decimal as d
>>> import pyspark.pandas as ps
>>> import numpy as np
>>> ps.utils.default_session().conf.set('spark.sql.execution.arrow.pyspark.enabled', True)
>>> ps.Series([d.Decimal(1.0), d.Decimal(2.0), d.Decimal(np.nan)])
0       1
1       2
2    None
dtype: object
>>> ps.utils.default_session().conf.set('spark.sql.execution.arrow.pyspark.enabled', False)
>>> ps.Series([d.Decimal(1.0), d.Decimal(2.0), d.Decimal(np.nan)])
21/07/02 15:01:07 ERROR Executor: Exception in task 6.0 in stage 13.0 (TID 51)
net.razorvine.pickle.PickleException: problem construction object:
java.lang.reflect.InvocationTargetException
...
{code}
As the code above shows, we cannot create a Series with `Decimal('NaN')`
when Arrow is disabled. We ought to fix that.
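Until this is fixed, one possible caller-side workaround is to replace NaN decimals with None before building the Series, which mirrors what the Arrow-enabled path displays above. The following is only a minimal sketch, not the fix proposed in this ticket: the helper name `nan_decimals_to_none` is hypothetical, and it has not been verified against the Arrow-disabled path.

```python
from decimal import Decimal


def nan_decimals_to_none(values):
    """Return a copy of `values` with every NaN Decimal replaced by None.

    Hypothetical helper for illustration; not part of pyspark.pandas.
    """
    return [
        None if isinstance(v, Decimal) and v.is_nan() else v
        for v in values
    ]


# Decimal(float("nan")) is equivalent to Decimal("NaN").
data = [Decimal(1.0), Decimal(2.0), Decimal(float("nan"))]
cleaned = nan_decimals_to_none(data)
```

The cleaned list could then be passed to `ps.Series(...)`. Whether that sidesteps the PickleException with Arrow disabled is an assumption here, and the NaN marker is lost in the process.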
--
This message was sent by Atlassian Jira
(v8.3.4#803005)