[jira] [Commented] (SPARK-42027) CreateDataframe from Pandas with Struct and Timestamp

Hyukjin Kwon (Jira) Sat, 15 Jul 2023 20:34:04 -0700


    [ https://issues.apache.org/jira/browse/SPARK-42027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17743485#comment-17743485 ]


Hyukjin Kwon commented on SPARK-42027:
--------------------------------------

From a cursory look, I think supporting this correctly will require a fairly 
large fix (I might be wrong). Please go ahead and open a PR, [~gdhuper].

> CreateDataframe from Pandas with Struct and Timestamp
> -----------------------------------------------------
>
>                 Key: SPARK-42027
>                 URL: https://issues.apache.org/jira/browse/SPARK-42027
>             Project: Spark
>          Issue Type: Bug
>          Components: PySpark
>    Affects Versions: 3.4.0
>            Reporter: Martin Grund
>            Priority: Major
>
> The following should be supported and correctly truncate the nanosecond 
> timestamps.
> {code:python}
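> from datetime import timezone, timedelta
> import pandas as pd
> from pandas import Timestamp
> # Nanosecond-precision timestamp at UTC-8; Spark timestamps hold microseconds.
> ts = Timestamp(year=2019, month=1, day=1, nanosecond=500,
>                tz=timezone(timedelta(hours=-8)))
> # col2 holds a dict (struct) that contains the timestamp.
> d = pd.DataFrame({"col1": [1], "col2": [{"a": 1, "b": 2.32, "c": ts}]})
> # Assumes an active SparkSession named `spark`.
> spark.createDataFrame(d).collect()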
> {code}
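
For illustration only, here is a minimal pandas-side sketch of what
"truncate the nanosecond timestamps" could mean for the struct column in the
report. It is not the fix discussed in this issue and does not touch Spark.
The helper name truncate_struct is hypothetical, and flooring to microseconds
is an assumption based on Spark timestamps having microsecond precision.

```python
from datetime import timedelta, timezone

import pandas as pd

# Same timestamp as in the report: 500 ns past midnight, UTC-8.
ts = pd.Timestamp(year=2019, month=1, day=1, nanosecond=500,
                  tz=timezone(timedelta(hours=-8)))


def truncate_struct(value):
    """Hypothetical helper: floor every Timestamp in a dict to microseconds."""
    return {
        key: item.floor("us") if isinstance(item, pd.Timestamp) else item
        for key, item in value.items()
    }


d = pd.DataFrame({"col1": [1], "col2": [{"a": 1, "b": 2.32, "c": ts}]})

# Apply the helper to each dict in the struct-like column.
d["col2"] = d["col2"].map(truncate_struct)

truncated = d["col2"][0]["c"]
```

After this step the nested timestamp no longer carries a nanosecond
component, while the other struct fields are left unchanged. Whether
createDataFrame then accepts the column is exactly what this issue is about.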



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

