[ 
https://issues.apache.org/jira/browse/BEAM-11627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brian Hulette updated BEAM-11627:
---------------------------------
    Component/s: dsl-dataframe

> Properly support `convert_dtype=True` in Series.apply
> -----------------------------------------------------
>
>                 Key: BEAM-11627
>                 URL: https://issues.apache.org/jira/browse/BEAM-11627
>             Project: Beam
>          Issue Type: Improvement
>          Components: dsl-dataframe, sdk-py-core
>            Reporter: Brian Hulette
>            Priority: P3
>              Labels: dataframe-api
>
> See 
> https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.Series.apply.html
> convert_dtype=True indicates that pandas should observe the output and set 
> the dtype to something other than object if possible. We should intercept 
> this argument and use type inference to set the dtype. We can't rely on 
> pandas' inference since our implementation can't observe the entire dataset. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to