[
https://issues.apache.org/jira/browse/BEAM-14107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Brian Hulette updated BEAM-14107:
---------------------------------
Description:
We should be able to build a deferred operation that calculates the number of
groups for GroupBy.ngroups.
len(groupby) might be possible ... but I think Python will complain if we
return something other than an integer (like a deferred operation). If that's
the case, we should raise an error that directs users to alternatives (e.g.
ngroups).
was: We should be able to build a deferred operation that calculates the
number of groups for these operations.
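
To illustrate the len(groupby) concern in the updated description: the built-in
len() raises a TypeError when __len__ returns anything other than an integer, so
a deferred result cannot be returned from it. The sketch below is only an
illustration in plain Python. DeferredCount, NaiveGroupBy, GuardedGroupBy and
try_len are hypothetical names, not Beam classes, and the choice of TypeError
for the guarded error is an assumption rather than what Beam would necessarily
raise.

```python
class DeferredCount:
    """Hypothetical stand-in for a deferred (not yet computed) group count."""


class NaiveGroupBy:
    """Hypothetical: tries to return a deferred value from __len__."""

    def __len__(self):
        # len() rejects any return value that is not an integer.
        return DeferredCount()


class GuardedGroupBy:
    """Hypothetical: fails fast and points users at ngroups instead."""

    @property
    def ngroups(self):
        # A property may return a deferred placeholder without complaint.
        return DeferredCount()

    def __len__(self):
        # Assumption: TypeError is used here purely for illustration.
        raise TypeError(
            "len(groupby) must return an int and cannot be deferred; "
            "use the deferred 'ngroups' property instead.")


def try_len(obj):
    """Return len(obj), or the TypeError raised while computing it."""
    try:
        return len(obj)
    except TypeError as exc:
        return exc


naive_result = try_len(NaiveGroupBy())      # TypeError raised by len() itself
guarded_result = try_len(GuardedGroupBy())  # TypeError with the ngroups hint
```

In this sketch both calls fail, but only the guarded version tells the user
what to do instead, which is the behavior the description asks for.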
> DataFrame API: Support deferred GroupBy.__len__ and GroupBy.ngroups
> -------------------------------------------------------------------
>
> Key: BEAM-14107
> URL: https://issues.apache.org/jira/browse/BEAM-14107
> Project: Beam
> Issue Type: Improvement
> Components: dsl-dataframe, sdk-py-core
> Reporter: Andy Ye
> Priority: P3
> Labels: dataframe-api
>
> We should be able to build a deferred operation that calculates the number of
> groups for GroupBy.ngroups.
> len(groupby) might be possible ... but I think Python will complain if we
> return something other than an integer (like a deferred operation). If that's
> the case, we should raise an error that directs users to alternatives (e.g.
> ngroups).
--
This message was sent by Atlassian Jira
(v8.20.1#820001)