[ 
https://issues.apache.org/jira/browse/BEAM-14107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brian Hulette updated BEAM-14107:
---------------------------------
    Description: 
We should be able to build a deferred operation that calculates the number of 
groups for GroupBy.ngroups.

len(groupby) might be possible ... but I think Python will complain if we 
return something other than an integer (like a deferred operation). If that's 
the case, we should raise an error that directs users to alternatives (e.g. 
ngroups).
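
A minimal sketch of the concern above, in plain Python with no Beam imports. DeferredValue, NaiveGroupBy, GuardedGroupBy and try_len are hypothetical names made up for illustration, and NotImplementedError is only an assumed choice of exception, not necessarily what Beam would raise. It shows that the built-in len() rejects a non-integer return value from __len__, and what a fail-fast __len__ that points users at ngroups could look like.

```python
class DeferredValue:
    """Hypothetical stand-in for a deferred scalar result (not a Beam class)."""

    def __init__(self, description):
        self.description = description


class NaiveGroupBy:
    """Hypothetical groupby whose __len__ returns a deferred object."""

    def __len__(self):
        # Returning a non-integer here makes the built-in len() raise TypeError.
        return DeferredValue("number of groups")


class GuardedGroupBy:
    """Hypothetical groupby that exposes ngroups and rejects len() up front."""

    @property
    def ngroups(self):
        # The deferred count is available through a property instead of len().
        return DeferredValue("number of groups")

    def __len__(self):
        # Assumed error type and message; the real implementation may differ.
        raise NotImplementedError(
            "len(groupby) cannot be deferred because len() must return an "
            "int; use groupby.ngroups instead.")


def try_len(obj):
    """Return (exception type name, message) raised by len(obj), or None."""
    try:
        len(obj)
    except Exception as exc:
        return type(exc).__name__, str(exc)
    return None


if __name__ == "__main__":
    print(try_len(NaiveGroupBy()))
    print(try_len(GuardedGroupBy()))
```

The first call reports the TypeError that Python itself raises, which says nothing about alternatives; the second reports the explicit error that names ngroups.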

  was:We should be able to build a deferred operation that calculates the 
number of groups for these operations.


> DataFrame API: Support deferred GroupBy.__len__ and GroupBy.ngroups
> -------------------------------------------------------------------
>
>                 Key: BEAM-14107
>                 URL: https://issues.apache.org/jira/browse/BEAM-14107
>             Project: Beam
>          Issue Type: Improvement
>          Components: dsl-dataframe, sdk-py-core
>            Reporter: Andy Ye
>            Priority: P3
>              Labels: dataframe-api
>
> We should be able to build a deferred operation that calculates the number of 
> groups for GroupBy.ngroups.
> len(groupby) might be possible ... but I think Python will complain if we 
> return something other than an integer (like a deferred operation). If that's 
> the case, we should raise an error that directs users to alternatives (e.g. 
> ngroups).



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
