Brian Hulette created BEAM-12169:
------------------------------------

             Summary: DataFrame API: Allow non-deferred column operations on 
categorical columns
                 Key: BEAM-12169
                 URL: https://issues.apache.org/jira/browse/BEAM-12169
             Project: Beam
          Issue Type: Improvement
          Components: sdk-py-core
            Reporter: Brian Hulette


Several operations are currently disallowed because the set of columns they 
produce depends on the data (non-deferred columns). However, for some dtypes 
(categorical, boolean) all possible values can be enumerated before execution, 
so the output columns can be predicted up front.

We should allow these operations in these special cases.
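
A minimal sketch of the idea in plain pandas (not the Beam DataFrame API). It 
uses pd.get_dummies as one example of an operation whose output columns normally 
depend on the data; the issue does not name specific operations, so that choice, 
the color categories, and the two "partitions" are assumptions made for 
illustration only.

```python
import pandas as pd

# Hypothetical categories, declared up front in the dtype (illustration only).
color_dtype = pd.CategoricalDtype(categories=["red", "green", "blue"])

# Two partitions of the same logical column, each observing different values.
part_a = pd.Series(["red", "red"], dtype=color_dtype)
part_b = pd.Series(["blue", "green"], dtype=color_dtype)

# Columns predicted from the dtype alone, before any data is seen.
predicted = list(color_dtype.categories)

# With a categorical dtype, get_dummies emits one column per declared
# category, so every partition yields the same columns.
cols_a = list(pd.get_dummies(part_a).columns)
cols_b = list(pd.get_dummies(part_b).columns)

# The same values as plain strings: the columns now depend on what was observed.
cols_a_plain = list(pd.get_dummies(part_a.astype(str)).columns)

print(predicted, cols_a, cols_b, cols_a_plain)
```

Under these assumptions the categorical case gives identical columns for both 
partitions, matching what the dtype predicts, while the plain-string case only 
produces columns for the values that partition happened to contain.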



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
