Brian Hulette created BEAM-12169:
------------------------------------
Summary: DataFrame API: Allow non-deferred column operations on
categorical columns
Key: BEAM-12169
URL: https://issues.apache.org/jira/browse/BEAM-12169
Project: Beam
Issue Type: Improvement
Components: sdk-py-core
Reporter: Brian Hulette
There are several operations that we currently disallow because they produce a
variable set of columns in the output based on the data (non-deferred-columns).
However, for some dtypes (categorical, boolean) we can easily enumerate all the
possible values that will be seen at execution time, so we can predict the
columns that will be seen.
We should allow these operations in these special cases.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)