Brian Hulette created BEAM-11155:
------------------------------------
Summary: Series.str.repeat zipping operation produces incorrect
proxy
Key: BEAM-11155
URL: https://issues.apache.org/jira/browse/BEAM-11155
Project: Beam
Issue Type: Bug
Components: sdk-py-core
Reporter: Brian Hulette
Assignee: Brian Hulette
https://github.com/apache/beam/pull/13139#discussion_r513684704
This proxy is incorrectly inferred as bool.
{code}
In [10]: proxy.dtypes
Out[10]:
str object
repeats int64
dtype: object
In [11]: proxy.str.str.repeat(proxy.repeats)
Out[11]: Series([], Name: str, dtype: bool)
{code}
The actual operation does produce object though:
{code}
In [13]: df.str.str.repeat(df.repeats)
Out[13]:
0 AAA
1 B
2 CCCC
3 DDDDD
4 EE
Name: str, dtype: object
{code}
Currently we work around this by specifying the proxy manually, maybe it can be
fixed upstream?
--
This message was sent by Atlassian Jira
(v8.3.4#803005)