Brian Hulette created BEAM-11155:
------------------------------------

             Summary: Series.str.repeat zipping operation produces incorrect 
proxy
                 Key: BEAM-11155
                 URL: https://issues.apache.org/jira/browse/BEAM-11155
             Project: Beam
          Issue Type: Bug
          Components: sdk-py-core
            Reporter: Brian Hulette
            Assignee: Brian Hulette


https://github.com/apache/beam/pull/13139#discussion_r513684704

This proxy is incorrectly inferred as bool.

{code}
In [10]: proxy.dtypes
Out[10]: 
str        object
repeats     int64
dtype: object

In [11]: proxy.str.str.repeat(proxy.repeats)
Out[11]: Series([], Name: str, dtype: bool)
{code}

The actual operation does produce object though:

{code}
In [13]: df.str.str.repeat(df.repeats)
Out[13]: 
0      AAA
1        B
2     CCCC
3    DDDDD
4       EE
Name: str, dtype: object
{code}

Currently we work around this by specifying the proxy manually, maybe it can be 
fixed upstream?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to