[ 
https://issues.apache.org/jira/browse/BEAM-11064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17214216#comment-17214216
 ] 

Brian Hulette commented on BEAM-11064:
--------------------------------------

It's distinct in that it tells us about the batched element type, which could 
be a DataFrame or an arrow RecordBatch or Table.

I think what you're suggesting is we should always use Rows/RowType as the 
element type. In the pandas/arrow case, row instances (if needed) would always 
be views into an underlying RecordBatch or DataFrame. Is that right?

> Consider adding a pandas typehint
> ---------------------------------
>
>                 Key: BEAM-11064
>                 URL: https://issues.apache.org/jira/browse/BEAM-11064
>             Project: Beam
>          Issue Type: Improvement
>          Components: sdk-py-core
>            Reporter: Brian Hulette
>            Assignee: Brian Hulette
>            Priority: P2
>
> The typehint should be parameterized by a proxy object. This way we could 
> retrieve the proxy for use in {{to_dataframe}}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to