[ 
https://issues.apache.org/jira/browse/ARROW-14379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17529086#comment-17529086
 ] 

Dewey Dunnington commented on ARROW-14379:
------------------------------------------

Are there examples other than sf columns where this is relevant? It would be 
possible to make a generic list extension type that just calls {{serialize()}} 
on each element but it would probably be slow and we probably want to encourage 
other solutions. The other example I can think of is a list of models, maybe, 
for which the `broom::glance()` or `broom::tidy()` representation would fit in 
to Arrow format much better.

> [R] Create a custom extension of list that stores row-level metadata
> --------------------------------------------------------------------
>
>                 Key: ARROW-14379
>                 URL: https://issues.apache.org/jira/browse/ARROW-14379
>             Project: Apache Arrow
>          Issue Type: Sub-task
>          Components: R
>            Reporter: Jonathan Keane
>            Priority: Major
>
> Since lists can be nested, we should be able store each element as something 
> like {{list(value = "foo", attributes = list(attr1 = TRUE, attr2 = "baz"))}} 
> and then we can reconstitute that in the R conversion to transfer the 
> attributes element to attributes.
> This will be more efficient (since we get compression on the column + 
> metadata/attributes) and we also will be able to filter these + use them in 
> datasets since each row has all of the information about itself that it needs 
> to roundtrip.
> This would get us SF columns for free



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

Reply via email to