[ 
https://issues.apache.org/jira/browse/ARROW-15535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17486429#comment-17486429
 ] 

Jeroen van Straten commented on ARROW-15535:
--------------------------------------------

As a follow-up to this (or maybe a predecessor, I'm not sure which makes more 
sense), it may also be useful to expose an API that generates additional YAML 
for custom compute functions that the user registers at runtime.
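
A minimal sketch of what such an API might look like, written in Python for 
brevity rather than as Arrow C++. Everything named here is hypothetical: 
FunctionSpec, CORE_SUBSTRAIT_FUNCTIONS, generate_extension_yaml and the type 
names are placeholders, not existing Arrow or Substrait APIs, and the YAML 
layout only loosely follows the shape of a Substrait extension file. It skips 
functions that already map to the core Substrait spec and emits entries only 
for the rest.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class FunctionSpec:
    """Hypothetical description of a compute function registered at runtime."""
    name: str
    arg_types: List[str]
    return_type: str


# Assumption: names that already map to functions in the core Substrait spec.
# A real implementation would need an actual mapping table, not this stub.
CORE_SUBSTRAIT_FUNCTIONS = {"add", "subtract"}


def generate_extension_yaml(functions: List[FunctionSpec]) -> str:
    """Emit YAML only for functions without a core Substrait equivalent."""
    lines = ["scalar_functions:"]
    for fn in sorted(functions, key=lambda f: f.name):
        if fn.name in CORE_SUBSTRAIT_FUNCTIONS:
            continue  # already covered by the core spec, so not an extension
        lines.append(f"  - name: {fn.name}")
        lines.append("    impls:")
        lines.append("      - args:")
        for arg_type in fn.arg_types:
            lines.append(f"          - value: {arg_type}")
        lines.append(f"        return: {fn.return_type}")
    return "\n".join(lines) + "\n"


# Usage: one built-in that maps to core Substrait, one user-registered function.
registered = [
    FunctionSpec("add", ["i32", "i32"], "i32"),
    FunctionSpec("my_udf", ["u32"], "u32"),  # "u32" is a placeholder type name
]
print(generate_extension_yaml(registered))
```

The filtering step is the part that is hard to automate in practice, since 
deciding whether a function maps to the core spec needs more than a name 
lookup.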

> [C++] Investigate automatic generation of YAML for Substrait extension 
> functions & types
> ----------------------------------------------------------------------------------------
>
>                 Key: ARROW-15535
>                 URL: https://issues.apache.org/jira/browse/ARROW-15535
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Weston Pace
>            Priority: Major
>              Labels: substrait
>
> Arrow supports data types that are not in the Substrait spec (e.g. unsigned 
> integers) and supports functions that are not in the Substrait spec.
> As a convenience for consumers we can generate a YAML document that describes 
> all the things Arrow supports that are not part of the Substrait spec.  
> ARROW-15238 has an incomplete example of what this YAML might look like.
> This YAML can allow for things like better ahead-of-time validation by 
> middleman validators and optimizers.  It is not strictly required to use 
> these features.
> We should investigate our ability to generate this automatically.  However, 
> this can get fairly complicated.  For example, we don't want to blindly 
> generate YAML for all functions and types because many of the functions and 
> types map to existing functions and types in the core Substrait spec.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
