thisisnic commented on issue #45438: URL: https://github.com/apache/arrow/issues/45438#issuecomment-2671499364
The way it works in the arrow R package is that dplyr code is translated into Arrow Expressions, which [Acero](https://arrow.apache.org/docs/cpp/acero/overview.html), the Arrow C++ query engine, then uses to execute queries on the data and return the results. What backend engine would be running the queries here?

If it's Acero, your functions would need to produce Arrow Expressions. One option is to build the functions from existing bindings, which is what the example above does (although it translates them automatically). I can find examples in the Arrow codebase of doing this manually, if that would be helpful and is what you need.

If you can't create your functions by combining existing Acero functions, you'd be in the territory of extending Acero in some way, and I can ping folks about how feasible that is. Pinging @jonkeane here as well.
