alamb opened a new issue, #17824: URL: https://github.com/apache/datafusion/issues/17824
### Is your feature request related to a problem or challenge?

There are some features, such as table sampling, that are challenging to add to DataFusion.

The reason it is challenging is that I think the use case and semantics will vary widely across systems, and thus I worry that anything we build into DataFusion will likely be fairly complicated as well as not what other systems may want.

I think it is actually possible to implement table sampling with the existing APIs through a combination of
1. SQL planner extension https://github.com/apache/datafusion/blob/main/datafusion-examples/examples/sql_dialect.rs
2. User defined extension nodes (aka add extension logical planning nodes)

I would be willing to help make an example for this use case, to show it is possible. I think it would be a nice showcase for how to extend systems using DataFusion without having to change the code.

### Describe the solution you'd like

A clear and well documented example of extending the SQL supported by DataFusion

### Describe alternatives you've considered

Note that @theirix already has a great start here - https://github.com/apache/datafusion/pull/17633

I would like to assist in completing this project

### Additional context

- Related ticket - https://github.com/apache/datafusion/issues/13563 from @theirix
- The related PR from @theirix in https://github.com/apache/datafusion/pull/17633
- A blog post on the subject https://github.com/apache/datafusion-site/pull/97 from @Adez017
