alamb opened a new issue, #9016: URL: https://github.com/apache/arrow-datafusion/issues/9016
### Is your feature request related to a problem or challenge?

Someone asked in [discord](https://discord.com/channels/885562378132000778/1166447479609376850/1200557140096594053):

> I'm looking at Datafusion and Polars as potential solutions for calculating averages over a sliding window of events, where the window is bound by event time. I've just come across Datafusion, would anyone be able to clarify if it's suitable for this use case? In essence, I have events streaming in via RPC that I want to feed into a system that gives the above outcome.

I am pretty sure this is exactly the case for using `UNBOUNDED` tables with explicitly defined `ORDER BY` from synnada and others.

However, when I went to look for the documentation, I couldn't find any mention of this use case or of unbounded tables.

### Describe the solution you'd like

I would like to help make it easier for people to use DataFusion for streaming use cases by:

- [ ] Documenting the `UNBOUNDED` keyword [in the `CREATE EXTERNAL TABLE` documentation](https://arrow.apache.org/datafusion/user-guide/sql/ddl.html#create-external-table)
- [ ] Adding a simple streaming example to https://github.com/apache/arrow-datafusion/tree/main/datafusion-examples/examples (perhaps implementing some simple version of the use case described above)
- [ ] Adding a [section to the library guide](https://arrow.apache.org/datafusion/library-user-guide/index.html) giving some basic

### Describe alternatives you've considered

_No response_

### Additional context

_No response_
