alamb opened a new issue, #11336: URL: https://github.com/apache/datafusion/issues/11336
### Is your feature request related to a problem or challenge? As @efredine notes on https://github.com/apache/datafusion/pull/11290 / https://github.com/apache/datafusion/pull/11290#discussion_r1668904890: The in-memory examples are concise and its easy to get the gist of what's going on. But it also throws people in to the deep end of the Arrow format which lacks a gentle introduction IMO. The Arrow-rs documentation gets immediately into the weeds! ### Describe the solution you'd like It's likely that many users might never even need to know or access the arrow format directly. They will just read and write to csv or parquet. I don't think this needs to change, but perhaps what's missing is a section on how and when to use the Arrow format? A gentler introduction to Record Batches ### Describe alternatives you've considered Add a section to the user guide on "a gentle introduction to arrow" ### Additional context here is a ticke tracking such a thing upstream: https://github.com/apache/arrow-rs/issues/4071 I actually think the basic content / structure could be copied from https://jorgecarleitao.github.io/arrow2/main/guide/ with the examples being updated to reflect arrow-rs -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
