alamb opened a new issue, #11336:
URL: https://github.com/apache/datafusion/issues/11336

   ### Is your feature request related to a problem or challenge?
   
   As @efredine notes on https://github.com/apache/datafusion/pull/11290 /
https://github.com/apache/datafusion/pull/11290#discussion_r1668904890:
   
   The in-memory examples are concise and it's easy to get the gist of what's 
going on. But they also throw people into the deep end of the Arrow format, 
which lacks a gentle introduction IMO. The arrow-rs documentation gets 
immediately into the weeds!
   
   
   
   
   ### Describe the solution you'd like
   
   
   It's likely that many users will never need to know about or access the 
Arrow format directly. They will just read and write CSV or Parquet.
   
   I don't think this needs to change, but perhaps what's missing is a section 
on how and when to use the Arrow format: a gentler introduction to Record 
Batches.
   
   
   
   ### Describe alternatives you've considered
   
   
   Add a section to the user guide on "a gentle introduction to Arrow".
   
   ### Additional context
   
   Here is a ticket tracking such a thing upstream: 
https://github.com/apache/arrow-rs/issues/4071
   
   I actually think the basic content / structure could be copied from 
https://jorgecarleitao.github.io/arrow2/main/guide/ with the examples 
updated to reflect arrow-rs.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

