ianmcook commented on pull request #10014:
URL: https://github.com/apache/arrow/pull/10014#issuecomment-819169195


   > Thanks for doing this. Some initial thoughts. You're covering a lot of the 
"whats" here, and I encourage you to think next of the "whys". The README 
should persuade potential users that there's value in the package for them and 
that they should try it out. I see here that I can read a parquet file, or 
create a RecordBatch, or use dplyr on that, but why would I want to do that?
   
   I added a new section "What can `arrow` do?" at the top. This effectively 
answers the question "Why should I use this?" with "Because it can..." and a 
list of the major capabilities, each in just a few words.
   
   I moved all the details about the metadata and data objects to the "Using 
the Arrow C++ Library in R" vignette, where I think they fit better and are 
valuable as a reference for more advanced users. Here in the README, I retained 
only two short bullets describing what `Table` and `Dataset` are for, in less 
technical terms that explain what they do that R data frames don't.
   
   I expanded the dplyr example into a broader "Usage" section in which the 
dplyr content is one subsection.

