ianmcook commented on pull request #10014: URL: https://github.com/apache/arrow/pull/10014#issuecomment-819169195
> Thanks for doing this. Some initial thoughts. You're covering a lot of the "whats" here, and I encourage you to think next of the "whys". The README should persuade potential users that there's value in the package for them and that they should try it out. I see here that I can read a Parquet file, or create a RecordBatch, or use dplyr on that, but why would I want to do that?

I added a new section "What can `arrow` do?" at the top. This effectively answers the question "Why should I use this?" with "Because it can..." and a list of the major capabilities, each in just a few words.

I moved all the details about the metadata and data objects to "Using the Arrow C++ Library in R", where I think they fit better and are valuable as a reference for more advanced users. Here in the README, I retained only two short bullets describing what `Table` and `Dataset` are for, in terms that are less technical and aimed at explaining what they do that R data frames don't.

I expanded the dplyr example into a broader "Usage" section in which the dplyr content is one subsection.