stephhazlitt commented on code in PR #14514:
URL: https://github.com/apache/arrow/pull/14514#discussion_r1029854271
##########
r/vignettes/dataset.Rmd:
##########
@@ -1,157 +1,100 @@
---
-title: "Working with Arrow Datasets and dplyr"
+title: "Working with multi-file data sets"
+description: >
+ Learn how to use Datasets to read, write, and analyze
+ multi-file larger-than-memory data
output: rmarkdown::html_vignette
-vignette: >
- %\VignetteIndexEntry{Working with Arrow Datasets and dplyr}
- %\VignetteEngine{knitr::rmarkdown}
- %\VignetteEncoding{UTF-8}
---
-Apache Arrow lets you work efficiently with large, multi-file datasets.
-The arrow R package provides a [dplyr](https://dplyr.tidyverse.org/) interface to Arrow Datasets,
-and other tools for interactive exploration of Arrow data.
+Apache Arrow lets you work efficiently with multi-file data sets even when that data set is too large to be loaded into memory. With the help of Arrow Dataset objects you can analyze this kind of data using familiar [dplyr](https://dplyr.tidyverse.org/) syntax. This article introduces Datasets and shows you how to analyze them with dplyr and arrow: we'll start by ensuring both packages are loaded
Review Comment:
```suggestion
Apache Arrow lets you work efficiently with single- and multi-file data sets,
even when a data set is too large to be loaded into memory. With the help of
Arrow Dataset objects, you can analyze this kind of data using familiar
[dplyr](https://dplyr.tidyverse.org/) syntax. This article introduces Datasets
and shows you how to analyze them with dplyr and arrow: we'll start by ensuring
both packages are loaded
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.