[I] [R] [Docs] Improve documentation of `col_types` [arrow]

via GitHub Mon, 27 Nov 2023 14:15:06 -0800


assignUser opened a new issue, #38903:
URL: https://github.com/apache/arrow/issues/38903


   ### Describe the enhancement requested
   
   In a recent 
[SO](https://stackoverflow.com/questions/77557377/how-to-convert-int-to-double-when-using-arrow-to-read-in-multiple-csvs-with-open)
 question about using partial schemas in `open_dataset` (which is possible 
using `col_types`), even a seasoned arrow user did not know about the proper 
solution.
   
   The docs for `open_dataset` hide many of the more specialized options behind 
`...`, and it's not obvious how to find them, as the linked dataset factory 
page doesn't show all the possibilities either. Some are explained in the 
specialized wrapper functions like 
https://arrow.apache.org/docs/r/reference/open_delim_dataset.html or 
https://arrow.apache.org/docs/r/reference/csv_convert_options.html, but even 
there `col_types` is not described in a way that makes it obvious that it can 
be used to pass in partial schemas. 
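
   To illustrate the usage in question, here is a minimal sketch of passing a 
partial schema through `col_types`. The file names, the column names (`id`, 
`value`), and the temporary directory are hypothetical and only serve the 
example; it assumes a reasonably recent version of the arrow R package:

```r
library(arrow)

# Hypothetical example data: `value` looks like an integer in the first
# file but holds doubles in the second one.
dir <- tempfile("csv_ds_")
dir.create(dir)
write.csv(data.frame(id = 1:2, value = c(1L, 2L)),
          file.path(dir, "a.csv"), row.names = FALSE)
write.csv(data.frame(id = 3:4, value = c(3.5, 4.5)),
          file.path(dir, "b.csv"), row.names = FALSE)

# Partial schema: only `value` is given a type here;
# the type of `id` is still inferred from the data.
ds <- open_dataset(
  dir,
  format = "csv",
  col_types = schema(value = float64())
)

# Inspect the resulting schema of the dataset.
print(ds$schema)
```

   Only the columns named in `col_types` are overridden, so there should be no 
need to spell out a full schema for every column.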
   
   At a minimum, the doc strings for `col_types` should make the intended use 
case clear; ideally we should link to the detailed descriptions from 
`open_dataset` or find another way to document the possible options more 
visibly.
   
   ### Component(s)
   
   Documentation, R


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org
