lgaborini commented on issue #38638: URL: https://github.com/apache/arrow/issues/38638#issuecomment-1833498431
I have a probably related issue where `slice_sample(n = 100)` tends to sample the same rows (out of a Table with 2922121 rows), and from the beginning of the Table. \ The row count always respects `n`. If I specify the expected row count with a proportion: ```r nr <- nrow(tbl_df) slice_sample(tbl_df, prop = 100/nr) ``` I encounter the above issue (not exactly 100 rows but sometimes fewer or more), but the rows are truly randomized. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
