lgaborini commented on issue #38638:
URL: https://github.com/apache/arrow/issues/38638#issuecomment-1833498431

   I have a probably related issue: `slice_sample(n = 100)` tends to return the 
same rows on every call (out of a Table with 2922121 rows), and they come from 
the beginning of the Table.
   The row count always respects `n`.
   
   If I specify the expected row count with a proportion:
   
   ```r
   nr <- nrow(tbl_df)
   slice_sample(tbl_df, prop = 100/nr)
   ```
   
   In that case I do encounter the issue reported above (the result has not 
exactly 100 rows, but sometimes fewer or more); the rows, however, are truly 
randomized.
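
   The varying row count would be consistent with each row being kept 
independently with probability `prop`. I have not verified that this is what 
arrow does internally, so the base R sketch below only illustrates that 
assumption; `sample_by_prop` is a hypothetical helper, not an arrow function.

   ```r
   # Hypothetical helper, NOT arrow's implementation: keep each row
   # independently with probability `prop` (an assumption for illustration).
   sample_by_prop <- function(n_rows, prop) {
     which(runif(n_rows) < prop)
   }

   set.seed(1)
   nr <- 2922121

   # Number of rows kept in five independent draws. The expected value is 100,
   # but each draw follows a Binomial(nr, 100 / nr) distribution.
   counts <- replicate(5, length(sample_by_prop(nr, 100 / nr)))
   print(counts)
   ```

   Under that assumption the count has mean 100 and a standard deviation of 
about 10, so getting somewhat fewer or more than 100 rows would be expected.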


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
