amoeba commented on issue #39139:
URL: https://github.com/apache/arrow/issues/39139#issuecomment-1849194591

   Hi @AngelFelizR, thanks for the report and the PR.
   
   The arrow R package already provides a custom routine for `glimpse` for 
`FileSystemDataset` objects that avoids bringing the entire dataset into memory 
so the timing (>200sec) you're seeing is very strange. On my system, even a 
much larger dataset,
   
   > FileSystemDataset with 158 Parquet files
   > 1,672,590,319 rows x 24 columns
   
   only takes 1-2 seconds. Can you report back the values you get from these 
two functions?
   
   ```r
   getOption("pillar.width")
   getOption("width")
   ```
   
   I'll try to reproduce what you're seeing with the exact dataset you linked 
and also on a Windows machine.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to