amoeba commented on issue #39139:
URL: https://github.com/apache/arrow/issues/39139#issuecomment-1849194591
Hi @AngelFelizR, thanks for the report and the PR.
The arrow R package already provides a custom routine for `glimpse` for
`FileSystemDataset` objects that avoids bringing the entire dataset into memory
so the timing (>200sec) you're seeing is very strange. On my system, even a
much larger dataset,
> FileSystemDataset with 158 Parquet files
> 1,672,590,319 rows x 24 columns
only takes 1-2 seconds. Can you report back the values you get from these
two functions?
```r
getOption("pillar.width")
getOption("width")
```
I'll try to reproduce what you're seeing with the exact dataset you linked
and also on a Windows machine.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]