lidavidm commented on pull request #10060: URL: https://github.com/apache/arrow/pull/10060#issuecomment-828839144
> > Note this is somewhat of a regression for CSV files/if you call dim.Dataset in R as now we'll have to scan files instead of just immediately returning NA.
>
> I don't consider this a regression: you're going from "not implemented" to "implemented but not fast".

Fair point. I was worried the slowness would be unexpected, if it's expected to be a cheap operation in R.

I can split out the CSV optimization side and clean this up if we want to get this in sooner (& to keep the size of this down).

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]
