[GitHub] [arrow] lidavidm commented on pull request #10060: ARROW-9697: [C++][Python][R][Dataset] Add CountRows for Scanner

GitBox Wed, 28 Apr 2021 16:13:17 -0700


lidavidm commented on pull request #10060:
URL: https://github.com/apache/arrow/pull/10060#issuecomment-828839144



   > > Note this is somewhat of a regression for CSV files/if you call 
dim.Dataset in R as now we'll have to scan files instead of just immediately 
returning NA.
   > 
   > I don't consider this a regression: you're going from "not implemented" to 
"implemented but not fast".
   
   Fair point. I was worried the slowness would be unexpected, if it's expected 
to be a cheap operation in R. I can split out the CSV optimization side and 
clean this up if we want to get this in sooner (& to keep the size of this 
down).


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [arrow] lidavidm commented on pull request #10060: ARROW-9697: [C++][Python][R][Dataset] Add CountRows for Scanner

Reply via email to