alamb commented on PR #7461:
URL: https://github.com/apache/arrow-rs/pull/7461#issuecomment-2848051135

   > > > Thank you @alamb for this work; it seems we still need to investigate more. 
Is it possible to take a 10% sample of hits.parquet and run some 
benchmarks on the arrow-rs side?
   > > 
   > > 
   > > That is an interesting idea -- to make a benchmark in arrow-rs that 
runs against hits.parquet (and hits_partitioned) directly 🤔 (and which, e.g., 
could require downloading those files before running).
   > 
   > I am trying to do the first step. Maybe we can download one partition of 
hits.parquet and use it as the data set for arrow-rs. Since there are 100 
partition files, one partition is about 1% of the data, which could serve as a mock.
   
   I have been thinking about it too -- I probably won't have a chance this 
weekend, but I can work on it next week.


