alamb opened a new pull request, #10735:
URL: https://github.com/apache/datafusion/pull/10735

   ## Which issue does this PR close?
   
   Part of https://github.com/apache/datafusion/issues/10453
   
   ## Rationale for this change
   
   Currently arrow_statistics take multiple seconds on my local machine (see 
screenshot)
   <img width="806" alt="Screenshot 2024-05-30 at 6 37 45 PM" 
src="https://github.com/apache/datafusion/assets/490673/657062f8-28c5-4bab-9abb-eaca157db013";>
   
   This is because they make the same parquet file over and over again.
   
   ## What changes are included in this PR?
   
   Update the test to reuse the created parquet files
   
   ## Are these changes tested?
   
   They are all tests. Now they go 2x faster on my local machine:
   
   ![Screenshot 2024-05-31 at 7 05 44 
AM](https://github.com/apache/datafusion/assets/490673/306e7c3f-d134-4c7a-b695-e604de807366)
   
   ## Are there any user-facing changes?
   
   No this is all tests


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to