[GitHub] [parquet-mr] whcdjj commented on pull request #968: PARQUET-2149: Async IO implementation for ParquetFileReader

via GitHub Thu, 23 Feb 2023 22:48:19 -0800


whcdjj commented on PR #968:
URL: https://github.com/apache/parquet-mr/pull/968#issuecomment-1442878000


   Hi, I am very interested in this optimization and have some questions 
when testing on a cluster with 4 nodes/96 cores using Spark 3.1. Unfortunately, 
I see little improvement.
   I am unsure whether it is necessary to keep 
spark.sql.parquet.enableVectorizedReader = false in Spark when testing with 
Spark 3.2, and how I can set the Parquet buffer size. I would sincerely 
appreciate your advice, @parthchandra.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@parquet.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org
