parthchandra commented on PR #968:
URL: https://github.com/apache/parquet-mr/pull/968#issuecomment-1445230916

   Looks correct to me. Couple of questions, are you running this on a cluster 
or on local system? Also, is the data on SSD's? If you are on a single machine, 
there might not be enough CPU and the async threads may be contending for time 
with the processing threads. We'll need some profiling info to get a better 
diagnosis. Also, with SSD's, reads are so fast from the file system that the 
async feature might show very little improvement. 
   You could turn on the debug logging level for FilePageReader and 
AsyncMultibufferInputStream.
   We can continue this in a thread outside this PR. 
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@parquet.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to