whcdjj commented on PR #968:
URL: https://github.com/apache/parquet-mr/pull/968#issuecomment-1446487715

   > Looks correct to me. Couple of questions, are you running this on a 
cluster or on local system? Also, is the data on SSD's? If you are on a single 
machine, there might not be enough CPU and the async threads may be contending 
for time with the processing threads. We'll need some profiling info to get a 
better diagnosis. Also, with SSD's, reads are so fast from the file system that 
the async feature might show very little improvement. You could turn on the 
debug logging level for FilePageReader and AsyncMultibufferInputStream. We can 
continue this in a thread outside this PR.
   
   OK,I will test with the debug logging for more details these days.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@parquet.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to