abstractdog commented on PR #5558:
URL: https://github.com/apache/hive/pull/5558#issuecomment-2500042114

   before making this happen, I need to consider one more thing: the aggregated 
data seems to miss some counters
   I saw store_io_throttled=96 logged at the end of a query, when the 
filesystem is closed, which never made its way to the aggregated counters
   this throttled counter is automatically logged by hadoop on filesystem 
closure (fs objects are reused and belong to a single query now), which seems 
to be different then the IO stats collected on the task thread, need to 
investigate if I can merge them
   the challenge is that filesystem closure happens after the dag is completed 
(callback from AM to daemon), so collecting those might mean blocking the query 
itself, regarding which we need to be extra careful, I'm experimenting with 
this now


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org
For additional commands, e-mail: gitbox-h...@hive.apache.org

Reply via email to