jihoonson commented on pull request #10407:
URL: https://github.com/apache/druid/pull/10407#issuecomment-696438112


   > @jihoonson The proposal looks great, I did this PR because bytes ingested 
is not available for streaming or batch tasks. I see your proposal only 
includes metrics for batch tasks, probably we can do another PR for emitting 
bytes ingested for kafka/kinesis directly through `FireDepartment`.
   
   Yeah, it sounds reasonable to me to add bytes ingested for both streaming 
and batch. My proposal only talks about metrics for batch, but I have also been 
thinking about the metrics system for ingestion in general. Currently, both 
batch and streaming ingestion use `FireDepartmentMetrics`, which was designed 
for the now-deprecated `RealtimeIndexTask`. I'm not sure it's good to 
continue using it. Rather, I think batch and streaming tasks should use 
different classes for their own metrics, since you will want to see different 
metrics depending on your ingestion type (e.g., there is no handoff in batch 
ingestion, while handoff time could be important in streaming ingestion). To do 
so, I have been thinking of adding new classes, each of which defines _all_ 
metrics useful for batch or streaming ingestion, respectively. I'm not sure why 
we should keep metrics separately in different classes, such as the bytes 
ingested in `InputStats` added in this PR. Does this make sense?
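
   To make the idea of separate metrics classes concrete, here is a rough 
sketch. The class names `BatchIngestionMetrics` and `StreamingIngestionMetrics` 
and all of their methods are hypothetical; they are not existing Druid classes 
and not the final design, only an illustration of keeping all metrics for one 
ingestion type in one place.

```java
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical sketch: all metrics relevant to batch ingestion in one class.
// Batch ingestion has no handoff, so no handoff metric appears here.
final class BatchIngestionMetrics {
    private final AtomicLong bytesRead = new AtomicLong();
    private final AtomicLong bytesWritten = new AtomicLong();
    private final AtomicLong rowsProcessed = new AtomicLong();

    void addBytesRead(long n) { bytesRead.addAndGet(n); }
    void addBytesWritten(long n) { bytesWritten.addAndGet(n); }
    void incrementRowsProcessed() { rowsProcessed.incrementAndGet(); }

    long getBytesRead() { return bytesRead.get(); }
    long getBytesWritten() { return bytesWritten.get(); }
    long getRowsProcessed() { return rowsProcessed.get(); }
}

// Hypothetical sketch: all metrics relevant to streaming ingestion,
// including handoff time, which only makes sense for streaming tasks.
final class StreamingIngestionMetrics {
    private final AtomicLong bytesRead = new AtomicLong();
    private final AtomicLong rowsProcessed = new AtomicLong();
    private final AtomicLong handoffTimeMillis = new AtomicLong();

    void addBytesRead(long n) { bytesRead.addAndGet(n); }
    void incrementRowsProcessed() { rowsProcessed.incrementAndGet(); }
    void addHandoffTimeMillis(long millis) { handoffTimeMillis.addAndGet(millis); }

    long getBytesRead() { return bytesRead.get(); }
    long getRowsProcessed() { return rowsProcessed.get(); }
    long getHandoffTimeMillis() { return handoffTimeMillis.get(); }
}
```

   With this shape, bytes ingested would be one field among the others in each 
class rather than living in a separate holder.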
   
   > I believe the bytes ingested reported through your proposed changes will 
be all types of `InputEntity` and will include the bytes read for determining 
partitions as well as indexing?
   
   Yes, correct. More precisely, most metrics, including both the bytes read 
and the bytes written, will be available for each individual phase (determining 
partitions, indexing, etc.) as well as in the overall metrics across all phases.
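
   As a rough illustration of per-phase metrics plus an overall total, 
something like the following could work. The names `IngestionPhase`, 
`PhaseMetrics`, and `PerPhaseMetrics` are hypothetical and exist only for this 
sketch; the actual proposal may structure this differently.

```java
import java.util.EnumMap;
import java.util.Map;

// Hypothetical phases of a batch task; a real task may have more.
enum IngestionPhase { DETERMINE_PARTITIONS, INDEXING }

// Hypothetical holder for the bytes read and written in a single phase.
final class PhaseMetrics {
    private long bytesRead;
    private long bytesWritten;

    void add(long read, long written) {
        bytesRead += read;
        bytesWritten += written;
    }

    long getBytesRead() { return bytesRead; }
    long getBytesWritten() { return bytesWritten; }
}

// Hypothetical aggregate: metrics per phase, plus totals across all phases.
// Not thread-safe; a real implementation would need to handle concurrency.
final class PerPhaseMetrics {
    private final Map<IngestionPhase, PhaseMetrics> byPhase =
        new EnumMap<>(IngestionPhase.class);

    void record(IngestionPhase phase, long read, long written) {
        byPhase.computeIfAbsent(phase, p -> new PhaseMetrics()).add(read, written);
    }

    // Bytes read in one phase; 0 if nothing was recorded for it.
    long bytesRead(IngestionPhase phase) {
        PhaseMetrics m = byPhase.get(phase);
        return m == null ? 0L : m.getBytesRead();
    }

    // Overall totals are sums over every phase recorded so far.
    long totalBytesRead() {
        return byPhase.values().stream().mapToLong(PhaseMetrics::getBytesRead).sum();
    }

    long totalBytesWritten() {
        return byPhase.values().stream().mapToLong(PhaseMetrics::getBytesWritten).sum();
    }
}
```

   Note that in this sketch the overall bytes read counts the input once per 
phase that reads it, so input scanned for both determining partitions and 
indexing contributes twice to the total.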

