gianm commented on code in PR #14582:
URL: https://github.com/apache/druid/pull/14582#discussion_r1267351914


##########
docs/operations/metrics.md:
##########
@@ -210,10 +210,12 @@ batch ingestion emit the following metrics. These metrics 
are deltas for each em
 
 |Metric|Description|Dimensions|Normal Value|
 |------|-----------|----------|------------|
-|`ingest/events/thrownAway`|Number of events rejected because they are either 
null, or filtered by the transform spec, or outside the 
windowPeriod.|`dataSource`, `taskId`, `taskType`, `groupId`, `tags`|0|
+|`ingest/events/processed`|Number of events processed per emission 
period.|`dataSource`, `taskId`, `taskType`, `groupId`, `tags`|Equal to the 
number of events per emission period.|
+|`ingest/events/processedWithError`|Number of events processed with some 
partial errors per emission period. Events processed with partial errors are 
counted towards both this metric and `ingest/events/processed`.|`dataSource`, 
`taskId`, `taskType`, `groupId`, `tags`|0|
 |`ingest/events/unparseable`|Number of events rejected because the events are 
unparseable.|`dataSource`, `taskId`, `taskType`, `groupId`, `tags`|0|
+|`ingest/events/thrownAway`|Number of events rejected because they are either 
null, or filtered by the transform spec, or outside the 
windowPeriod.|`dataSource`, `taskId`, `taskType`, `groupId`, `tags`|0|
 |`ingest/events/duplicate`|Number of events rejected because the events are 
duplicated.|`dataSource`, `taskId`, `taskType`, `groupId`, `tags`|0|
-|`ingest/events/processed`|Number of events successfully processed per 
emission period.|`dataSource`, `taskId`, `taskType`, `groupId`, `tags`|Equal to 
the number of events per emission period.|
+|`ingest/input/bytes`|Number of bytes read from input sources, after 
decompression but prior to parsing. This covers all data read, including data 
that does not end up being fully processed and ingested. For example, this 
includes data that ends up being rejected for being unparseable or filtered 
out.|`dataSource`, `taskId`, `taskType`, `groupId`, `tags`|Depends on amount of 
data read.|

Review Comment:
   We don't seem consistent about these: some others do both "depends on X" and 
"depends on the X". Anyway, I'll change it, since "depends on the X" seems 
slightly more common.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to