Re: [PR] chore: Add custom metric for native shuffle fetching batches from JVM [datafusion-comet]

via GitHub Thu, 21 Nov 2024 11:55:09 -0800


viirya commented on code in PR #1108:
URL: https://github.com/apache/datafusion-comet/pull/1108#discussion_r1852759262



##########
docs/source/user-guide/tuning.md:
##########
@@ -105,8 +105,15 @@ then any shuffle operations that cannot be supported in this mode will fall back
 
 ## Metrics
 
-Comet metrics are not directly comparable to Spark metrics in some cases.
+Some Comet metrics are not directly comparable to Spark metrics in some cases.
 
 `CometScanExec` uses nanoseconds for total scan time. Spark also measures scan time in nanoseconds but converts to
-milliseconds _per batch_ which can result in a large loss of precision. In one case we saw total scan time
-of 41 seconds reported as 23 seconds for example.
+milliseconds _per batch_ which can result in a large loss of precision.
+
+Comet also adds some custom metrics:
+
+### ShuffleWriterExec
+
+| Metric           | Description |
+| ---------------- | ----------- |
+| `jvm_fetch_time` | Measure the time it takes for `ShuffleWriterExec` to fetch an existing batch from the JVM. Note that this does not include the execution time of the query that produced the input batch. |

Review Comment:
   It looks like the new metric measures the total time spent fetching all batches, not just a single batch?
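
   To make the distinction concrete, here is a minimal sketch (not Comet or Spark code) of a timing metric that accumulates over every batch, and of how converting to milliseconds per batch can lose precision compared with converting once at the end. The function names and the workload are hypothetical, and the sketch assumes the per-batch conversion truncates to whole milliseconds.

```python
def total_ms_from_nanos(batch_nanos):
    """Accumulate nanoseconds over all batches, then convert once."""
    return sum(batch_nanos) // 1_000_000


def total_ms_per_batch(batch_nanos):
    """Convert each batch to whole milliseconds first (assumed to
    truncate), then accumulate the already-rounded values."""
    return sum(n // 1_000_000 for n in batch_nanos)


# Hypothetical workload: 10,000 batches, each taking 1.9 ms.
batch_nanos = [1_900_000] * 10_000

print(total_ms_from_nanos(batch_nanos))  # 19000
print(total_ms_per_batch(batch_nanos))   # 10000
```

   In both functions the reported value is a running total across all batches, which is the reading of `jvm_fetch_time` suggested above; only the point at which the unit conversion happens differs.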



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

