This is an automated email from the ASF dual-hosted git repository.
dongjoon pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new 20be0b0bffcb [MINOR][DOCS] Use user-facing feature names in Python
metrics documentation
20be0b0bffcb is described below
commit 20be0b0bffcb3875c50af955ca89b9bd7d59fc8c
Author: Hyukjin Kwon <[email protected]>
AuthorDate: Sat Dec 16 14:14:05 2023 -0800
[MINOR][DOCS] Use user-facing feature names in Python metrics documentation
### What changes were proposed in this pull request?
This PR proposes to use proper user-facing feature names instead of the
names of logical plans in Python metrics documentation.
### Why are the changes needed?
- For easier maintenance of the code (e.g., when a plan is renamed)
- Users are unlikely to know much about the logical plans.
### Does this PR introduce _any_ user-facing change?
Yes, it changes the user-facing documentation for Python metrics.
### How was this patch tested?
CI in this PR should verify it.
### Was this patch authored or co-authored using generative AI tooling?
No.
Closes #44380 from HyukjinKwon/minor-doc-update.
Authored-by: Hyukjin Kwon <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>
---
docs/web-ui.md | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)
diff --git a/docs/web-ui.md b/docs/web-ui.md
index cdf62e0d8ec0..c500860a201b 100644
--- a/docs/web-ui.md
+++ b/docs/web-ui.md
@@ -406,8 +406,8 @@ Here is the list of SQL metrics:
<tr><td> <code>time to build hash map</code> </td><td> the time spent on
building hash map </td><td> ShuffledHashJoin </td></tr>
<tr><td> <code>task commit time</code> </td><td> the time spent on committing
the output of a task after the writes succeed </td><td> any write operation on
a file-based table </td></tr>
<tr><td> <code>job commit time</code> </td><td> the time spent on committing
the output of a job after the writes succeed </td><td> any write operation on a
file-based table </td></tr>
-<tr><td> <code>data sent to Python workers</code> </td><td> the number of
bytes of serialized data sent to the Python workers </td><td> ArrowEvalPython,
AggregateInPandas, BatchEvalPython, FlatMapGroupsInPandas,
FlatMapsCoGroupsInPandas, FlatMapsCoGroupsInPandasWithState, MapInPandas,
PythonMapInArrow, WindowsInPandas </td></tr>
-<tr><td> <code>data returned from Python workers</code> </td><td> the number
of bytes of serialized data received back from the Python workers </td><td>
ArrowEvalPython, AggregateInPandas, BatchEvalPython, FlatMapGroupsInPandas,
FlatMapsCoGroupsInPandas, FlatMapsCoGroupsInPandasWithState, MapInPandas,
PythonMapInArrow, WindowsInPandas </td></tr>
+<tr><td> <code>data sent to Python workers</code> </td><td> the number of
bytes of serialized data sent to the Python workers </td><td> Python UDFs,
Pandas UDFs, Pandas Functions API and Python Data Source </td></tr>
+<tr><td> <code>data returned from Python workers</code> </td><td> the number
of bytes of serialized data received back from the Python workers </td><td>
Python UDFs, Pandas UDFs, Pandas Functions API and Python Data Source </td></tr>
</table>
## Structured Streaming Tab