This is an automated email from the ASF dual-hosted git repository.
gurwls223 pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new ac935f5074d [SPARK-46427][PYTHON][SQL] Change Python Data Source's
description to be pretty in explain
ac935f5074d is described below
commit ac935f5074daac3fc2511f196b32c98007b61e53
Author: Hyukjin Kwon <[email protected]>
AuthorDate: Fri Dec 15 22:02:31 2023 -0800
[SPARK-46427][PYTHON][SQL] Change Python Data Source's description to be
pretty in explain
### What changes were proposed in this pull request?
This PR implements `Scan.description` so it has a better string description
in `DataFrame.explain`.
```python
spark.table("pythonSourceTable").explain(True)
```
Before:
```
== Physical Plan ==
*(1) Project [x#0, y#1]
+- BatchScan test[x#0, y#1] class
org.apache.spark.sql.execution.python.PythonTableProvider$$anon$1$$anon$2
RuntimeFilters: []
```
After:
```
== Physical Plan ==
*(1) Project [x#0, y#1]
+- BatchScan test[x#0, y#1] (Python) RuntimeFilters: []
```
### Why are the changes needed?
Now it shows the class name for nested classes, which isn't quite pretty.
### Does this PR introduce _any_ user-facing change?
It changes the plan description but the main change has not been released
out yet. So no.
### How was this patch tested?
Manually tested as above.
### Was this patch authored or co-authored using generative AI tooling?
No.
Closes #44379 from HyukjinKwon/SPARK-46427.
Authored-by: Hyukjin Kwon <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>
---
.../apache/spark/sql/execution/python/UserDefinedPythonDataSource.scala | 2 ++
1 file changed, 2 insertions(+)
diff --git
a/sql/core/src/main/scala/org/apache/spark/sql/execution/python/UserDefinedPythonDataSource.scala
b/sql/core/src/main/scala/org/apache/spark/sql/execution/python/UserDefinedPythonDataSource.scala
index 5e978a90088..047a133a322 100644
---
a/sql/core/src/main/scala/org/apache/spark/sql/execution/python/UserDefinedPythonDataSource.scala
+++
b/sql/core/src/main/scala/org/apache/spark/sql/execution/python/UserDefinedPythonDataSource.scala
@@ -101,6 +101,8 @@ class PythonTableProvider extends TableProvider {
new PythonPartitionReaderFactory(
source, readerFunc, outputSchema, jobArtifactUUID)
}
+
+ override def description: String = "(Python)"
}
}
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]