[GitHub] [spark] itholic opened a new pull request, #41587: [SPARK-43654][SPARK-43655] Ensure that Spark Connect assigns correct column name

via GitHub Tue, 13 Jun 2023 21:36:57 -0700


itholic opened a new pull request, #41587:
URL: https://github.com/apache/spark/pull/41587


   ### What changes were proposed in this pull request?
   
   This PR fixes the Spark Connect Column so that it is assigned the correct 
column name.
   
   ### Why are the changes needed?
   
   To make the behavior of Spark Connect match that of vanilla PySpark.
   
   In Spark Connect, when a column name is enclosed in backticks, it is used 
as the column name as is, with the backticks included:
   
   ```python
   >>> sdf = spark.range(10)
   >>> sdf["`id`"]
   Column<'`id`'>
   ```
   
   Whereas vanilla PySpark excludes the backticks:
   
   ```python
   >>> sdf = spark.range(10)
   >>> sdf["`id`"]
   Column<'id'>
   ```
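
The difference can be illustrated without a Spark session. The following is a minimal sketch of the kind of normalization the vanilla PySpark output implies, not the code in this PR. The helper name `unquote_column_name` is hypothetical, and the treatment of a doubled backtick as an escaped literal backtick is an assumption based on Spark SQL's quoted-identifier rules.

```python
def unquote_column_name(name: str) -> str:
    """Hypothetical helper: drop one pair of enclosing backticks.

    Assumption: inside a quoted identifier, a literal backtick is
    written as two backticks, as in Spark SQL quoted identifiers.
    """
    if len(name) >= 2 and name.startswith("`") and name.endswith("`"):
        inner = name[1:-1]
        # If a lone backtick remains once the doubled ones are removed,
        # the input is not a single quoted identifier (e.g. "`a`.`b`"),
        # so it is returned unchanged.
        if "`" not in inner.replace("``", ""):
            return inner.replace("``", "`")
    return name


print(unquote_column_name("`id`"))  # → id
```

Under these assumptions, an unquoted name such as `id` passes through unchanged, and a multi-part name such as `` `a`.`b` `` is deliberately left alone rather than guessed at.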
   
   ### Does this PR introduce _any_ user-facing change?
   
   Yes, the Spark Connect Column is now assigned the same name as in vanilla 
PySpark.
   
   ### How was this patch tested?
   
   By reusing the existing unit tests (UT).


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

