nija-at opened a new pull request, #40054:
URL: https://github.com/apache/spark/pull/40054

   ### What changes were proposed in this pull request?
   
   Currently, the Spark Connect service's `client_type` attribute (which is 
really [user
   agent]) is set to `_SPARK_CONNECT_PYTHON` to signify PySpark.
   
   With this change, the connection for the Spark Connect remote accepts an 
optional
   `user_agent` parameter which is then passed down to the service.
   
   [user agent]: https://www.w3.org/WAI/UA/work/wiki/Definition_of_User_Agent
   
   ### Why are the changes needed?
   
   This enables partners using Spark Connect to set their application as the 
user agent,
   which then allows visibility and measurement of integrations and usages of 
spark
   connect.
   
   ### Does this PR introduce _any_ user-facing change?
   
   A new optional `user_agent` parameter is now recognized as part of the Spark 
Connect
   connection string.
   
   ### How was this patch tested?
   
   - unit tests attached
   - manually running the `pyspark` binary with the `user_agent` connection 
string set and
      verifying the payload sent to the server. Similar testing for the default.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to