nija-at opened a new pull request, #40054: URL: https://github.com/apache/spark/pull/40054
### What changes were proposed in this pull request? Currently, the Spark Connect service's `client_type` attribute (which is really [user agent]) is set to `_SPARK_CONNECT_PYTHON` to signify PySpark. With this change, the connection for the Spark Connect remote accepts an optional `user_agent` parameter which is then passed down to the service. [user agent]: https://www.w3.org/WAI/UA/work/wiki/Definition_of_User_Agent ### Why are the changes needed? This enables partners using Spark Connect to set their application as the user agent, which then allows visibility and measurement of integrations and usages of spark connect. ### Does this PR introduce _any_ user-facing change? A new optional `user_agent` parameter is now recognized as part of the Spark Connect connection string. ### How was this patch tested? - unit tests attached - manually running the `pyspark` binary with the `user_agent` connection string set and verifying the payload sent to the server. Similar testing for the default. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
