This is exactly the same case with the Py4J gateway server. We can easily implement that - I am one of the maintainers of Py4J fwiw and running a local Spark Connect server is already there apart from the PR https://github.com/apache/spark/pull/49107.
On Thu, 6 Feb 2025 at 10:40, Adam Binford <adam...@gmail.com> wrote: > -1 (non-binding) for me. I've commented on the PR for this ( > https://github.com/apache/spark/pull/49107), but in its current state > this seems like it would introduce a massive security vulnerability. If a > user launches a "Spark Connect enabled" cluster deploy mode job in a > multi-tenant YARN cluster, it will launch a wide open Spark Connect server > alongside the driver on any given compute host. Any other users could then > connect to this server and do whatever they wanted using the other users > credentials. If this issue is addressed I would change to 0. > > Best case scenario this was a small oversight that would have introduced a > major vulnerability, worst case scenario this was a coordinated effort to > slip a backdoor into a widely used application. Either way, this does not > lend itself to something that should be enabled by default without > rigorous testing in real world scenarios. > > This is just my opinion, but I don't understand why these conversations > have been happening for so long and this feature _still isn't even > available yet_. Having the feature be complete and available for user > testing seems like it should be a prerequisite to any discussion of making > it the default behavior, otherwise nobody knows exactly what the behavior > is you are trying to make the default. > > Adam > > On Wed, Feb 5, 2025 at 11:51 AM Chao Sun <sunc...@apache.org> wrote: > >> +1 >> >> On Wed, Feb 5, 2025 at 8:42 AM Martin Grund <mar...@databricks.com.invalid> >> wrote: >> >>> +1 >>> >>> On Wed, Feb 5, 2025 at 17:15 bo yang <bobyan...@gmail.com> wrote: >>> >>>> +1 (non-binding) >>>> >>>> On Wed, Feb 5, 2025 at 7:51 AM Jules Damji <jules.da...@gmail.com> >>>> wrote: >>>> >>>>> +1 (non-binding) >>>>> >>>>> Excuse the thumb typos >>>>> >>>>> >>>>> On Tue, 04 Feb 2025 at 11:06 PM, Wenchen Fan <cloud0...@gmail.com> >>>>> wrote: >>>>> >>>>>> Hi all, >>>>>> >>>>>> Given the positive feedback in the previous DISCUSS email >>>>>> <https://lists.apache.org/thread/loo1r84ovrzpskkn9cfmjfb0vwx4xnrq>, >>>>>> I'd like to start the vote for the proposal "Publish additional Spark >>>>>> distribution with Spark Connect enabled". >>>>>> >>>>>> Please vote for the next 72 hours: >>>>>> >>>>>> [ ] +1: Accept the proposal >>>>>> [ ] +0 >>>>>> [ ]- 1: I don’t think this is a good idea because … >>>>>> >>>>>> Best, >>>>>> Wenchen Fan >>>>>> >>>>> > > -- > Adam Binford >