Well 1. Spark Connect workers don't directly connect to the Spark Connect server. 2. Client applications use the Spark Connect API to interact with the Spark cluster through the server. 3. I suggest focusing on developing client applications that leverage the Spark Connect API.
The differences between Spark Standalone and Spark Connect: In Standalone mode, workers connect directly to the Spark master using the spark:// protocol and the master's hostname and port. Spark Connect uses a different architecture. Workers don't directly connect to the Spark Connect server. Instead, client applications connect to the Spark Connect server, which then interacts with the Spark cluster on their behalf. HTH Mich Talebzadeh, Architect | Data Science | Financial Crime | Forensic Analysis | GDPR view my Linkedin profile <https://www.linkedin.com/in/mich-talebzadeh-ph-d-5205b2/> On Thu, 23 Jan 2025 at 18:05, Andrew Petersen <aapet...@ncsu.edu.invalid> wrote: > Hello Spark community > > I am trying to connect a worker to the connect server. > > Following the documentation, I am able to get the spark-connect server to > run in the simple one node way. > > > Am I correct to assume that the spark-connect server can work with spark > workers? If so, how do I connect a spark worker to a spark-connect server? > I have a standalone spark setup, and I am used to using scripts that start > worker daemons and connect them to the master. I tried connecting a worker > to a connect server similar to how I would do it with a master. > > > First I start the connector service: > > $SPARK_HOME/sbin/start-connect-server.sh --packages > org.apache.spark:spark-connect_2.12:3.5.4 > > > Then I try to connect a worker: > > $SPARK_HOME/sbin/spark-daemon.sh start > org.apache.spark.deploy.worker.Worker 1 spark://nxxcxx:15002 > > However, I get an error: > > 25/01/23 11:57:19 INFO Worker: Connecting to master cxxnxx:15002... > > 25/01/23 11:57:19 INFO TransportClientFactory: Successfully created > connection to cxxnxx/192.xxx.x.xxx:15002 after 38 ms (0 ms spent in > bootstraps) > > 25/01/23 11:57:20 WARN TransportChannelHandler: Exception in connection > from cxxxnxx/192.xxx.x.xx:15002 > > java.lang.IllegalArgumentException: Too large frame: 19808389169144 > > > > > -- > Andrew Petersen, PhD > Advanced Computing, Office of Information Technology > 2620 Hillsborough Street > datascience.oit.ncsu.edu >