Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-09 Thread via GitHub


dongjoon-hyun commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2045379643

   Thank you so much for making this PR and letting us conclude this topic, 
@pan3793 !


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-09 Thread via GitHub


pan3793 closed pull request #45852: Revert: [SPARK-47040][CONNECT] Allow Spark 
Connect Server Script to wait
URL: https://github.com/apache/spark/pull/45852


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-09 Thread via GitHub


pan3793 commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2044429278

   Thanks @dongjoon-hyun and all participate in this discussion, I think we 
already reached a consensus to promote `–wait` to all service for consistency, 
created SPARK-47778 and close this PR then.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-08 Thread via GitHub


dongjoon-hyun commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2043258249

   Hi, @pan3793 and all. Shall we conclude this thread?
   
   Given the discussion, I believe we can close this PR and move forward to 
generalize `--wait` for all services (if someone wants to implement that for 
consistency.)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-06 Thread via GitHub


HyukjinKwon commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2041222711

   I haven't looked through the code very closely but at high level seems fine 
to have that from a cursory look.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-06 Thread via GitHub


pan3793 commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2041131686

   @HyukjinKwon so you are good to promote the `--wait` to all services, right?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-06 Thread via GitHub


pan3793 commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2041131626

   @HyukjinKwon so you are good to promote the `--wait` to all services, right?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-04 Thread via GitHub


HyukjinKwon commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2038584100

   The examples you mentioned are mostly for development purpose. The examples 
I provided are more for user-facing. I think this `--wait` is the similar case. 
I think it's fine to add an easier option for those options users are expected 
to use often.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-04 Thread via GitHub


pan3793 commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2037913897

   @grundprinzip I appreciate your great contributions to Spark Connect and 
improving development efficiency, I just started exploring this new feature. I 
strongly agree that we should pay attention to improving development 
efficiency. But let's evaluate it relatively carefully for user-facing changes. 
Spark is a great product that has been brilliant for over a decade, we should 
respect the existing services, tools, and user habits when making user-facing 
changes, and I believe a consistent user experience is an important element for 
such a great product.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-04 Thread via GitHub


pan3793 commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2037707135

   > Mind pointing out the examples explicitly?
   
   @HyukjinKwon To clarify, the assumption is limited to dev/test purposes, not 
to end users. Examples: 
   - SPARK_PREPEND_CLASSES
   - SPARK_GENERATE_GOLDEN_FILES
   - SKIP_MIMA
   - SKIP_RDOC
   - SKIP_PYTHONDOC
   - SKIP_SQLDOC
   
   @gengliangwang I think it's also good if the community reaches a consensus 
to promote the `--wait` to all services, like SHS, STS.
   
   Another simple way is mentioning `SPARK_NO_DAEMONIZE` in `sbin/` scripts' 
help message.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-03 Thread via GitHub


HyukjinKwon commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2035811556

   > There are several server components in Spark, like Thrift Server, History 
Server, Master, Worker, only adding --wait on the Connect Server makes things 
inconsistent, while env vars approach is adopted widely in Spark for different 
development/test purposes.
   
   Mind pointing out the examples explicitly?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-03 Thread via GitHub


gengliangwang commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2035714026

   @pan3793 @dongjoon-hyun @grundprinzip How about creating an umbrella jira to 
support option `--wait` in the start scripts under `sbin/` dir? 
   1. There are just a few of them.
   2. Having such a command line option would be helpful. Many users just run 
`./sbin/sript -h` to see the available options and they may never be aware of 
the env variables.
   3. These new tasks would be great for new Spark developers.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-03 Thread via GitHub


pan3793 commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2035283307

   > "the other servers don't have this option"
   
   @grundprinzip That's the point, it violates the consistency. The option is 
exposed to end users, from the user's point of view, if 
`./sbin/start-connect-server.sh --wait` works, why 
`./sbin/start-thriftserver.sh --wait` does not work? What if I want to add a 
`--debug` option to `./sbin/start-thriftserver.sh --debug`, things go to mess 
without restrictions.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-03 Thread via GitHub


grundprinzip commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2035258484

   I absolutely appreciate the feedback. I spend most of my time developing 
Spark Connect and I can only attest that this option is very useful to me. I 
have not seen any reasoning besides "I prefer environment variables" and "the 
other servers don't have this option".
   
   I invite everyone to spend time developing features for Spark Connect (in 
particular Scala) and it will be obvious why this option is really helpful. 
Maybe the lack of experience in this dev-loop is driving this decision.
   
   But I stand by my opinion that improving developer efficiency and first 
experience should not be underestimated.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-03 Thread via GitHub


dongjoon-hyun commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2035225318

   Hi, @grundprinzip . Yes, we are debating in a healthy way. There is nothing 
wrong here. This is a part of healthy community process .
   
   For this part, I would not exclude @pan3793 from the user base.
   > nobody was complaining about it from the user base,


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-03 Thread via GitHub


grundprinzip commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2035211798

   I don't fully understand why this is an issue. It's a convenience option for 
users. Folks are used to run `--help` on command line scripts and see useful 
output. For developers in particular looking at the spark log while running the 
Spark Connect service is very helpful. Knowing that an environment variable 
exists is much more effort than calling `--help`.
   
   The part that makes it even harder for me is that this has caused no issue, 
no harm, nobody was complaining about it from the user base, but we debate 
wether or not environment variables are better than command line options.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-03 Thread via GitHub


yaooqinn commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2034932677

   Thank you for pinging me @dongjoon-hyun.
   
   Just my humble opinion. Although users can use this feature, it is primarily 
intended for use by the Spark Connect developers, right? It's low-maintenance, 
so I think we can take care of their habits. Both sys environments and command 
line options have their advantages. If a developer frequently needs to switch 
between the frontend and backend during the development process, suffixed 
`--wait` would be more convenient as it can be easily backspaced or 
auto-suggested.
   
   Well, unfortunately, `--wait` is '$1' now. How about making it more lenient 
about the position?
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-03 Thread via GitHub


pan3793 commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2034776090

   Additionally, `SPARK_NO_DAEMONIZE` is listed in `spark-env.sh.template`
   
   ```
   ➜  conf cat spark-env.sh.template | grep SPARK_NO_DAEMONIZE
   # - SPARK_NO_DAEMONIZE  Run the proposed command in the foreground. It will 
not output a PID file.
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-03 Thread via GitHub


pan3793 commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2034771042

   > ... because it's not properly documented ...
   
   you can search SPARK_PREPEND_CLASSES on 
https://spark.apache.org/developer-tools.html
   
   > ... just use env var and update 
https://spark.apache.org/developer-tools.html to mention that.
   
   And I do emphasize in the PR description to update the docs to mention the 
env var `SPARK_NO_DAEMONIZE` usage.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-03 Thread via GitHub


pan3793 commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2034516901

   `SPARK_NO_DAEMONIZE=1 ./sbin/start-connect-server.sh` vs 
`./sbin/start-connect-server.sh --wait`. I don't see much benefit for the latter


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] Revert: [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark]

2024-04-03 Thread via GitHub


pan3793 commented on PR #45852:
URL: https://github.com/apache/spark/pull/45852#issuecomment-2034511867

   cc @dongjoon-hyun @HyukjinKwon @grundprinzip @LuciferYang 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org