HyukjinKwon opened a new pull request, #56418: URL: https://github.com/apache/spark/pull/56418
### What changes were proposed in this pull request?

Adds an experimental, documentation-only "Try PySpark in your browser" feature. Code blocks marked with the `pyspark-live` class get a **Run** button and execute real PySpark right in the page, with no server or install. It runs entirely in the browser using Pyodide (Python on WebAssembly) and a JVM-free Spark engine ([sail-wasm](https://github.com/HyukjinKwon/sail-wasm)). It is off by default and only turns on when the docs are built with `PYSPARK_DOCS_LIVE` set, so the normal docs and the PySpark package are untouched.

This is similar to the in-browser console on the NumPy homepage (https://numpy.org/, see the "Try NumPy" live shell).

[sail-wasm](https://github.com/HyukjinKwon/sail-wasm) is a WebAssembly build of [Sail](https://github.com/lakehq/sail) (a Rust, DataFusion-based engine that implements the Spark Connect protocol). It runs the serialized Spark Connect plans the PySpark client produces and returns Arrow results, all in the browser tab, with no JVM and no backend. The compiled wasm artifact is **not** part of Apache Spark; it is hosted separately by me via GitHub Releases, and the docs only load it at runtime when the feature is enabled.

### Why are the changes needed?

So readers can try PySpark instantly from the docs to learn, without setting up a cluster, installing anything, or waiting for a remote notebook to start.

### Does this PR introduce _any_ user-facing change?

No. Docs-only, and disabled unless the docs are built with `PYSPARK_DOCS_LIVE`.

### How was this patch tested?

Built the docs both with and without `PYSPARK_DOCS_LIVE`: the default build is unchanged, and the enabled build shows working "Run" cells.

### Was this patch authored or co-authored using generative AI tooling?

No.

--
This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at: [email protected]
