HyukjinKwon opened a new pull request, #56418:
URL: https://github.com/apache/spark/pull/56418

   ### What changes were proposed in this pull request?
   
   Adds an experimental, documentation-only "Try PySpark in your browser" 
feature. Code blocks marked with the `pyspark-live` class get a **Run** button 
and execute real PySpark right in the page, with no server or install. It runs 
entirely in the
   browser using Pyodide (Python on WebAssembly) and a JVM-free Spark engine 
([sail-wasm](https://github.com/HyukjinKwon/sail-wasm)). It is off by default 
and only turns on when the docs are built with `PYSPARK_DOCS_LIVE` set, so the 
normal docs and the PySpark package are untouched.
   
   This is similar to the in-browser console on the NumPy homepage 
(https://numpy.org/ - see the "Try NumPy" live shell).
   
   [sail-wasm](https://github.com/HyukjinKwon/sail-wasm) is a WebAssembly build 
of [Sail](https://github.com/lakehq/sail) (a Rust, DataFusion-based engine that 
implements the Spark Connect protocol). It runs the serialized Spark Connect 
plans
   the PySpark client produces and returns Arrow results - all in the browser 
tab, no JVM and no backend. The compiled wasm artifact is **not** part of 
Apache Spark; it is hosted separately by me via GitHub Releases, and the docs 
only load it at runtime when the feature is enabled.
   
   ### Why are the changes needed?
   
   So readers can try PySpark instantly from the docs to learn, without setting 
up a
   cluster, installing anything, or waiting for a remote notebook to start.
   
   ### Does this PR introduce _any_ user-facing change?
   
   No. Docs-only, and disabled unless the docs are built with 
`PYSPARK_DOCS_LIVE`.
   
   ### How was this patch tested?
   
   Built the docs both with and without `PYSPARK_DOCS_LIVE`: the default build 
is
   unchanged, and the enabled build shows working "Run" cells.
   
   ### Was this patch authored or co-authored using generative AI tooling?
   
   No.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to