tusharchou commented on issue #2971: URL: https://github.com/apache/iceberg-python/issues/2971#issuecomment-3806652953
**Context** With the recent merger of PR #2528, the repository now supports `make notebook` and `make notebook-infra` commands. These provide an instant, containerized environment for users to test PyIceberg with and without a full Spark infrastructure. **Problem** The current "Getting Started" documentation does not yet reflect these new capabilities. New users might still be manually configuring environments or struggling to set up Spark, unaware that a single command can now provision a full lab with PyIceberg, Spark Connect, MinIO, and a REST catalog. **Proposal** We should update the documentation (specifically the `README.md` and the "Getting Started" site page) to highlight this new workflow. **Suggested Changes:** 1. **PyIceberg Quickstart:** Add a section explaining how to use `make notebook` to quickly launch a Jupyter environment for exploring the PyIceberg API. 2. **Full Integration Demo:** Add a section explaining `make notebook-infra` for users who want to test PyIceberg with a local Spark cluster (using Spark Connect) and MinIO. 3. **Reference Example Notebooks:** Link to the new `notebooks/pyiceberg_example.ipynb` and `notebooks/spark_integration_example.ipynb` as the recommended starting points. **Benefits** * Significantly lowers the barrier to entry for new contributors and users. * Standardizes the local development environment. * Showcases the PyIceberg + Spark Connect integration out of the box. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
