GizzZmo opened a new pull request, #52656: URL: https://github.com/apache/spark/pull/52656
This pull request adds comprehensive top-level documentation to the repository, making it easier for both users and developers to find, understand, and navigate Spark's documentation and architecture. The main changes include the addition of an in-depth architecture overview, a complete documentation index, and improvements to the README.md to point to these new resources and module-level documentation. Major documentation enhancements: Documentation structure and discoverability: Added DOCUMENTATION_INDEX.md, providing a thorough, organized index of all documentation in the repository, including links and descriptions for user, developer, administrator, and API docs. Updated README.md to include direct links to the new architecture overview, development guide, contribution guide, and all major module and tool documentation, improving onboarding and navigation for new users. Architecture overview: Added a new ARCHITECTURE.md file with a detailed description of Spark’s architecture, including core components, execution model, key subsystems (memory, shuffle, storage, serialization), module structure, performance tuning, and contribution guidelines. These changes significantly improve the accessibility and clarity of Spark's documentation, making it easier for contributors and users to understand the project structure and find the resources they need. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
