Transurgeon opened a new pull request, #37066: URL: https://github.com/apache/spark/pull/37066
Adding python packages to README and requirements.txt to fix issues with building python documentation There are currently two ways to install python packages for building python documentation: 1. running sudo pip install from the README.md ```sh $ sudo pip install 'sphinx<3.1.0' mkdocs numpy pydata_sphinx_theme ipython nbsphinx numpydoc sphinx-plotly-directive 'jinja2<3.0.0' ``` 2. running the requirements.txt file, from $SPARK_HOME ```sh pip install -r dev/requirements.txt ``` ### What changes were proposed in this pull request? 1. Adding 'markupsafe==2.0.1' to both /docs/README.md and /dev/requirements.txt 2. Syncing both files to contain the same packages 3. Adding pyarrow to /docs/README.md ### Why are the changes needed? 1. markupsafe==2.1.1 would run into errors when building python documentation (using 'make html')  [Similar stack overflow issue](https://stackoverflow.com/questions/72191560/importerror-cannot-import-name-soft-unicode-from-markupsafe) Downgrading to markupsafe==2.0.1 fixes missing 'soft_unicode' module issues 2. Adding missing packages from requirements.txt (Under Documentation-Python) to README.md, this fixes some additional bugs when trying to build docs from the README command. 3. Finally, running the README command runs into one last error:  This is because 'pyarrow' module has been added to requirements.txt (Under PySpark Optional Dependencies) but is still missing from the README command ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Documentation changes only. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
