Transurgeon opened a new pull request, #37066:
URL: https://github.com/apache/spark/pull/37066

   Adding Python packages to README and requirements.txt to fix issues with building the Python documentation
   
   There are currently two ways to install python packages for building python 
documentation:
   1. running sudo pip install from the README.md
   ```sh
   $ sudo pip install 'sphinx<3.1.0' mkdocs numpy pydata_sphinx_theme ipython nbsphinx numpydoc sphinx-plotly-directive 'jinja2<3.0.0'
   ```
   2. installing from the requirements.txt file, run from $SPARK_HOME
   ```sh
   pip install -r dev/requirements.txt
   ```
   ### What changes were proposed in this pull request?
   1. Adding 'markupsafe==2.0.1' to both /docs/README.md and /dev/requirements.txt
   2. Syncing both files to contain the same packages
   3. Adding pyarrow to /docs/README.md
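
One way to check that the two files stay in sync is to compare the package names in the README command against those in the requirements file. The following is a minimal sketch, not part of this PR: the helper names `package_names` and `readme_packages` are hypothetical, and `REQUIREMENTS_EXCERPT` is an illustrative excerpt rather than the real contents of dev/requirements.txt.

```python
import re
import shlex

# The README command as it stands before this PR.
README_COMMAND = (
    "sudo pip install 'sphinx<3.1.0' mkdocs numpy pydata_sphinx_theme ipython "
    "nbsphinx numpydoc sphinx-plotly-directive 'jinja2<3.0.0'"
)

# Hypothetical excerpt for illustration; the real file contains more entries.
REQUIREMENTS_EXCERPT = """
# illustrative only
markupsafe==2.0.1
pyarrow
jinja2<3.0.0
"""


def package_names(specs):
    """Return normalized package names from an iterable of requirement specifiers."""
    names = set()
    for spec in specs:
        # Drop trailing comments, surrounding whitespace, and shell quotes.
        spec = spec.split("#", 1)[0].strip().strip("'\"")
        if not spec or spec.startswith("-"):
            continue  # skip blank lines and pip options such as "-r other.txt"
        # The name is the leading run of characters before any version operator.
        match = re.match(r"[A-Za-z0-9][A-Za-z0-9._-]*", spec)
        if match:
            # Treat "-", "_" and "." as equivalent and ignore case.
            names.add(re.sub(r"[-_.]+", "-", match.group(0)).lower())
    return names


def readme_packages(command):
    """Return the package names that follow 'install' in a pip command line."""
    tokens = shlex.split(command)
    return package_names(tokens[tokens.index("install") + 1:])


missing = package_names(REQUIREMENTS_EXCERPT.splitlines()) - readme_packages(README_COMMAND)
print(sorted(missing))  # ['markupsafe', 'pyarrow']
```

With the excerpt above, the two names reported as missing are the ones this PR adds to the README command.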
   
   
   ### Why are the changes needed?
   1. With markupsafe==2.1.1, building the Python documentation (using 'make html') fails with the following error:
   
![image](https://user-images.githubusercontent.com/89562186/177061721-4caca44e-1b39-4458-906e-9669c426fc68.png)
   [Similar Stack Overflow issue](https://stackoverflow.com/questions/72191560/importerror-cannot-import-name-soft-unicode-from-markupsafe)
   Downgrading to markupsafe==2.0.1 fixes the ImportError for the missing 'soft_unicode' name
   2. Adding the packages listed in requirements.txt (under Documentation-Python) that are missing from README.md. This fixes some additional errors when building the docs with the README command.
   3. Finally, running the README command runs into one last error:
   
![image](https://user-images.githubusercontent.com/89562186/177062155-0e6432c1-16be-4edd-8430-737376828a6a.png)
   This is because the 'pyarrow' module has been added to requirements.txt (under PySpark Optional Dependencies) but is still missing from the README command.
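
To tell whether an environment is affected by the markupsafe issue before running 'make html', the installed package can be probed for the 'soft_unicode' name. This is a rough sketch under the assumption that the failure is the ImportError described in the linked Stack Overflow question; `soft_unicode_available` is a hypothetical helper, not something the PR adds.

```python
import importlib


def soft_unicode_available():
    """Report whether the installed markupsafe still exposes soft_unicode.

    Returns True or False when markupsafe is importable, and None when it
    is not installed at all.
    """
    try:
        markupsafe = importlib.import_module("markupsafe")
    except ImportError:
        return None
    # The name is present in markupsafe 2.0.1 and absent in 2.1.1.
    return hasattr(markupsafe, "soft_unicode")


status = soft_unicode_available()
if status is None:
    print("markupsafe is not installed")
elif status:
    print("soft_unicode is available")
else:
    print("soft_unicode is missing; consider pinning markupsafe==2.0.1")
```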
   
   ### Does this PR introduce _any_ user-facing change?
   No.
   
   
   ### How was this patch tested?
   Documentation changes only.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

