nchammas commented on a change in pull request #27376: [SPARK-30665][PYSPARK] Remove Pandoc dependency in PySpark setup.py
URL: https://github.com/apache/spark/pull/27376#discussion_r372550656
##########
File path: docs/README.md
##########

    @@ -38,7 +38,7 @@ installed. Also install the following libraries:
     ```sh
     $ sudo gem install jekyll jekyll-redirect-from rouge
     # Following is needed only for generating API docs
    -$ sudo pip install sphinx pypandoc mkdocs
    +$ sudo pip install sphinx mkdocs numpy

Review comment:
Without `numpy` I'm getting these errors when I try to build the PySpark ML API docs:

```
$ SKIP_SCALADOC=1 SKIP_RDOC=1 SKIP_SQLDOC=1 jekyll serve
Configuration file: .../spark/docs/_config.yml
Moving to python/docs directory and building sphinx.
sphinx-build -b html -d _build/doctrees . _build/html
Running Sphinx v2.3.1
loading pickled environment... done
building [mo]: targets for 0 po files that are out of date
building [html]: targets for 0 source files that are out of date
updating environment: 0 added, 2 changed, 0 removed
reading sources... [100%] pyspark.mllib
WARNING: autodoc: failed to import module 'ml' from module 'pyspark'; the following exception was raised:
No module named 'numpy'
WARNING: autodoc: failed to import module 'ml.param' from module 'pyspark'; the following exception was raised:
No module named 'numpy'
...
```
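
For anyone who wants to check their environment before running the docs build, here is a rough sketch of a preflight check. It is not part of this PR. The module list is an assumption taken from the `pip install` line in the diff above, and `missing_modules` is a hypothetical helper name. It only asks Python whether each module can be found, which is the same condition autodoc trips over when it tries to import `pyspark.ml`.

```python
import importlib.util

# Assumption: top-level modules the API docs build needs, copied from the
# `pip install sphinx mkdocs numpy` line in the diff (not an official list).
DOC_BUILD_MODULES = ["sphinx", "mkdocs", "numpy"]


def missing_modules(names):
    """Return the top-level module names that Python cannot locate."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(DOC_BUILD_MODULES)
    if missing:
        print("Missing modules: " + ", ".join(missing))
    else:
        print("All listed doc build modules can be found.")
```

Run it with the same interpreter that `sphinx-build` uses. A different Python on the `PATH` can have a different set of installed packages.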
