dbtsai commented on a change in pull request #28788:
URL: https://github.com/apache/spark/pull/28788#discussion_r439088312



##########
File path: docs/running-on-yarn.md
##########
@@ -82,6 +82,19 @@ In `cluster` mode, the driver runs on a different machine 
than the client, so `S
 
 Running Spark on YARN requires a binary distribution of Spark which is built 
with YARN support.
 Binary distributions can be downloaded from the [downloads 
page](https://spark.apache.org/downloads.html) of the project website.
+There are two variants of Spark binary distributions you can download. One is 
pre-built with a certain
+version of Apache Hadoop; this Spark distribution contains built-in Hadoop 
runtime, so we call it <code>with-hadoop</code> Spark
+distribution. The other one is pre-built with user-provided Hadoop; since this 
Spark distribution
+doesn't contain built-in Hadoop runtime, so users have to provide a Hadoop 
installation separately.

Review comment:
       Fixed. Thanks!




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to