tgravescs commented on a change in pull request #31761:
URL: https://github.com/apache/spark/pull/31761#discussion_r594343928



##########
File path: docs/security.md
##########
@@ -838,6 +838,17 @@ The following options provides finer-grained control for 
this feature:
   </td>
   <td>3.0.0</td>
 </tr>
+<tr>
+  <td><code>spark.kerberos.renewal.exclude.hadoopFileSystems</code></td>
+  <td>(none)</td>
+  <td>
+    A comma-separated list of Hadoop filesystems for whose hosts will be 
excluded from from delegation
+    token renewal at resource scheduler. For example, 
<code>spark.kerberos.renewal.exclude.hadoopFileSystems=hdfs://nn1.com:8032,
+    hdfs://nn2.com:8032</code>. This is known to work under YARN for now, so 
YARN Resource Manager won't renew tokens for the application.
+    Note that as resource scheduler does not renew token, the application 
might not be long running once the token expires.

Review comment:
       The workload can definitely continue until it tries to access that 
filesystem. There is no reason it couldn't fully succeed, depending on where 
it's writing data or whether it writes data at all.
   We could definitely clarify this, though. Perhaps something more like:
   , so any application running longer than the original token expiration that 
tries to use that token will fail.






