Github user vanzin commented on a diff in the pull request:

    https://github.com/apache/spark/pull/4688#discussion_r28728492
  
    --- Diff: docs/security.md ---
    @@ -31,6 +31,7 @@ SSL must be configured on each node and configured for 
each component involved i
     
     ### YARN mode
     The key-store can be prepared on the client side and then distributed and 
used by the executors as the part of the application. It is possible because 
the user is able to deploy files before the application is started in YARN by 
using `spark.yarn.dist.files` or `spark.yarn.dist.archives` configuration 
settings. The responsibility for encryption of transferring these files is on 
YARN side and has nothing to do with Spark.
    +For long-running apps like Spark Streaming apps be able to write to HDFS, 
it is possible to pass a principal and keytab to `spark-submit` via the 
`--principal` and `--keytab` parameters respectively. The keytab passed in will 
be copied over to the machine running the Application Master securely via the 
Hadoop Distributed Cache. The Kerberos login will be periodically renewed using 
this principal and keytab and the delegation tokens required for HDFS will be 
generated periodically so the application can continue writing to HDFS.
    --- End diff --
    
    "...apps like Spark Streaming to be able..."


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to