[GitHub] spark issue #19103: [SPARK-21890] Credentials not being passed to add the to...

2017-09-07 Thread vanzin
Github user vanzin commented on the issue:

https://github.com/apache/spark/pull/19103
  
Looks good (since the master PR didn't merge). Merging to 2.2. @redsanket 
please close the PR manually.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #19103: [SPARK-21890] Credentials not being passed to add the to...

2017-09-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/19103
  
Merged build finished. Test PASSed.


---




[GitHub] spark issue #19103: [SPARK-21890] Credentials not being passed to add the to...

2017-09-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/19103
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/81522/


---




[GitHub] spark issue #19103: [SPARK-21890] Credentials not being passed to add the to...

2017-09-07 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/19103
  
**[Test build #81522 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/81522/testReport)** for PR 19103 at commit [`7043d98`](https://github.com/apache/spark/commit/7043d98ccb7f5cfc4e854f609afa3c380d274c36).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---




[GitHub] spark issue #19103: [SPARK-21890] Credentials not being passed to add the to...

2017-09-07 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/19103
  
**[Test build #81522 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/81522/testReport)** for PR 19103 at commit [`7043d98`](https://github.com/apache/spark/commit/7043d98ccb7f5cfc4e854f609afa3c380d274c36).


---




[GitHub] spark issue #19103: [SPARK-21890] Credentials not being passed to add the to...

2017-09-07 Thread tgravescs
Github user tgravescs commented on the issue:

https://github.com/apache/spark/pull/19103
  
Jenkins, test this please


---




[GitHub] spark issue #19103: [SPARK-21890] Credentials not being passed to add the to...

2017-09-05 Thread redsanket
Github user redsanket commented on the issue:

https://github.com/apache/spark/pull/19103
  
@vanzin @tgravescs sorry for the delay. I have put up a PR against master 
with just the workaround; we can move further discussion of the suggested 
improvements there. 
https://github.com/apache/spark/pull/19140


---




[GitHub] spark issue #19103: [SPARK-21890] Credentials not being passed to add the to...

2017-09-05 Thread tgravescs
Github user tgravescs commented on the issue:

https://github.com/apache/spark/pull/19103
  

Yes, the user stated he will be opening one for master, but that one will be 
quite a bit different because the credentials code has moved around, so I think 
this one will have to stay open anyway. But I agree we should definitely look 
at the master one first. 


---




[GitHub] spark issue #19103: [SPARK-21890] Credentials not being passed to add the to...

2017-09-05 Thread vanzin
Github user vanzin commented on the issue:

https://github.com/apache/spark/pull/19103
  
BTW Oozie can also disable the HDFS provider 
(`spark.yarn.security.credentials.hadoopfs.enabled=false`, I think). But it 
would be nice if Spark were able to do that by itself if the current UGI does 
not have a TGT (or, alternatively, had some way to disable all of the 
credential providers with a single setting). But that's for a separate PR.
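
As a rough illustration of the per-provider switch mentioned above, here is a minimal Scala sketch. It is a simplified model and not Spark's implementation: the `ProviderFlags` object and its methods are hypothetical, the key pattern is taken from the setting quoted above (which the comment itself hedges with "I think"), and treating a missing key as "enabled" is an assumption.

```scala
// Simplified model for illustration only; not Spark's actual code.
object ProviderFlags {
  // Key pattern copied from the setting quoted in the comment above.
  def enabledKey(service: String): String =
    s"spark.yarn.security.credentials.$service.enabled"

  // Assumption: a provider counts as enabled unless its flag is "false".
  def isEnabled(conf: Map[String, String], service: String): Boolean =
    conf.get(enabledKey(service)).forall(_.trim.toLowerCase != "false")

  // Keeps only the providers that have not been switched off.
  def enabledProviders(conf: Map[String, String], services: Seq[String]): Seq[String] =
    services.filter(isEnabled(conf, _))
}

object ProviderFlagsDemo {
  def main(args: Array[String]): Unit = {
    val conf = Map("spark.yarn.security.credentials.hadoopfs.enabled" -> "false")
    println(ProviderFlags.enabledProviders(conf, Seq("hadoopfs", "hive", "hbase")))
  }
}
```

With only the `hadoopfs` flag set to `false`, the Hive and HBase entries remain in the list, which is why a launcher without a TGT would have to switch each provider off separately under this model.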

This one looks ok but it really needs to be opened against master instead.


---




[GitHub] spark issue #19103: [SPARK-21890] Credentials not being passed to add the to...

2017-09-05 Thread tgravescs
Github user tgravescs commented on the issue:

https://github.com/apache/spark/pull/19103
  
Hive and HBase token fetching can be turned off (i.e. 
`spark.yarn.security.tokens.hive.enabled=false`). I thought they didn't work the 
same way as HDFS with respect to not fetching a token if you already have one, 
but I would need to check. You tell Oozie to get those before launching via the 
Oozie credentials configuration.


---




[GitHub] spark issue #19103: [SPARK-21890] Credentials not being passed to add the to...

2017-09-01 Thread vanzin
Github user vanzin commented on the issue:

https://github.com/apache/spark/pull/19103
  
I don't know; perhaps they'll fail, which is why I think the correct 
behavior would be to skip this credential manager code altogether if a TGT 
doesn't exist.

But that would at least be the same behavior as Spark 2.1, while the 
behavior in the HDFS provider has definitely changed.
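
The "skip if no TGT" idea can be sketched as a small Scala model. Everything here is hypothetical: `LaunchUser` and `TgtGate` are illustrative names, and the `hasTgt` flag stands in for a real check such as Hadoop's `UserGroupInformation.hasKerberosCredentials`. It is not what this PR implements.

```scala
// Hypothetical model; `hasTgt` stands in for a real Kerberos check.
final case class LaunchUser(hasTgt: Boolean)

object TgtGate {
  // Returns the services for which new tokens would be requested.
  // Without a TGT nothing is requested, because tokens cannot be
  // obtained using only other tokens.
  def servicesToFetch(user: LaunchUser, providers: Seq[String]): Seq[String] =
    if (user.hasTgt) providers else Seq.empty
}
```

Under this model a token-only launcher such as the Oozie one would bypass every provider (HDFS, Hive, HBase) in one place, rather than relying on each provider to notice existing tokens.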


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---




[GitHub] spark issue #19103: [SPARK-21890] Credentials not being passed to add the to...

2017-09-01 Thread jerryshao
Github user jerryshao commented on the issue:

https://github.com/apache/spark/pull/19103
  
@vanzin From my understanding, this seems to be a workaround to avoid 
issuing new HDFS tokens (since the user's credentials already hold HDFS 
tokens). But how do we handle HBase/Hive without a TGT?


---



[GitHub] spark issue #19103: [SPARK-21890] Credentials not being passed to add the to...

2017-09-01 Thread vanzin
Github user vanzin commented on the issue:

https://github.com/apache/spark/pull/19103
  
That's when using principal / keytab and generating new tokens; it's 
separate from the code path being changed here. The initial tokens are obtained 
in `Client.scala` with the current user's credentials.


---



[GitHub] spark issue #19103: [SPARK-21890] Credentials not being passed to add the to...

2017-09-01 Thread jerryshao
Github user jerryshao commented on the issue:

https://github.com/apache/spark/pull/19103
  
@tgravescs, I think it is in `AMCredentialRenewer` that we explicitly create a 
new `Credentials` object every time we issue new tokens.

```
// HACK:
// HDFS will not issue new delegation tokens, if the Credentials object
// passed in already has tokens for that FS even if the tokens are expired (it really only
// checks if there are tokens for the service, and not if they are valid). So the only real
// way to get new tokens is to make sure a different Credentials object is used each time to
// get new tokens and then the new tokens are copied over the current user's Credentials.
// So:
// - we login as a different user and get the UGI
// - use that UGI to get the tokens (see doAs block below)
// - copy the tokens over to the current user's credentials (this will overwrite the tokens
// in the current user's Credentials object for this FS).
```
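
The behavior that comment describes can be modeled with a small self-contained Scala sketch. The `Token`, `Creds`, `ToyFs` and `TokenDemo` names are toy stand-ins invented for illustration; they are not the Hadoop classes, and the sketch only mirrors the "presence, not validity" check as the comment states it.

```scala
import scala.collection.mutable

// Toy stand-ins for illustration; these are NOT the Hadoop classes.
final case class Token(service: String, id: Int)

final class Creds {
  private val tokens = mutable.Map.empty[String, Token]
  def get(service: String): Option[Token] = tokens.get(service)
  def put(t: Token): Unit = tokens(t.service) = t
  def all: Seq[Token] = tokens.values.toSeq
  def putAll(other: Creds): Unit = other.all.foreach(put)
}

final class ToyFs(val service: String) {
  private var nextId = 0
  // As described in the comment: only the presence of a token for the
  // service is checked, never whether that token is still valid.
  def addDelegationTokens(creds: Creds): Option[Token] =
    if (creds.get(service).isDefined) None
    else {
      nextId += 1
      val t = Token(service, nextId)
      creds.put(t)
      Some(t)
    }
}

object TokenDemo {
  // Renewal path: use a fresh credentials object, then copy the result over.
  def renew(fs: ToyFs, current: Creds): Unit = {
    val fresh = new Creds
    fs.addDelegationTokens(fresh)
    current.putAll(fresh) // overwrites the existing token for this FS
  }
}
```

In this model, calling `addDelegationTokens` with credentials that already hold a token for the service returns `None` and requests nothing. That is the property the Oozie case in this thread depends on: if the already-populated credentials are the ones handed to the token fetch, no new token is requested.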



---



[GitHub] spark issue #19103: [SPARK-21890] Credentials not being passed to add the to...

2017-09-01 Thread vanzin
Github user vanzin commented on the issue:

https://github.com/apache/spark/pull/19103
  
In general it feels like this code shouldn't even be running if the current 
user doesn't have a TGT to start with.

But this patch restores the behavior from Spark 2.1, so if the PR is opened 
against master it should be ok to merge.


---



[GitHub] spark issue #19103: [SPARK-21890] Credentials not being passed to add the to...

2017-09-01 Thread jerryshao
Github user jerryshao commented on the issue:

https://github.com/apache/spark/pull/19103
  
>Oozie client gets the necessary tokens the application needs before 
launching. It passes those tokens along to the oozie launcher job (MR job) 
which will then actually call the Spark client to launch the spark app and pass 
the tokens along.

>The oozie launcher job cannot get anymore tokens because all it has is 
tokens ( you can't get tokens with tokens, you need tgt or keytab).

>The error here is because the launcher job runs the Spark Client to submit 
the spark job but the spark client doesn't see that it already has the hdfs 
tokens so it tries to get more, which ends with the exception.

So the problem is that Oozie gets the tokens for Spark instead of letting 
Spark do it itself, and in the Oozie launcher we should not let Spark's 
`Yarn#client` fetch tokens on its own, since a TGT might not be available there.

From my understanding of your issue, this seems like a more general problem 
with how the Oozie launcher and Spark's token management interact. The patch 
appears to address only the HDFS case; how do we handle Hive/HBase? It looks 
like those would still have issues.

 


---



[GitHub] spark issue #19103: [SPARK-21890] Credentials not being passed to add the to...

2017-09-01 Thread vanzin
Github user vanzin commented on the issue:

https://github.com/apache/spark/pull/19103
  
This needs to be opened against master first.


---



[GitHub] spark issue #19103: [SPARK-21890] Credentials not being passed to add the to...

2017-09-01 Thread gatorsmile
Github user gatorsmile commented on the issue:

https://github.com/apache/spark/pull/19103
  
cc @vanzin @mgummelt 


---