Looks like we're using authentication on our slaves. So you either need to
pass

-jnlpCredentials user:pass

on the command line, or change around the permissions in Jenkins to allow
anonymous users to connect/run jobs.

I'm not sure if it would make sense or not to add the user/pass in the
Jenkins plugin configuration screen or if it should be fetched another way.




On Thu, Nov 7, 2013 at 2:52 PM, Vinod Kone <vinodk...@gmail.com> wrote:

> Great. Let us know once you figure it out. Maybe I can add a FAQ to the
> plugin's README to help others (or you can contribute too :)).
>
>
> On Thu, Nov 7, 2013 at 11:40 AM, Whitney Sorenson 
> <wsoren...@hubspot.com>wrote:
>
>> I added the jenkins user on the slave - this was the missing piece. I'll
>> add this to my PR for the readme. Got much further now; now I'm getting a
>> 403 on the fetch:
>>
>> /jenkins/computer/mesos-jenkins-6f4719c8-1c61-4b28-b5ab-ba298e846840/slave-agent.jnlp:
>> 403 Forbidden at
>> hudson.remoting.Launcher.parseJnlpArguments(Launcher.java:261) at
>> hudson.remoting.Launcher.run(Launcher.java:215)
>>
>> and corresponding log on jenkins master:
>>
>> Nov 7, 2013 2:38:39 PM winstone.Logger logInternal INFO: While serving
>> http://localhost:8080/jenkins/computer/mesos-jenkins-6f4719c8-1c61-4b28-b5ab-ba298e846840/slave-agent.jnlp:
>> hudson.security.AccessDeniedException2: anonymous is missing the
>> Slave/Connect permission
>>
>> Going to look into what this means.
>>
>>
>>
>> On Thu, Nov 7, 2013 at 2:21 PM, Vinod Kone <vinodk...@gmail.com> wrote:
>>
>>> I looked at the code and it looks there are few places the executor
>>> might fail before it fetches the URI. Most of them have to do with
>>> incorrect permissions. The code was written to have any errors reported
>>> either in slave log or console or executor logs (there might be a bug here
>>> if we are in fact swallowing errors). IIUC, the executor log directory is
>>> empty in your case which suggests the executor died before it could even
>>> create "stdout" or "stderr" files in its sandbox (Is this true?).
>>>
>>> Couple of questions:
>>>
>>> What user is Jenkins master running as? Is that user known to the host
>>> on which mesos slave is running?
>>>
>>> How are you starting the mesos slave (e.g., cmd line flags)?
>>>
>>>
>>>
>>> On Thu, Nov 7, 2013 at 11:00 AM, Whitney Sorenson <wsoren...@hubspot.com
>>> > wrote:
>>>
>>>> The gist was compiled from that log. Here is the complete log from
>>>> toggling the jenkins plugin on / off (you see the ping statements
>>>> inbetween):
>>>>
>>>> https://gist.github.com/wsorenson/8bf64e44fd42da354fa0
>>>>
>>>>
>>>>
>>>>
>>>> On Thu, Nov 7, 2013 at 1:57 PM, Vinod Kone <vinodk...@gmail.com> wrote:
>>>>
>>>>> What does mesos-slave.err say?
>>>>>
>>>>>
>>>>> On Thu, Nov 7, 2013 at 10:49 AM, Whitney Sorenson <
>>>>> wsoren...@hubspot.com> wrote:
>>>>>
>>>>>> Hi Vinod,
>>>>>>
>>>>>> It's 0.14.0-rc4 in both.
>>>>>>
>>>>>> I believe we have logging working:
>>>>>>
>>>>>> -rw-r--r-- 1 root root         0 Oct 22 23:48 mesos-slave.out
>>>>>> lrwxrwxrwx 1 root root        63 Oct 22 23:48 mesos-slave.INFO ->
>>>>>> mesos-slave.carousel.invalid-user.log.INFO.20131022-234823.5797
>>>>>> lrwxrwxrwx 1 root root        66 Oct 22 23:49 mesos-slave.WARNING ->
>>>>>> mesos-slave.carousel.invalid-user.log.WARNING.20131022-234954.5797
>>>>>> drwxr-xr-x 2 root root      4096 Oct 22 23:49 .
>>>>>> -rw-rw-r-- 1 root root      4827 Nov  1 20:34
>>>>>> mesos-slave.carousel.invalid-user.log.WARNING.20131022-234954.5797
>>>>>> -rw-rw-r-- 1 root root  10408140 Nov  7 18:44
>>>>>> mesos-slave.carousel.invalid-user.log.INFO.20131022-234823.5797
>>>>>> -rw-r--r-- 1 root root  53759705 Nov  7 18:45 mesos-slave.err
>>>>>>
>>>>>> Is there something else to check? Is it possible the executor is
>>>>>> failing before it even attempts to fetch URIs?
>>>>>>
>>>>>> Ray - Thanks - yeah I found the jenkins logs. I was able to wget the
>>>>>> slave.jar, and even run it. The mesos-jenkins slaves are dead now, so I
>>>>>> can't connect to their slave-agent - but the jar does run. Not sure if 
>>>>>> the
>>>>>> window for trying to connect to one of the mesos launched slaves is long
>>>>>> enough to try before it is terminated due to failures. Interestingly, 
>>>>>> when
>>>>>> I try to connect to one of the existing slaves I get a 403.
>>>>>>
>>>>>> -Whitney
>>>>>>
>>>>>>
>>>>>>
>>>>>> On Thu, Nov 7, 2013 at 1:34 PM, Vinod Kone <vinodk...@gmail.com>wrote:
>>>>>>
>>>>>>> Hey Whitney,
>>>>>>>
>>>>>>> What version of mesos are you using (both in the cluster and the
>>>>>>> plugin)?
>>>>>>>
>>>>>>> The slave should print stuff to console when it is launching
>>>>>>> executor (e.g., "Fetching resources..."). I don't see that in the gist 
>>>>>>> you
>>>>>>> pasted. Are you capturing stdout/stderr of the slave?
>>>>>>>
>>>>>>>
>>>>>>> On Thu, Nov 7, 2013 at 10:30 AM, Whitney Sorenson <
>>>>>>> wsoren...@hubspot.com> wrote:
>>>>>>>
>>>>>>>> Thanks Ray.
>>>>>>>>
>>>>>>>> I have very similar issue (empty executor directories) - but don't
>>>>>>>> have any issues curling the slave.jar URI - and I don't have any 
>>>>>>>> existing
>>>>>>>> JNLP process running. I don't have a jenkins user - is that the only 
>>>>>>>> setup
>>>>>>>> you did on the slave?
>>>>>>>>
>>>>>>>> -Whitney
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> On Thu, Nov 7, 2013 at 1:16 PM, Ray Rodriguez <rayrod2...@gmail.com
>>>>>>>> > wrote:
>>>>>>>>
>>>>>>>>> Hi Whitney I would have a look at this github issue where I work
>>>>>>>>> through some of my jenkins mesos-plugin issues with Vinod.  Might be 
>>>>>>>>> some
>>>>>>>>> of the same issues you are seeing.
>>>>>>>>> https://github.com/jenkinsci/mesos-plugin/issues/2
>>>>>>>>>
>>>>>>>>> Ray
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> On Thu, Nov 7, 2013 at 1:07 PM, Whitney Sorenson <
>>>>>>>>> wsoren...@hubspot.com> wrote:
>>>>>>>>>
>>>>>>>>>> Hi all!
>>>>>>>>>>
>>>>>>>>>> I am trying to get the Jenkins Mesos plugin functioning. I was
>>>>>>>>>> able to get it installed on our Jenkins master.
>>>>>>>>>>
>>>>>>>>>> However, it's unclear if there are any required steps for setting
>>>>>>>>>> up the slaves. When a framework task is launched, it fails instantly 
>>>>>>>>>> and
>>>>>>>>>> there are no logs in the runs folder.
>>>>>>>>>>
>>>>>>>>>> Here's a gist with relevant logs from the slave:
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> https://gist.github.com/wsorenson/b3562c3e4a8992f9a46f/raw/ea5821c442d826456291330452208d8d7ac8418f/failing+jenkins+logs
>>>>>>>>>>
>>>>>>>>>> Any help on how to debug? At first, I thought maybe we needed
>>>>>>>>>> slave.jar or something but it looks like it's trying to fetch that 
>>>>>>>>>> from the
>>>>>>>>>> master using the URIs. To clarify, I have done no special jenkins 
>>>>>>>>>> related
>>>>>>>>>> setup (as per readme.md) on any of the slaves.
>>>>>>>>>>
>>>>>>>>>> -Whitney
>>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>
>>>>>
>>>>
>>>
>>
>

Reply via email to