The error that I am getting is simple. On slave in short (for each
executor):
===============================================
336-I1031 13:40:04.390418  5933 slave.cpp:2066] Monitoring executor default
of framework 201310311338-16842879-5050-5922-0001 forked at pid 6263
337-I1031 13:40:05.020405  5933 process_isolator.cpp:479] Telling slave of
terminated executor 'default' of framework
201310311338-16842879-5050-5922-0001
338-I1031 13:40:05.024759  5931 slave.cpp:2122] Executor 'default' of
framework 201310311338-16842879-5050-5922-0001 has exited with status 127
339:I1031 13:40:05.025979  5931 slave.cpp:1737] Handling status update
TASK_LOST (UUID: 85bc8a24-9f9c-46ce-b644-e7e0f6844d52) for task 7 of
framework 201310311338-16842879-5050-5922-0001 from @0.0.0.0:0
340:I1031 13:40:05.026260  5931 status_update_manager.cpp:305] Received
status update TASK_LOST (UUID: 85bc8a24-9f9c-46ce-b644-e7e0f6844d52) for
task 7 of framework 201310311338-16842879-5050-5922-0001
341:I1031 13:40:05.026358  5931 status_update_manager.cpp:356] Forwarding
status update TASK_LOST (UUID: 85bc8a24-9f9c-46ce-b644-e7e0f6844d52) for
task 7 of framework 201310311338-16842879-5050-5922-0001 to
[email protected]:5050
342-I1031 13:40:05.027835  5932 status_update_manager.cpp:380] Received
status update acknowledgement (UUID: 85bc8a24-9f9c-46ce-b644-e7e0f6844d52)
for task 7 of framework 201310311338-16842879-5050-5922-0001
343-I1031 13:40:05.028039  5931 slave.cpp:2257] Cleaning up executor
'default' of framework 201310311338-16842879-5050-5922-0001
344-I1031 13:40:05.028210  5932 gc.cpp:56] Scheduling
'/tmp/mesos/slaves/201310311323-16842879-5050-5037-0/frameworks/201310311338-16842879-5050-5922-0001/executors/default/runs/0c0cb61f-b119-4450-afaf-f547ac6d04bc'
for gc 6.99999967427259days in the future
========================================================
on mesos master for each executor:
========================================================
246-I1031 13:40:05.387863  5925 master.cpp:2118] Launching task 8 of
framework 201310311338-16842879-5050-5922-0001 with resources cpus(*):1;
mem(*):128 on slave 201310311323-16842879-5050-5037-0 (iStrader)
247-I1031 13:40:05.388151  5924 hierarchical_allocator_process.hpp:590]
Framework 201310311338-16842879-5050-5922-0001 filtered slave
201310311323-16842879-5050-5037-0 for 1secs
248-I1031 13:40:06.026221  5925 master.cpp:1476] Executor default of
framework 201310311338-16842879-5050-5922-0001 on slave
201310311323-16842879-5050-5037-0 (iStrader) exited with status 32512
249:I1031 13:40:06.026422  5926 master.cpp:1425] Status update TASK_LOST
(UUID: 35aed42c-810c-4465-9968-13a2eb66cb35) for task 8 of framework
201310311338-16842879-5050-5922-0001 from slave(1)@127.0.1.1:5051
250-I1031 13:40:06.026495  5926 master.hpp:403] Removing task 8 with
resources cpus(*):1; mem(*):128 on slave 201310311323-16842879-5050-5037-0
(iStrader)
251-I1031 13:40:06.026657  5926 hierarchical_allocator_process.hpp:637]
Recovered cpus(*):1; mem(*):128 (total allocatable: cpus(*):4; mem(*):2906;
disk(*):80784; ports(*):[31000-32000]) on slave
201310311323-16842879-5050-5037-0 from framework
201310311338-16842879-5050-5922-0001
252-2013-10-31 
13:40:06,126:5922(0x7f84732fd700):ZOO_DEBUG@zookeeper_process@1983:
Got ping response in 0 ms
=============================================================================

I don't get it why it is 127.0.1.1 and not 127.0.0.1. Weird isn't it? I am
looking inside master.cpp but I cannot find the reason.


ALSO, I am starting my master and slave in a single machine like this:
mesos-slave --master=zk://localhost:2181/mesos 2> ~/mesos-slave
mesos-master --zk=zk://localhost:2181/mesos 2> ~/mesos-master

see, I am just using localhost not the 127.0.0.1.


Best Regards
Mohamad Rezaei
-------------------
Researcher at PDC
KTH Royal Institute of Technology


On Wed, Oct 30, 2013 at 7:05 PM, Mohamad Rezaei <[email protected]> wrote:

> OK,
>
> I can see them myself, but I will send the log tomorrow. I just came back
> from university, and they started elictricity/fire tests! So no electricity
> until tomorrow. ;-)
>
>
> Best Regards
> Mohamad Rezaei
> -------------------
> Researcher at PDC
> KTH Royal Institute of Technology
>
>
> On Wed, Oct 30, 2013 at 6:13 PM, Vinod Kone <[email protected]> wrote:
>
>> I don't see the attachments. Did you forget to attach them? Or maybe the
>> Apache mail servers strip attachments?
>>
>> Can you explain a bit more on what your setup is and what are the exact
>> steps (commands) you used to repro the problem? Log outputs of
>> slave/master/framework would also make it easier to diagnose.
>>
>>
>> On Wed, Oct 30, 2013 at 9:11 AM, Mohamad Rezaei <[email protected]> wrote:
>>
>> > Hi all,
>> >
>> > I have moved the example java files to another location, and changed the
>> > name of all the classes but with the same internal code as the default
>> > examples. Also I have changed the two "test-executor" and
>> "test-framework"
>> > files accordingly so the can run the classes that I am mentioning. But I
>> > get task lost on slaves since I think slaves cannot find the
>> corresponding
>> > executors. I am not sure actually if this is the problem or not. I have
>> > attached the files in here.
>> >
>> > Did I have to do something else?
>> >
>> > Best Regards
>> > Mohamad Rezaei
>> > -------------------
>> > Researcher at PDC
>> > KTH Royal Institute of Technology
>> >
>>
>
>

Reply via email to