[ 
https://issues.apache.org/jira/browse/MESOS-479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13664180#comment-13664180
 ] 

Benjamin Hindman commented on MESOS-479:
----------------------------------------

The test also hung on failure here:

Thread 1 (Thread 0x7fc8364b5720 (LWP 36613)):
#0  0x00007fc8349daee9 in pthread_cond_wait@@GLIBC_2.3.2 () from 
/lib64/libpthread.so.0
#1  0x00007fc835223c9b in arrive (this=0x292e450, pid=<value optimized out>) at 
../../../third_party/libprocess/src/gate.hpp:73
#2  process::ProcessManager::wait (this=0x292e450, pid=<value optimized out>) 
at ../../../third_party/libprocess/src/process.cpp:2669
#3  0x00007fc835233545 in process::wait (pid=..., duration=...) at 
../../../third_party/libprocess/src/process.cpp:3109
#4  0x00007fc834ef091e in wait (this=0x7fff81049190, __in_chrg=<value optimized 
out>) at ../../third_party/libprocess/include/process/process.hpp:357
#5  mesos::MesosSchedulerDriver::~MesosSchedulerDriver (this=0x7fff81049190, 
__in_chrg=<value optimized out>) at ../../src/sched/sched.cpp:757
#6  0x00000000005d8f23 in 
SlaveRecoveryTest_CleanupExecutor_Test<mesos::internal::slave::ProcessIsolator>::TestBody
 (this=<value optimized out>)
    at ../../src/tests/slave_recovery_tests.cpp:771
#7  0x00000000008646eb in 
testing::internal::HandleExceptionsInMethodIfSupported<testing::Test, void> 
(object=0x7fc8240055b0, 
    method=&virtual testing::Test::TestBody(), location=0x92fd37 "the test 
body") at gmock-1.6.0/gtest/src/gtest.cc:2126
#8  0x0000000000858579 in testing::Test::Run (this=0x7fc8240055b0) at 
gmock-1.6.0/gtest/src/gtest.cc:2162
#9  0x00000000008586b5 in testing::TestInfo::Run (this=0x2926f30) at 
gmock-1.6.0/gtest/src/gtest.cc:2338
#10 0x000000000085876d in testing::TestCase::Run (this=0x2925be0) at 
gmock-1.6.0/gtest/src/gtest.cc:2445
#11 0x000000000085b94d in testing::internal::UnitTestImpl::RunAllTests 
(this=0x2907f30) at gmock-1.6.0/gtest/src/gtest.cc:4237
#12 0x000000000085c345 in 
testing::internal::HandleSehExceptionsInMethodIfSupported<testing::internal::UnitTestImpl,
 bool> (object=<value optimized out>, 
    method=(bool 
(testing::internal::UnitTestImpl::*)(testing::internal::UnitTestImpl *)) 
0x85b6d0 <testing::internal::UnitTestImpl::RunAllTests()>, 
    location=0xffffffffffffffff <Address 0xffffffffffffffff out of bounds>) at 
gmock-1.6.0/gtest/src/gtest.cc:2090
#13 0x000000000086426e in 
testing::internal::HandleExceptionsInMethodIfSupported<testing::internal::UnitTestImpl,
 bool> (object=0x2907f30, method=
    (bool (testing::internal::UnitTestImpl::*)(testing::internal::UnitTestImpl 
*)) 0x85b6d0 <testing::internal::UnitTestImpl::RunAllTests()>, 
---Type <return> to continue, or q <return> to quit---
    location=0x9319c0 "auxiliary test code (environments or event listeners)") 
at gmock-1.6.0/gtest/src/gtest.cc:2126
#14 0x0000000000857afa in testing::UnitTest::Run (this=<value optimized out>) 
at gmock-1.6.0/gtest/src/gtest.cc:3874
#15 0x000000000048c3a2 in main (argc=1, argv=0x7fc82c002020) at 
../../src/tests/main.cpp:109
                
> SlaveRecoveryTest/0.CleanupExecutor failure.
> --------------------------------------------
>
>                 Key: MESOS-479
>                 URL: https://issues.apache.org/jira/browse/MESOS-479
>             Project: Mesos
>          Issue Type: Bug
>            Reporter: Benjamin Hindman
>
> [ RUN      ] SlaveRecoveryTest/0.CleanupExecutor
> Checkpointing SlaveInfo to 
> '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/meta/slaves/201305220702-1828659978-49643-36613-0/slave.info'
> Checkpointing FrameworkInfo to 
> '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/meta/slaves/201305220702-1828659978-49643-36613-0/frameworks/201305220702-1828659978-49643-36613-0000/framework.info'
> Checkpointing 'scheduler(84)@10.35.255.108:49643' to 
> '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/meta/slaves/201305220702-1828659978-49643-36613-0/frameworks/201305220702-1828659978-49643-36613-0000/framework.pid'
> Checkpointing ExecutorInfo to 
> '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/meta/slaves/201305220702-1828659978-49643-36613-0/frameworks/201305220702-1828659978-49643-36613-0000/executors/07360cb3-7b42-44b5-9942-b27802a18224/executor.info'
> Checkpointing Task to 
> '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/meta/slaves/201305220702-1828659978-49643-36613-0/frameworks/201305220702-1828659978-49643-36613-0000/executors/07360cb3-7b42-44b5-9942-b27802a18224/runs/60657969-3cdc-46e3-ba9e-51c8db502ef9/tasks/07360cb3-7b42-44b5-9942-b27802a18224/task.info'
> Checkpointing forked pid 38518
> Checkpointing '38518' to 
> '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/meta/slaves/201305220702-1828659978-49643-36613-0/frameworks/201305220702-1828659978-49643-36613-0000/executors/07360cb3-7b42-44b5-9942-b27802a18224/runs/60657969-3cdc-46e3-ba9e-51c8db502ef9/pids/forked.pid'
> Fetching resources into 
> '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/slaves/201305220702-1828659978-49643-36613-0/frameworks/201305220702-1828659978-49643-36613-0000/executors/07360cb3-7b42-44b5-9942-b27802a18224/runs/60657969-3cdc-46e3-ba9e-51c8db502ef9'
> Checkpointing 'executor(1)@10.35.255.108:48801' to 
> '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/meta/slaves/201305220702-1828659978-49643-36613-0/frameworks/201305220702-1828659978-49643-36613-0000/executors/07360cb3-7b42-44b5-9942-b27802a18224/runs/60657969-3cdc-46e3-ba9e-51c8db502ef9/pids/libprocess.pid'
> Registered executor on smfd-atr-11-sr1.devel.twitter.com
> Starting task 07360cb3-7b42-44b5-9942-b27802a18224
> Forked command at 38572
> sh -c 'sleep 1000'
> Checkpointing 'scheduler(84)@10.35.255.108:49643' to 
> '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/meta/slaves/201305220702-1828659978-49643-36613-0/frameworks/201305220702-1828659978-49643-36613-0000/framework.pid'
> Checkpointing 'scheduler(84)@10.35.255.108:49643' to 
> '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/meta/slaves/201305220702-1828659978-49643-36613-0/frameworks/201305220702-1828659978-49643-36613-0000/framework.pid'
> libprocess: process-isolator(19)@10.35.255.108:49643 terminating due to 
> basic_filebuf::underflow error reading the file
> Waited on process 38572, returned status 15
> WARNING: Logging before InitGoogleLogging() is written to STDERR
> I0522 07:02:54.414633 38554 process_utils.hpp:64] Stopping ... 38572
> Group members:
>   PID  PPID  PGID  SESS COMMAND
> 38572 38518 38572 38572 sleep 1000
> Session members:
>   PID  PPID  PGID  SESS COMMAND
> 38572 38518 38572 38572 sleep 1000
> Sent signal to 38572
> GMOCK WARNING:
> Uninteresting mock function call - returning directly.
>     Function call: slaveLost(0x7fff81049190, @0x7fc814001eb0 
> 201305220702-1828659978-49643-36613-0)
> Stack trace:
> ../../src/tests/slave_recovery_tests.cpp:764: Failure
> Value of: status.get().state()
>   Actual: TASK_LOST
> Expected: TASK_FAILED

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to