Li Li created MESOS-7463:
----------------------------
Summary: e2e regression on mesos windows agent
Key: MESOS-7463
URL: https://issues.apache.org/jira/browse/MESOS-7463
Project: Mesos
Issue Type: Bug
Components: agent
Affects Versions: 1.4.0
Reporter: Li Li
Assignee: Li Li
With the latest windows agent, the “latest” subdir was created as symbolic link
at C:\mesos\w\meta\slaves\latest somehow, rather than a dir, and thus confuses
the windows agent logic. If I manually removed the symbolic link and created
the subdir of "latest", everything starts working again.
esos\w" --sandbox_directory="C:\mesos\sandbox" --strict="true"
--version="false" --work_dir="c:\mesos\w"I
I0505 17:22:49.470075 5500 slave.cpp:525] Agent resources: cpus(*):2;
mem(*):6143; disk(*):124925; ports(*):[31000-32000]
I0505 17:22:49.472075 5500 slave.cpp:533] Agent attributes: [ ]
I0505 17:22:49.473078 5500 slave.cpp:538] Agent hostname:
liwindows2016.1vbwyim4eaouzg5dmr2pv11jmc.dx.internal.cloudapp.net
I0505 17:22:49.477077 3088 status_update_manager.cpp:177] Pausing sending
status updates
2017-05-05 17:22:49,451:1916(0xc74):ZOO_INFO@log_env@1057: Client
environment:os.name=<not implemented>
2017-05-05 17:22:49,478:1916(0xc74):ZOO_INFO@log_env@1058: Client
environment:os.arch=<not implemented>
2017-05-05 17:22:49,478:1916(0xc74):ZOO_INFO@log_env@1059: Client
environment:os.version=<not implemented>
2017-05-05 17:22:49,479:1916(0xc74):ZOO_INFO@log_env@1065: Client
environment:user.name=<not implemented>
2017-05-05 17:22:49,480:1916(0xc74):ZOO_INFO@log_env@1076: Client
environment:user.home=<not implemented>
2017-05-05 17:22:49,480:1916(0xc74):ZOO_INFO@log_env@1083: Client
environment:user.dir=C:\mesos\w
2017-05-05 17:22:49,481:1916(0xc74):ZOO_INFO@zookeeper_init_internal@1126:
Initiating client connection, host=40.118.201.232:2181 sessionTimeout=10000
watcher=00007FF676883B37 sessionId=0 sessionPasswd=<null>
context=0000013DF9916130 flags=0
2017-05-05 17:22:49,568:1916(0x1868):ZOO_INFO@check_events@2372: initiated
connection to server [40.118.201.232:2181]
I0505 17:22:49.570092 612 state.cpp:62] Recovering state from
'c:\mesos\w\meta'
I0505 17:22:49.572082 612 state.cpp:710] No committed checkpointed resources
found at 'c:\mesos\w\meta\resources\resources.info'
2017-05-05 17:22:49,573:1916(0x1868):ZOO_INFO@check_events@2424: session
establishment complete on server [40.118.201.232:2181],
sessionId=0x15bdace10030009, negotiated timeout=10000
I0505 17:22:49.574095 3188 group.cpp:340] Group process
(zookeeper-group(1)@10.0.0.7:5051) connected to ZooKeeper
I0505 17:22:49.575093 3188 group.cpp:830] Syncing group operations: queue size
(joins, cancels, datas) = (0, 0, 0)
W0505 17:22:49.575093 612 state.cpp:147] Failed to find agent info file
'c:\mesos\w\meta\slaves\latest\slave.info'
I0505 17:22:49.575093 3188 group.cpp:418] Trying to create path '/mesos' in
ZooKeeper
I0505 17:22:49.579090 612 status_update_manager.cpp:203] Recovering status
update manager
I0505 17:22:49.580085 612 containerizer.cpp:608] Recovering containerizer
I0505 17:22:49.584092 3188 detector.cpp:152] Detected a new leader: (id='21')
I0505 17:22:49.587085 612 group.cpp:699] Trying to get
'/mesos/json.info_0000000021' in ZooKeeper
I0505 17:22:49.590093 6336 provisioner.cpp:410] Provisioner recovery complete
I0505 17:22:49.597093 6336 slave.cpp:5963] Finished recovery
I0505 17:22:49.600085 6336 slave.cpp:5996] Garbage collecting old agent latest
E0505 17:22:49.602087 6336 slave.cpp:6125] Failed to find the mtime of
'c:\mesos\w\meta\slaves\latest': Error invoking stat for
'c:\mesos\w\meta\slaves\latest': No such file or directory
I0505 17:22:49.603085 3088 gc.cpp:55] Scheduling 'c:\mesos\w\slaves\latest'
for gc 6.99990045027852days in the future
I0505 17:22:49.610085 612 zookeeper.cpp:262] A new leading master
([email protected]:5050) is detected
I0505 17:22:49.611085 612 slave.cpp:918] New master detected at
[email protected]:5050
I0505 17:22:49.611085 612 slave.cpp:942] No credentials provided. Attempting
to register without authentication
I0505 17:22:49.612087 612 slave.cpp:953] Detecting new master
I0505 17:22:49.611085 3088 status_update_manager.cpp:177] Pausing sending
status updates
I0505 17:22:49.688102 5076 slave.cpp:1121] Registered with master
[email protected]:5050; given agent ID
09561f38-257c-4ba0-993d-7180b0aa58cb-S2
I0505 17:22:49.689101 3188 status_update_manager.cpp:184] Resuming sending
status updates
F0505 17:22:49.690089 5076 paths.cpp:613] CHECK_SOME(os::rm(latest)): `os::rm`
could not remove 'c:\mesos\w\meta\slaves\latest': Access is denied.
Failed to remove latest symlink 'c:\mesos\w\meta\slaves\latest'
*** Check failure stack trace: ***
@ 00007FF676A27A0B google::LogMessage::Fail
@ 00007FF676A27910 google::LogMessage::SendToLog
@ 00007FF676A270F7 google::LogMessage::Flush
@ 00007FF676A28C31 google::LogMessageFatal::~LogMessageFatal
@ 00007FF6769D9047 _CheckFatal::~_CheckFatal
@ 00007FF677AD4BA0 mesos::internal::slave::paths::createSlaveDirectory
@ 00007FF677573F1A mesos::internal::slave::Slave::registered
@ 00007FF67773D51B
ProtobufProcess<mesos::internal::slave::Slave>::handler2<mesos::internal::SlaveRegisteredMessage,mesos::SlaveID
const & __ptr64,mesos::SlaveID const &
__ptr64,mesos::internal::MasterSlaveConnection const &
__ptr64,mesos::internal::MasterSlaveConnection c
@ 00007FF677639ADA (unknown)
@ 00007FF6777457A7 (unknown)
@ 00007FF677676786 (unknown)
@ 00007FF677645C8E (unknown)
@ 00007FF67762CB6B (unknown)
@ 00007FF67763FB58 (unknown)
@ 00007FF67774CD68 (unknown)
@ 00007FF67767F47C (unknown)
@ 00007FF67782B8FD (unknown)
@ 00007FF676C2AD3B ??
@ 00007FF67791135B ProtobufProcess<mesos::internal::slave::Slave>::visit
@ 00007FF676D36B68 process::MessageEvent::visit
@ 00007FF676D309C8 process::ProcessBase::serve
@ 00007FF676A55FD4 process::ProcessManager::resume
@ 00007FF676C29EDE ??
@ 00007FF676AB58B0
std::_Invoker_functor::_Call<`process::ProcessManager::init_threads'::`2'::<unnamed-type-worker>
>
@ 00007FF676B8F980
std::invoke<`process::ProcessManager::init_threads'::`2'::<unnamed-type-worker>
>
@ 00007FF676ACC59C
std::_LaunchPad<std::unique_ptr<std::tuple<`process::ProcessManager::init_threads'::`2'::<unnamed-type-worker>
>,std::default_delete<std::tuple<`process::ProcessManager::init_threads'::`2'::<unnamed-type-worker>
> > > >::_Execute<0>
@ 00007FF676CB44CA
std::_LaunchPad<std::unique_ptr<std::tuple<`process::ProcessManager::init_threads'::`2'::<unnamed-type-worker>
>,std::default_delete<std::tuple<`process::ProcessManager::init_threads'::`2'::<unnamed-type-worker>
> > > >::_Run
@ 00007FF676C75228
std::_LaunchPad<std::unique_ptr<std::tuple<`process::ProcessManager::init_threads'::`2'::<unnamed-type-worker>
>,std::default_delete<std::tuple<`process::ProcessManager::init_threads'::`2'::<unnamed-type-worker>
> > > >::_Go
@ 00007FF676C41F3D std::_Pad::_Call_func
@ 00007FF6790EFD78 invoke_thread_procedure
@ 00007FF6790EF821 __cdecl*)(void * __ptr64)
@ 00007FFD995D8364 BaseThreadInitThunk
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)