Re: Review Request 71652: Avoid double reaping race in the command executor.

2019-10-23 Thread Vinod Kone

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/71652/#review218354
---


Ship it!

- Vinod Kone


On Oct. 23, 2019, 3:52 a.m., Benjamin Mahler wrote:
> 
> ---
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/71652/
> ---
> 
> (Updated Oct. 23, 2019, 3:52 a.m.)
> 
> 
> Review request for mesos, Gilbert Song, Greg Mann, Qian Zhang, and Vinod Kone.
> 
> 
> Bugs: MESOS-10007
> https://issues.apache.org/jira/browse/MESOS-10007
> 
> 
> Repository: mesos
> 
> 
> Description
> ---
> 
> Previously, it was possible for the command executor to miss
> the exit status of the subprocess since it performs its own
> reap and doesn't use the Subprocess::status. See MESOS-10007.
> 
> 
> Diffs
> ---
> 
>   src/launcher/executor.cpp d9b39e5a66eb7bc739dede52f43da8ca9f7c09d1 
> 
> 
> Diff: https://reviews.apache.org/r/71652/diff/1/
> 
> 
> Testing
> ---
> 
> make check + manual testing from Charles in JIRA
> 
> 
> Thanks,
> 
> Benjamin Mahler
> 
>



Re: Review Request 71652: Avoid double reaping race in the command executor.

2019-10-23 Thread Mesos Reviewbot

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/71652/#review218351
---



Bad patch!

Reviews applied: [71652]

Failed command: ['bash', '-c', "set -o pipefail; export OS='ubuntu:14.04' 
BUILDTOOL='autotools' COMPILER='gcc' CONFIGURATION='--verbose 
--disable-libtool-wrappers --disable-parallel-test-execution' 
ENVIRONMENT='GLOG_v=1 MESOS_VERBOSE=1'; ./support/docker-build.sh 2>&1 | tee 
build_71652"]

Error:
..
24; ports:[31000-32000]; disk(reservations: 
[(DYNAMIC,storage),(DYNAMIC,storage/default-role,test-principal)])[MOUNT(org.apache.mesos.csi.test.local,/tmp/CSIVersion_StorageLocalResourceProviderTest_OperatorOperationsWithResourceProviderResources_v1_iXVDxw/2GB-31d22435-1c20-4b95-a133-7ab0f68239d1,test)]:2048,
 offered or allocated: {}) on agent 8b74f172-6660-452a-b8f4-44d69129bd0c-S0 
from framework 8b74f172-6660-452a-b8f4-44d69129bd0c-
I1023 07:58:46.228407 18972 master.cpp:12706] Removing offer 
8b74f172-6660-452a-b8f4-44d69129bd0c-O3
I1023 07:58:46.230870 18970 hierarchical.cpp:1615] Framework 
8b74f172-6660-452a-b8f4-44d69129bd0c- filtered agent 
8b74f172-6660-452a-b8f4-44d69129bd0c-S0 for 5secs
I1023 07:58:46.233952 18966 master.cpp:12571] Sending operation '' (uuid: 
4cfdc6ca-5cc7-4191-b168-c57673c55e81) to agent 
8b74f172-6660-452a-b8f4-44d69129bd0c-S0 at slave(1249)@172.17.0.2:38971 
(4574622b6dd8)
I1023 07:58:46.234721 18978 slave.cpp:4352] Ignoring new checkpointed resources 
and operations identical to the current version
I1023 07:58:46.237903 18958 master.cpp:6412] Processing REVIVE call for 
framework 8b74f172-6660-452a-b8f4-44d69129bd0c- (default) at 
scheduler-4e29bb53-3c92-4102-a204-eb032bb2529a@172.17.0.2:38971
I1023 07:58:46.238562 18975 hierarchical.cpp:1711] Unsuppressed offers and 
cleared filters for roles { storage/default-role } of framework 
8b74f172-6660-452a-b8f4-44d69129bd0c-
I1023 07:58:46.239331 18963 provider.cpp:498] Received APPLY_OPERATION event
I1023 07:58:46.239405 18963 provider.cpp:1351] Received CREATE operation '' 
(uuid: 4cfdc6ca-5cc7-4191-b168-c57673c55e81)
I1023 07:58:46.240499 18975 hierarchical.cpp:1843] Performed allocation for 1 
agents in 1.615303ms
I1023 07:58:46.24 18975 hierarchical.cpp:1843] Performed allocation for 1 
agents in 255828ns
I1023 07:58:46.241899 18974 master.cpp:10409] Sending offers [ 
8b74f172-6660-452a-b8f4-44d69129bd0c-O4 ] to framework 
8b74f172-6660-452a-b8f4-44d69129bd0c- (default) at 
scheduler-4e29bb53-3c92-4102-a204-eb032bb2529a@172.17.0.2:38971
I1023 07:58:46.242486 18972 sched.cpp:934] Scheduler::resourceOffers took 
74004ns
I1023 07:58:46.253813 18957 http.cpp:1115] HTTP POST for 
/slave(1249)/api/v1/resource_provider from 172.17.0.2:35270
I1023 07:58:46.255318 18968 slave.cpp:8483] Handling resource provider message 
'UPDATE_OPERATION_STATUS: (uuid: 9eac9abd-2697-47f4-a5c2-a46fffa0deed) for 
framework  (latest state: OPERATION_FINISHED, status update state: 
OPERATION_FINISHED)'
I1023 07:58:46.255730 18968 slave.cpp:8936] Updating the state of operation 
with no ID (uuid: 9eac9abd-2697-47f4-a5c2-a46fffa0deed) for an operation API 
call (latest state: OPERATION_FINISHED, status update state: OPERATION_FINISHED)
I1023 07:58:46.255831 18968 slave.cpp:8690] Forwarding status update of 
operation with no ID (operation_uuid: 9eac9abd-2697-47f4-a5c2-a46fffa0deed) for 
an operator API call
I1023 07:58:46.256301 18962 master.cpp:12223] Updating the state of operation 
'' (uuid: 9eac9abd-2697-47f4-a5c2-a46fffa0deed) for an operator API call 
(latest state: OPERATION_PENDING, status update state: OPERATION_FINISHED)
I1023 07:58:46.257138 18976 slave.cpp:4352] Ignoring new checkpointed resources 
and operations identical to the current version
I1023 07:58:46.334250 18972 status_update_manager_process.hpp:152] Received 
operation status update OPERATION_FINISHED (Status UUID: 
9bdddc3b-fb07-46d6-b924-212ab36ef6c6) for operation UUID 
4cfdc6ca-5cc7-4191-b168-c57673c55e81 on agent 
8b74f172-6660-452a-b8f4-44d69129bd0c-S0
I1023 07:58:46.334384 18972 status_update_manager_process.hpp:414] Creating 
operation status update stream 4cfdc6ca-5cc7-4191-b168-c57673c55e81 
checkpoint=true
I1023 07:58:46.334528 18963 provider.cpp:498] Received 
ACKNOWLEDGE_OPERATION_STATUS event
I1023 07:58:46.334925 18972 status_update_manager_process.hpp:929] 
Checkpointing UPDATE for operation status update OPERATION_FINISHED (Status 
UUID: 9bdddc3b-fb07-46d6-b924-212ab36ef6c6) for operation UUID 
4cfdc6ca-5cc7-4191-b168-c57673c55e81 on agent 
8b74f172-6660-452a-b8f4-44d69129bd0c-S0
I1023 07:58:46.409291 18972 status_update_manager_process.hpp:528] Forwarding 
operation status update OPERATION_FINISHED (Status UUID: 
9bdddc3b-fb07-46d6-b924-212ab36ef6c6) for operation UUID 
4cfdc6ca-5cc7-4191-b168-c57673c55e81 on agent 
8b74f172-6660-452a-b8f4-44d69129bd0c-S0
I1023 07:58:46.

Review Request 71652: Avoid double reaping race in the command executor.

2019-10-22 Thread Benjamin Mahler

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/71652/
---

Review request for mesos, Gilbert Song, Greg Mann, Qian Zhang, and Vinod Kone.


Bugs: MESOS-10007
https://issues.apache.org/jira/browse/MESOS-10007


Repository: mesos


Description
---

Previously, it was possible for the command executor to miss
the exit status of the subprocess since it performs its own
reap and doesn't use the Subprocess::status. See MESOS-10007.
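
As an illustration only (this is not the Mesos or libprocess code), the sketch below shows at the plain POSIX level why two independent reaps of the same child are a problem: the exit status of a child can be collected once, so whichever caller reaps second gets ECHILD instead of a status. The names `ReapResult`, `reap`, and `spawnChild` are hypothetical helpers invented for this example, and it is an assumption that the race in MESOS-10007 reduces to this pattern; see the JIRA ticket for the actual analysis.

```cpp
// Sketch only: plain POSIX calls, not the Mesos/libprocess implementation.
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

#include <cassert>
#include <cerrno>

// Outcome of one reap attempt (hypothetical type for this example).
struct ReapResult {
  bool reaped;   // True if this caller collected a normal exit status.
  int exitCode;  // Exit code of the child when `reaped` is true.
  int error;     // errno from waitpid() when the call failed, else 0.
};

// Hypothetical helper: blocks in waitpid() for `pid`, retrying on EINTR.
ReapResult reap(pid_t pid) {
  int status = 0;
  pid_t result;
  do {
    result = waitpid(pid, &status, 0);
  } while (result == -1 && errno == EINTR);

  if (result == pid && WIFEXITED(status)) {
    return {true, WEXITSTATUS(status), 0};
  }
  return {false, -1, result == -1 ? errno : 0};
}

// Hypothetical helper: forks a child that exits immediately with `code`
// and returns the child's pid to the parent.
pid_t spawnChild(int code) {
  pid_t pid = fork();
  assert(pid != -1 && "fork failed");
  if (pid == 0) {
    _exit(code);  // Child: exit without running parent-side cleanup.
  }
  return pid;
}
```

With the default SIGCHLD disposition, calling `reap(pid)` twice on the pid returned by `spawnChild(42)` yields the exit code from the first call only; the second call fails with `error == ECHILD` because the child has already been collected. When the two callers run concurrently, which one gets the status depends on timing, which is why reading the status from a single reaper (here, `Subprocess::status`) avoids the problem.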


Diffs
---

  src/launcher/executor.cpp d9b39e5a66eb7bc739dede52f43da8ca9f7c09d1 


Diff: https://reviews.apache.org/r/71652/diff/1/


Testing
---

make check + manual testing from Charles in JIRA


Thanks,

Benjamin Mahler