Dear Ralph,
I am sorry, but I think I forgot to set the plm verbosity to 5 last time. Here is
the output of the complete program, with and without -novm, for the following
mpiexec commands:
mpiexec -mca state_base_verbose 10 -mca errmgr_base_verbose 10 -mca plm_base_verbose 5 -mca btl tcp,sm,self -np 2 ./addhosttest
mpiexec -mca state_base_verbose 10 -mca errmgr_base_verbose 10 -mca plm_base_verbose 5 -mca btl tcp,sm,self -novm -np 2 ./addhosttest
Here you can see that although I spawn only one process on grsacc18, something
is also done with grsacc19.
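In case it helps when reading the attached log, here is a minimal Python sketch that groups the "ACTIVATE JOB" / "ACTIVATE PROC" lines by host, so it is easier to see which nodes are touched. This is my own helper, not part of Open MPI: the names LINE_RE and transitions_by_host are hypothetical, and the pattern only assumes the line layout visible in the output below (including the trailing backslash left over from the RTF export).

```python
import re
import sys
from collections import defaultdict

# Assumed line layout, taken from the attached output, e.g.
# [grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,1] STATE VM READY AT base/plm_base_launch_support.c:170\
# The "ACTIVATING ..." follow-up lines are deliberately not matched.
LINE_RE = re.compile(
    r"^\[(?P<host>[^:\]]+):(?P<pid>\d+)\]\s+"      # [host:pid]
    r"\[\[[^\]]*\],[^\]]*\]\s+"                    # [[jobfam,job],vpid] of the reporter
    r"ACTIVATE (?P<kind>JOB|PROC) (?P<target>\S+) "
    r"STATE (?P<state>.+?) AT (?P<where>\S+?)\\?$"  # optional trailing backslash
)


def transitions_by_host(lines):
    """Return {host: [(kind, target, state), ...]} in log order."""
    result = defaultdict(list)
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m:
            result[m.group("host")].append(
                (m.group("kind"), m.group("target"), m.group("state"))
            )
    return dict(result)


if __name__ == "__main__":
    # Usage: python transitions.py <logfile>   (the file name is up to you)
    with open(sys.argv[1], encoding="utf-8", errors="replace") as fh:
        for host, events in transitions_by_host(fh).items():
            print(host)
            for kind, target, state in events:
                print(f"  {kind:4} {target:16} {state}")
```

Run on the output below, this should list the state activations reported by each host's daemon separately, which makes the activity on grsacc19 easy to pick out.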
Sorry and thanks!
Suraj
[grsacc20:09163] mca: base: components_register: registering state components\
[grsacc20:09163] mca: base: components_register: found loaded component app\
[grsacc20:09163] mca: base: components_register: component app has no register or open function\
[grsacc20:09163] mca: base: components_register: found loaded component hnp\
[grsacc20:09163] mca: base: components_register: component hnp has no register or open function\
[grsacc20:09163] mca: base: components_register: found loaded component novm\
[grsacc20:09163] mca: base: components_register: component novm register function successful\
[grsacc20:09163] mca: base: components_register: found loaded component orted\
[grsacc20:09163] mca: base: components_register: component orted has no register or open function\
[grsacc20:09163] mca: base: components_register: found loaded component staged_hnp\
[grsacc20:09163] mca: base: components_register: component staged_hnp has no register or open function\
[grsacc20:09163] mca: base: components_register: found loaded component staged_orted\
[grsacc20:09163] mca: base: components_register: component staged_orted has no register or open function\
[grsacc20:09163] mca: base: components_open: opening state components\
[grsacc20:09163] mca: base: components_open: found loaded component app\
[grsacc20:09163] mca: base: components_open: component app open function successful\
[grsacc20:09163] mca: base: components_open: found loaded component hnp\
[grsacc20:09163] mca: base: components_open: component hnp open function successful\
[grsacc20:09163] mca: base: components_open: found loaded component novm\
[grsacc20:09163] mca: base: components_open: component novm open function successful\
[grsacc20:09163] mca: base: components_open: found loaded component orted\
[grsacc20:09163] mca: base: components_open: component orted open function successful\
[grsacc20:09163] mca: base: components_open: found loaded component staged_hnp\
[grsacc20:09163] mca: base: components_open: component staged_hnp open function successful\
[grsacc20:09163] mca: base: components_open: found loaded component staged_orted\
[grsacc20:09163] mca: base: components_open: component staged_orted open function successful\
[grsacc20:09163] mca:base:select: Auto-selecting state components\
[grsacc20:09163] mca:base:select:(state) Querying component [app]\
[grsacc20:09163] mca:base:select:(state) Skipping component [app]. Query failed to return a module\
[grsacc20:09163] mca:base:select:(state) Querying component [hnp]\
[grsacc20:09163] mca:base:select:(state) Query of component [hnp] set priority to 60\
[grsacc20:09163] mca:base:select:(state) Querying component [novm]\
[grsacc20:09163] mca:base:select:(state) Skipping component [novm]. Query failed to return a module\
[grsacc20:09163] mca:base:select:(state) Querying component [orted]\
[grsacc20:09163] mca:base:select:(state) Skipping component [orted]. Query failed to return a module\
[grsacc20:09163] mca:base:select:(state) Querying component [staged_hnp]\
[grsacc20:09163] mca:base:select:(state) Skipping component [staged_hnp]. Query failed to return a module\
[grsacc20:09163] mca:base:select:(state) Querying component [staged_orted]\
[grsacc20:09163] mca:base:select:(state) Skipping component [staged_orted]. Query failed to return a module\
[grsacc20:09163] mca:base:select:(state) Selected component [hnp]\
[grsacc20:09163] mca: base: close: component app closed\
[grsacc20:09163] mca: base: close: unloading component app\
[grsacc20:09163] mca: base: close: component novm closed\
[grsacc20:09163] mca: base: close: unloading component novm\
[grsacc20:09163] mca: base: close: component orted closed\
[grsacc20:09163] mca: base: close: unloading component orted\
[grsacc20:09163] mca: base: close: component staged_hnp closed\
[grsacc20:09163] mca: base: close: unloading component staged_hnp\
[grsacc20:09163] mca: base: close: component staged_orted closed\
[grsacc20:09163] mca: base: close: unloading component staged_orted\
[grsacc20:09163] ORTE_JOB_STATE_MACHINE:\
[grsacc20:09163] State: PENDING INIT cbfunc: DEFINED\
[grsacc20:09163] State: INIT_COMPLETE cbfunc: DEFINED\
[grsacc20:09163] State: PENDING ALLOCATION cbfunc: DEFINED\
[grsacc20:09163] State: ALLOCATION COMPLETE cbfunc: DEFINED\
[grsacc20:09163] State: DAEMONS LAUNCHED cbfunc: DEFINED\
[grsacc20:09163] State: ALL DAEMONS REPORTED cbfunc: DEFINED\
[grsacc20:09163] State: VM READY cbfunc: DEFINED\
[grsacc20:09163] State: PENDING MAPPING cbfunc: DEFINED\
[grsacc20:09163] State: MAP COMPLETE cbfunc: DEFINED\
[grsacc20:09163] State: PENDING FINAL SYSTEM PREP cbfunc: DEFINED\
[grsacc20:09163] State: PENDING APP LAUNCH cbfunc: DEFINED\
[grsacc20:09163] State: LOCAL LAUNCH COMPLETE cbfunc: DEFINED\
[grsacc20:09163] State: RUNNING cbfunc: DEFINED\
[grsacc20:09163] State: SYNC REGISTERED cbfunc: DEFINED\
[grsacc20:09163] State: NORMALLY TERMINATED cbfunc: DEFINED\
[grsacc20:09163] State: NOTIFY COMPLETED cbfunc: DEFINED\
[grsacc20:09163] State: ALL JOBS COMPLETE cbfunc: DEFINED\
[grsacc20:09163] State: DAEMONS TERMINATED cbfunc: DEFINED\
[grsacc20:09163] State: FORCED EXIT cbfunc: DEFINED\
[grsacc20:09163] State: REPORT PROGRESS cbfunc: DEFINED\
[grsacc20:09163] ORTE_PROC_STATE_MACHINE:\
[grsacc20:09163] State: RUNNING cbfunc: DEFINED\
[grsacc20:09163] State: SYNC REGISTERED cbfunc: DEFINED\
[grsacc20:09163] State: IOF COMPLETE cbfunc: DEFINED\
[grsacc20:09163] State: WAITPID FIRED cbfunc: DEFINED\
[grsacc20:09163] State: NORMALLY TERMINATED cbfunc: DEFINED\
[grsacc20:09163] mca: base: components_register: registering errmgr components\
[grsacc20:09163] mca: base: components_register: found loaded component default_app\
[grsacc20:09163] mca: base: components_register: component default_app register function successful\
[grsacc20:09163] mca: base: components_register: found loaded component default_hnp\
[grsacc20:09163] mca: base: components_register: component default_hnp register function successful\
[grsacc20:09163] mca: base: components_register: found loaded component default_orted\
[grsacc20:09163] mca: base: components_register: component default_orted register function successful\
[grsacc20:09163] mca: base: components_open: opening errmgr components\
[grsacc20:09163] mca: base: components_open: found loaded component default_app\
[grsacc20:09163] mca: base: components_open: component default_app open function successful\
[grsacc20:09163] mca: base: components_open: found loaded component default_hnp\
[grsacc20:09163] mca: base: components_open: component default_hnp open function successful\
[grsacc20:09163] mca: base: components_open: found loaded component default_orted\
[grsacc20:09163] mca: base: components_open: component default_orted open function successful\
[grsacc20:09163] mca:base:select:( plm) Querying component [rsh]\
[grsacc20:09163] [[INVALID],INVALID] plm:rsh_lookup on agent ssh : rsh path NULL\
[grsacc20:09163] mca:base:select:( plm) Query of component [rsh] set priority to 10\
[grsacc20:09163] mca:base:select:( plm) Querying component [slurm]\
[grsacc20:09163] mca:base:select:( plm) Skipping component [slurm]. Query failed to return a module\
[grsacc20:09163] mca:base:select:( plm) Querying component [tm]\
[grsacc20:09163] mca:base:select:( plm) Query of component [tm] set priority to 75\
[grsacc20:09163] mca:base:select:( plm) Selected component [tm]\
[grsacc20:09163] plm:base:set_hnp_name: initial bias 9163 nodename hash 2203159203\
[grsacc20:09163] plm:base:set_hnp_name: final jobfam 10809\
[grsacc20:09163] mca:base:select: Auto-selecting errmgr components\
[grsacc20:09163] mca:base:select:(errmgr) Querying component [default_app]\
[grsacc20:09163] mca:base:select:(errmgr) Skipping component [default_app]. Query failed to return a module\
[grsacc20:09163] mca:base:select:(errmgr) Querying component [default_hnp]\
[grsacc20:09163] mca:base:select:(errmgr) Query of component [default_hnp] set priority to 1000\
[grsacc20:09163] mca:base:select:(errmgr) Querying component [default_orted]\
[grsacc20:09163] mca:base:select:(errmgr) Skipping component [default_orted]. Query failed to return a module\
[grsacc20:09163] mca:base:select:(errmgr) Selected component [default_hnp]\
[grsacc20:09163] mca: base: close: component default_app closed\
[grsacc20:09163] mca: base: close: unloading component default_app\
[grsacc20:09163] mca: base: close: component default_orted closed\
[grsacc20:09163] mca: base: close: unloading component default_orted\
[grsacc20:09163] [[10809,0],0] plm:base:receive start comm\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [INVALID] STATE PENDING INIT AT plm_tm_module.c:157\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [INVALID] STATE PENDING INIT PRI 4\
[grsacc20:09163] [[10809,0],0] plm:base:setup_job\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,1] STATE INIT_COMPLETE AT base/plm_base_launch_support.c:315\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,1] STATE INIT_COMPLETE PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,1] STATE PENDING ALLOCATION AT base/plm_base_launch_support.c:326\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,1] STATE PENDING ALLOCATION PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,1] STATE ALLOCATION COMPLETE AT base/ras_base_allocate.c:421\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,1] STATE ALLOCATION COMPLETE PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,1] STATE PENDING DAEMON LAUNCH AT base/plm_base_launch_support.c:182\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,1] STATE PENDING DAEMON LAUNCH PRI 4\
[grsacc20:09163] [[10809,0],0] plm:base:setup_vm\
[grsacc20:09163] [[10809,0],0] plm:base:setup_vm creating map\
[grsacc20:09163] [[10809,0],0] plm:base:setup_vm add new daemon [[10809,0],1]\
[grsacc20:09163] [[10809,0],0] plm:base:setup_vm assigning new daemon [[10809,0],1] to node grsacc19\
[grsacc20:09163] [[10809,0],0] plm:tm: launching vm\
[grsacc20:09163] [[10809,0],0] plm:tm: final top-level argv:\
orted -mca ess tm -mca orte_ess_jobid 708378624 -mca orte_ess_vpid <template> -mca orte_ess_num_procs 2 -mca orte_hnp_uri "708378624.0;tcp://192.168.222.20:33738" -mca state_base_verbose 10 -mca errmgr_base_verbose 10 -mca plm_base_verbose 5 -mca btl tcp,sm,self\
[grsacc20:09163] [[10809,0],0] plm:tm: launching on node grsacc19\
[grsacc20:09163] [[10809,0],0] plm:tm: executing:\
orted -mca ess tm -mca orte_ess_jobid 708378624 -mca orte_ess_vpid 1 -mca orte_ess_num_procs 2 -mca orte_hnp_uri "708378624.0;tcp://192.168.222.20:33738" -mca state_base_verbose 10 -mca errmgr_base_verbose 10 -mca plm_base_verbose 5 -mca btl tcp,sm,self\
[grsacc20:09163] [[10809,0],0] plm:tm:launch: finished spawning orteds\
[grsacc19:00432] mca: base: components_register: registering state components\
[grsacc19:00432] mca: base: components_register: found loaded component app\
[grsacc19:00432] mca: base: components_register: component app has no register or open function\
[grsacc19:00432] mca: base: components_register: found loaded component hnp\
[grsacc19:00432] mca: base: components_register: component hnp has no register or open function\
[grsacc19:00432] mca: base: components_register: found loaded component novm\
[grsacc19:00432] mca: base: components_register: component novm register function successful\
[grsacc19:00432] mca: base: components_register: found loaded component orted\
[grsacc19:00432] mca: base: components_register: component orted has no register or open function\
[grsacc19:00432] mca: base: components_register: found loaded component staged_hnp\
[grsacc19:00432] mca: base: components_register: component staged_hnp has no register or open function\
[grsacc19:00432] mca: base: components_register: found loaded component staged_orted\
[grsacc19:00432] mca: base: components_register: component staged_orted has no register or open function\
[grsacc19:00432] mca: base: components_open: opening state components\
[grsacc19:00432] mca: base: components_open: found loaded component app\
[grsacc19:00432] mca: base: components_open: component app open function successful\
[grsacc19:00432] mca: base: components_open: found loaded component hnp\
[grsacc19:00432] mca: base: components_open: component hnp open function successful\
[grsacc19:00432] mca: base: components_open: found loaded component novm\
[grsacc19:00432] mca: base: components_open: component novm open function successful\
[grsacc19:00432] mca: base: components_open: found loaded component orted\
[grsacc19:00432] mca: base: components_open: component orted open function successful\
[grsacc19:00432] mca: base: components_open: found loaded component staged_hnp\
[grsacc19:00432] mca: base: components_open: component staged_hnp open function successful\
[grsacc19:00432] mca: base: components_open: found loaded component staged_orted\
[grsacc19:00432] mca: base: components_open: component staged_orted open function successful\
[grsacc19:00432] mca:base:select: Auto-selecting state components\
[grsacc19:00432] mca:base:select:(state) Querying component [app]\
[grsacc19:00432] mca:base:select:(state) Skipping component [app]. Query failed to return a module\
[grsacc19:00432] mca:base:select:(state) Querying component [hnp]\
[grsacc19:00432] mca:base:select:(state) Skipping component [hnp]. Query failed to return a module\
[grsacc19:00432] mca:base:select:(state) Querying component [novm]\
[grsacc19:00432] mca:base:select:(state) Skipping component [novm]. Query failed to return a module\
[grsacc19:00432] mca:base:select:(state) Querying component [orted]\
[grsacc19:00432] mca:base:select:(state) Query of component [orted] set priority to 100\
[grsacc19:00432] mca:base:select:(state) Querying component [staged_hnp]\
[grsacc19:00432] mca:base:select:(state) Skipping component [staged_hnp]. Query failed to return a module\
[grsacc19:00432] mca:base:select:(state) Querying component [staged_orted]\
[grsacc19:00432] mca:base:select:(state) Skipping component [staged_orted]. Query failed to return a module\
[grsacc19:00432] mca:base:select:(state) Selected component [orted]\
[grsacc19:00432] mca: base: close: component app closed\
[grsacc19:00432] mca: base: close: unloading component app\
[grsacc19:00432] mca: base: close: component hnp closed\
[grsacc19:00432] mca: base: close: unloading component hnp\
[grsacc19:00432] mca: base: close: component novm closed\
[grsacc19:00432] mca: base: close: unloading component novm\
[grsacc19:00432] mca: base: close: component staged_hnp closed\
[grsacc19:00432] mca: base: close: unloading component staged_hnp\
[grsacc19:00432] mca: base: close: component staged_orted closed\
[grsacc19:00432] mca: base: close: unloading component staged_orted\
[grsacc19:00432] ORTE_JOB_STATE_MACHINE:\
[grsacc19:00432] State: LOCAL LAUNCH COMPLETE cbfunc: DEFINED\
[grsacc19:00432] State: FORCED EXIT cbfunc: DEFINED\
[grsacc19:00432] State: DAEMONS TERMINATED cbfunc: DEFINED\
[grsacc19:00432] ORTE_PROC_STATE_MACHINE:\
[grsacc19:00432] State: RUNNING cbfunc: DEFINED\
[grsacc19:00432] State: SYNC REGISTERED cbfunc: DEFINED\
[grsacc19:00432] State: IOF COMPLETE cbfunc: DEFINED\
[grsacc19:00432] State: WAITPID FIRED cbfunc: DEFINED\
[grsacc19:00432] mca: base: components_register: registering errmgr components\
[grsacc19:00432] mca: base: components_register: found loaded component default_app\
[grsacc19:00432] mca: base: components_register: component default_app register function successful\
[grsacc19:00432] mca: base: components_register: found loaded component default_hnp\
[grsacc19:00432] mca: base: components_register: component default_hnp register function successful\
[grsacc19:00432] mca: base: components_register: found loaded component default_orted\
[grsacc19:00432] mca: base: components_register: component default_orted register function successful\
[grsacc19:00432] mca: base: components_open: opening errmgr components\
[grsacc19:00432] mca: base: components_open: found loaded component default_app\
[grsacc19:00432] mca: base: components_open: component default_app open function successful\
[grsacc19:00432] mca: base: components_open: found loaded component default_hnp\
[grsacc19:00432] mca: base: components_open: component default_hnp open function successful\
[grsacc19:00432] mca: base: components_open: found loaded component default_orted\
[grsacc19:00432] mca: base: components_open: component default_orted open function successful\
[grsacc19:00432] mca:base:select:( plm) Querying component [rsh]\
[grsacc19:00432] [[10809,0],1] plm:rsh_lookup on agent ssh : rsh path NULL\
[grsacc19:00432] mca:base:select:( plm) Query of component [rsh] set priority to 10\
[grsacc19:00432] mca:base:select:( plm) Selected component [rsh]\
[grsacc19:00432] mca:base:select: Auto-selecting errmgr components\
[grsacc19:00432] mca:base:select:(errmgr) Querying component [default_app]\
[grsacc19:00432] mca:base:select:(errmgr) Skipping component [default_app]. Query failed to return a module\
[grsacc19:00432] mca:base:select:(errmgr) Querying component [default_hnp]\
[grsacc19:00432] mca:base:select:(errmgr) Skipping component [default_hnp]. Query failed to return a module\
[grsacc19:00432] mca:base:select:(errmgr) Querying component [default_orted]\
[grsacc19:00432] mca:base:select:(errmgr) Query of component [default_orted] set priority to 1000\
[grsacc19:00432] mca:base:select:(errmgr) Selected component [default_orted]\
[grsacc19:00432] mca: base: close: component default_app closed\
[grsacc19:00432] mca: base: close: unloading component default_app\
[grsacc19:00432] mca: base: close: component default_hnp closed\
[grsacc19:00432] mca: base: close: unloading component default_hnp\
[grsacc19:00432] [[10809,0],1] plm:rsh_setup on agent ssh : rsh path NULL\
[grsacc19:00432] [[10809,0],1] plm:base:receive start comm\
[grsacc20:09163] [[10809,0],0] plm:base:orted_report_launch from daemon [[10809,0],1]\
[grsacc20:09163] [[10809,0],0] plm:base:orted_report_launch from daemon [[10809,0],1] on node grsacc19\
[grsacc20:09163] [[10809,0],0] RECEIVED TOPOLOGY FROM NODE grsacc19\
[grsacc20:09163] [[10809,0],0] NEW TOPOLOGY - ADDING\
[grsacc20:09163] [[10809,0],0] plm:base:orted_report_launch completed for daemon [[10809,0],1] at contact 708378624.1;tcp://192.168.222.19:56156\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,1] STATE ALL DAEMONS REPORTED AT base/plm_base_launch_support.c:842\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,1] STATE ALL DAEMONS REPORTED PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,1] STATE VM READY AT base/plm_base_launch_support.c:170\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,1] STATE VM READY PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,1] STATE PENDING MAPPING AT base/plm_base_launch_support.c:207\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,1] STATE PENDING MAPPING PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,1] STATE MAP COMPLETE AT base/rmaps_base_map_job.c:316\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,1] STATE MAP COMPLETE PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,1] STATE PENDING FINAL SYSTEM PREP AT base/plm_base_launch_support.c:233\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,1] STATE PENDING FINAL SYSTEM PREP PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,1] STATE PENDING APP LAUNCH AT base/plm_base_launch_support.c:410\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,1] STATE PENDING APP LAUNCH PRI 4\
[grsacc20:09163] [[10809,0],0] plm:base:launch_apps for job [10809,1]\
[grsacc20:09163] [[10809,0],0] ACTIVATE PROC [[10809,1],0] STATE RUNNING AT base/odls_base_default_fns.c:1545\
[grsacc20:09163] [[10809,0],0] ACTIVATING PROC [[10809,1],0] STATE RUNNING PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,1] STATE LOCAL LAUNCH COMPLETE AT base/odls_base_default_fns.c:1593\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,1] STATE LOCAL LAUNCH COMPLETE PRI 4\
[grsacc20:09163] [[10809,0],0] state:base:track_procs called for proc [[10809,1],0] state RUNNING\
[grsacc20:09165] mca: base: components_register: registering state components\
[grsacc20:09165] mca: base: components_register: found loaded component app\
[grsacc20:09165] mca: base: components_register: component app has no register or open function\
[grsacc20:09165] mca: base: components_register: found loaded component hnp\
[grsacc20:09165] mca: base: components_register: component hnp has no register or open function\
[grsacc20:09165] mca: base: components_register: found loaded component novm\
[grsacc20:09165] mca: base: components_register: component novm register function successful\
[grsacc20:09165] mca: base: components_register: found loaded component orted\
[grsacc20:09165] mca: base: components_register: component orted has no register or open function\
[grsacc20:09165] mca: base: components_register: found loaded component staged_hnp\
[grsacc20:09165] mca: base: components_register: component staged_hnp has no register or open function\
[grsacc20:09165] mca: base: components_register: found loaded component staged_orted\
[grsacc20:09165] mca: base: components_register: component staged_orted has no register or open function\
[grsacc20:09165] mca: base: components_open: opening state components\
[grsacc20:09165] mca: base: components_open: found loaded component app\
[grsacc20:09165] mca: base: components_open: component app open function successful\
[grsacc20:09165] mca: base: components_open: found loaded component hnp\
[grsacc20:09165] mca: base: components_open: component hnp open function successful\
[grsacc20:09165] mca: base: components_open: found loaded component novm\
[grsacc20:09165] mca: base: components_open: component novm open function successful\
[grsacc20:09165] mca: base: components_open: found loaded component orted\
[grsacc20:09165] mca: base: components_open: component orted open function successful\
[grsacc20:09165] mca: base: components_open: found loaded component staged_hnp\
[grsacc20:09165] mca: base: components_open: component staged_hnp open function successful\
[grsacc20:09165] mca: base: components_open: found loaded component staged_orted\
[grsacc20:09165] mca: base: components_open: component staged_orted open function successful\
[grsacc20:09165] mca:base:select: Auto-selecting state components\
[grsacc20:09165] mca:base:select:(state) Querying component [app]\
[grsacc20:09165] mca:base:select:(state) Query of component [app] set priority to 1000\
[grsacc20:09165] mca:base:select:(state) Querying component [hnp]\
[grsacc20:09165] mca:base:select:(state) Skipping component [hnp]. Query failed to return a module\
[grsacc20:09165] mca:base:select:(state) Querying component [novm]\
[grsacc20:09165] mca:base:select:(state) Skipping component [novm]. Query failed to return a module\
[grsacc20:09165] mca:base:select:(state) Querying component [orted]\
[grsacc20:09165] mca:base:select:(state) Skipping component [orted]. Query failed to return a module\
[grsacc20:09165] mca:base:select:(state) Querying component [staged_hnp]\
[grsacc20:09165] mca:base:select:(state) Skipping component [staged_hnp]. Query failed to return a module\
[grsacc20:09165] mca:base:select:(state) Querying component [staged_orted]\
[grsacc20:09165] mca:base:select:(state) Skipping component [staged_orted]. Query failed to return a module\
[grsacc20:09165] mca:base:select:(state) Selected component [app]\
[grsacc20:09165] mca: base: close: component hnp closed\
[grsacc20:09165] mca: base: close: unloading component hnp\
[grsacc20:09165] mca: base: close: component novm closed\
[grsacc20:09165] mca: base: close: unloading component novm\
[grsacc20:09165] mca: base: close: component orted closed\
[grsacc20:09165] mca: base: close: unloading component orted\
[grsacc20:09165] mca: base: close: component staged_hnp closed\
[grsacc20:09165] mca: base: close: unloading component staged_hnp\
[grsacc20:09165] mca: base: close: component staged_orted closed\
[grsacc20:09165] mca: base: close: unloading component staged_orted\
[grsacc20:09165] mca: base: components_register: registering errmgr components\
[grsacc20:09165] mca: base: components_register: found loaded component default_app\
[grsacc20:09165] mca: base: components_register: component default_app register function successful\
[grsacc20:09165] mca: base: components_register: found loaded component default_hnp\
[grsacc20:09165] mca: base: components_register: component default_hnp register function successful\
[grsacc20:09165] mca: base: components_register: found loaded component default_orted\
[grsacc20:09165] mca: base: components_register: component default_orted register function successful\
[grsacc20:09165] mca: base: components_open: opening errmgr components\
[grsacc20:09165] mca: base: components_open: found loaded component default_app\
[grsacc20:09165] mca: base: components_open: component default_app open function successful\
[grsacc20:09165] mca: base: components_open: found loaded component default_hnp\
[grsacc20:09165] mca: base: components_open: component default_hnp open function successful\
[grsacc20:09165] mca: base: components_open: found loaded component default_orted\
[grsacc20:09165] mca: base: components_open: component default_orted open function successful\
[grsacc20:09165] mca:base:select: Auto-selecting errmgr components\
[grsacc20:09165] mca:base:select:(errmgr) Querying component [default_app]\
[grsacc20:09165] mca:base:select:(errmgr) Query of component [default_app] set priority to 1000\
[grsacc20:09165] mca:base:select:(errmgr) Querying component [default_hnp]\
[grsacc20:09165] mca:base:select:(errmgr) Skipping component [default_hnp]. Query failed to return a module\
[grsacc20:09165] mca:base:select:(errmgr) Querying component [default_orted]\
[grsacc20:09165] mca:base:select:(errmgr) Skipping component [default_orted]. Query failed to return a module\
[grsacc20:09165] mca:base:select:(errmgr) Selected component [default_app]\
[grsacc20:09165] mca: base: close: component default_hnp closed\
[grsacc20:09165] mca: base: close: unloading component default_hnp\
[grsacc20:09165] mca: base: close: component default_orted closed\
[grsacc20:09165] mca: base: close: unloading component default_orted\
[grsacc19:00432] [[10809,0],1] ACTIVATE PROC [[10809,1],1] STATE RUNNING AT base/odls_base_default_fns.c:1545\
[grsacc19:00432] [[10809,0],1] ACTIVATING PROC [[10809,1],1] STATE RUNNING PRI 4\
[grsacc19:00432] [[10809,0],1] ACTIVATE JOB [10809,1] STATE LOCAL LAUNCH COMPLETE AT base/odls_base_default_fns.c:1593\
[grsacc19:00432] [[10809,0],1] ACTIVATING JOB [10809,1] STATE LOCAL LAUNCH COMPLETE PRI 4\
[grsacc19:00432] [[10809,0],1] state:orted:track_procs called for proc [[10809,1],1] state RUNNING\
[grsacc19:00432] [[10809,0],1] state:orted:track_jobs sending local launch complete for job [10809,1]\
[grsacc20:09163] [[10809,0],0] plm:base:receive processing msg\
[grsacc20:09163] [[10809,0],0] plm:base:receive update proc state command from [[10809,0],1]\
[grsacc20:09163] [[10809,0],0] plm:base:receive got update_proc_state for job [10809,1]\
[grsacc20:09163] [[10809,0],0] plm:base:receive got update_proc_state for vpid 1 state RUNNING exit_code 0\
[grsacc20:09163] [[10809,0],0] ACTIVATE PROC [[10809,1],1] STATE RUNNING AT base/plm_base_receive.c:296\
[grsacc20:09163] [[10809,0],0] ACTIVATING PROC [[10809,1],1] STATE RUNNING PRI 4\
[grsacc20:09163] [[10809,0],0] plm:base:receive done processing commands\
[grsacc19:00433] mca: base: components_register: registering state components\
[grsacc19:00433] mca: base: components_register: found loaded component app\
[grsacc19:00433] mca: base: components_register: component app has no register or open function\
[grsacc19:00433] mca: base: components_register: found loaded component hnp\
[grsacc19:00433] mca: base: components_register: component hnp has no register or open function\
[grsacc19:00433] mca: base: components_register: found loaded component novm\
[grsacc19:00433] mca: base: components_register: component novm register function successful\
[grsacc19:00433] mca: base: components_register: found loaded component orted\
[grsacc19:00433] mca: base: components_register: component orted has no register or open function\
[grsacc19:00433] mca: base: components_register: found loaded component staged_hnp\
[grsacc19:00433] mca: base: components_register: component staged_hnp has no register or open function\
[grsacc19:00433] mca: base: components_register: found loaded component staged_orted\
[grsacc19:00433] mca: base: components_register: component staged_orted has no register or open function\
[grsacc19:00433] mca: base: components_open: opening state components\
[grsacc19:00433] mca: base: components_open: found loaded component app\
[grsacc19:00433] mca: base: components_open: component app open function successful\
[grsacc19:00433] mca: base: components_open: found loaded component hnp\
[grsacc19:00433] mca: base: components_open: component hnp open function successful\
[grsacc19:00433] mca: base: components_open: found loaded component novm\
[grsacc19:00433] mca: base: components_open: component novm open function successful\
[grsacc19:00433] mca: base: components_open: found loaded component orted\
[grsacc19:00433] mca: base: components_open: component orted open function successful\
[grsacc19:00433] mca: base: components_open: found loaded component staged_hnp\
[grsacc19:00433] mca: base: components_open: component staged_hnp open function successful\
[grsacc19:00433] mca: base: components_open: found loaded component staged_orted\
[grsacc19:00433] mca: base: components_open: component staged_orted open function successful\
[grsacc19:00433] mca:base:select: Auto-selecting state components\
[grsacc19:00433] mca:base:select:(state) Querying component [app]\
[grsacc19:00433] mca:base:select:(state) Query of component [app] set priority to 1000\
[grsacc19:00433] mca:base:select:(state) Querying component [hnp]\
[grsacc19:00433] mca:base:select:(state) Skipping component [hnp]. Query failed to return a module\
[grsacc19:00433] mca:base:select:(state) Querying component [novm]\
[grsacc19:00433] mca:base:select:(state) Skipping component [novm]. Query failed to return a module\
[grsacc19:00433] mca:base:select:(state) Querying component [orted]\
[grsacc19:00433] mca:base:select:(state) Skipping component [orted]. Query failed to return a module\
[grsacc19:00433] mca:base:select:(state) Querying component [staged_hnp]\
[grsacc19:00433] mca:base:select:(state) Skipping component [staged_hnp]. Query failed to return a module\
[grsacc19:00433] mca:base:select:(state) Querying component [staged_orted]\
[grsacc19:00433] mca:base:select:(state) Skipping component [staged_orted]. Query failed to return a module\
[grsacc19:00433] mca:base:select:(state) Selected component [app]\
[grsacc19:00433] mca: base: close: component hnp closed\
[grsacc19:00433] mca: base: close: unloading component hnp\
[grsacc19:00433] mca: base: close: component novm closed\
[grsacc19:00433] mca: base: close: unloading component novm\
[grsacc19:00433] mca: base: close: component orted closed\
[grsacc19:00433] mca: base: close: unloading component orted\
[grsacc19:00433] mca: base: close: component staged_hnp closed\
[grsacc19:00433] mca: base: close: unloading component staged_hnp\
[grsacc19:00433] mca: base: close: component staged_orted closed\
[grsacc19:00433] mca: base: close: unloading component staged_orted\
[grsacc20:09163] [[10809,0],0] state:base:track_procs called for proc [[10809,1],1] state RUNNING\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,1] STATE RUNNING AT base/state_base_fns.c:482\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,1] STATE RUNNING PRI 4\
[grsacc20:09163] [[10809,0],0] plm:base:launch wiring up iof for job [10809,1]\
[grsacc19:00433] mca: base: components_register: registering errmgr components\
[grsacc19:00433] mca: base: components_register: found loaded component default_app\
[grsacc19:00433] mca: base: components_register: component default_app register function successful\
[grsacc19:00433] mca: base: components_register: found loaded component default_hnp\
[grsacc19:00433] mca: base: components_register: component default_hnp register function successful\
[grsacc19:00433] mca: base: components_register: found loaded component default_orted\
[grsacc19:00433] mca: base: components_register: component default_orted register function successful\
[grsacc19:00433] mca: base: components_open: opening errmgr components\
[grsacc19:00433] mca: base: components_open: found loaded component default_app\
[grsacc19:00433] mca: base: components_open: component default_app open function successful\
[grsacc19:00433] mca: base: components_open: found loaded component default_hnp\
[grsacc19:00433] mca: base: components_open: component default_hnp open function successful\
[grsacc19:00433] mca: base: components_open: found loaded component default_orted\
[grsacc19:00433] mca: base: components_open: component default_orted open function successful\
[grsacc19:00433] mca:base:select: Auto-selecting errmgr components\
[grsacc19:00433] mca:base:select:(errmgr) Querying component [default_app]\
[grsacc19:00433] mca:base:select:(errmgr) Query of component [default_app] set priority to 1000\
[grsacc19:00433] mca:base:select:(errmgr) Querying component [default_hnp]\
[grsacc19:00433] mca:base:select:(errmgr) Skipping component [default_hnp]. Query failed to return a module\
[grsacc19:00433] mca:base:select:(errmgr) Querying component [default_orted]\
[grsacc19:00433] mca:base:select:(errmgr) Skipping component [default_orted]. Query failed to return a module\
[grsacc19:00433] mca:base:select:(errmgr) Selected component [default_app]\
[grsacc19:00433] mca: base: close: component default_hnp closed\
[grsacc19:00433] mca: base: close: unloading component default_hnp\
[grsacc19:00433] mca: base: close: component default_orted closed\
[grsacc19:00433] mca: base: close: unloading component default_orted\
[grsacc20:09163] [[10809,0],0] ACTIVATE PROC [[10809,1],0] STATE SYNC REGISTERED AT base/odls_base_default_fns.c:1836\
[grsacc20:09163] [[10809,0],0] ACTIVATING PROC [[10809,1],0] STATE SYNC REGISTERED PRI 4\
[grsacc20:09163] [[10809,0],0] state:base:track_procs called for proc [[10809,1],0] state SYNC REGISTERED\
[grsacc19:00432] [[10809,0],1] ACTIVATE PROC [[10809,1],1] STATE SYNC REGISTERED AT base/odls_base_default_fns.c:1836\
[grsacc20:09163] [[10809,0],0] plm:base:receive processing msg\
[grsacc20:09163] [[10809,0],0] ACTIVATE PROC [[10809,1],1] STATE SYNC REGISTERED AT base/plm_base_receive.c:354\
[grsacc19:00432] [[10809,0],1] ACTIVATING PROC [[10809,1],1] STATE SYNC REGISTERED PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATING PROC [[10809,1],1] STATE SYNC REGISTERED PRI 4\
[grsacc20:09163] [[10809,0],0] plm:base:receive done processing commands\
[grsacc20:09163] [[10809,0],0] state:base:track_procs called for proc [[10809,1],1] state SYNC REGISTERED\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,1] STATE SYNC REGISTERED AT base/state_base_fns.c:490\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,1] STATE SYNC REGISTERED PRI 4\
[grsacc20:09163] [[10809,0],0] plm:base:launch registered event\
[grsacc20:09163] [[10809,0],0] plm:base:launch job [10809,1] is not a dynamic spawn\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,1] STATE READY FOR DEBUGGERS AT base/plm_base_launch_support.c:609\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,1] STATE READY FOR DEBUGGERS PRI 4\
[grsacc19:00432] [[10809,0],1] state:orted:track_procs called for proc [[10809,1],1] state SYNC REGISTERED\
[grsacc19:00432] [[10809,0],1] state:orted: sending contact info to HNP\
[grsacc20:09163] [[10809,0],0] plm:base:receive processing msg\
[grsacc20:09163] [[10809,0],0] plm:base:receive job launch command from [[10809,1],0]\
[grsacc20:09163] [[10809,0],0] plm:base:receive adding hosts\
[grsacc20:09163] [[10809,0],0] plm:base:receive calling spawn\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [INVALID] STATE PENDING INIT AT plm_tm_module.c:157\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [INVALID] STATE PENDING INIT PRI 4\
[grsacc20:09163] [[10809,0],0] plm:base:receive done processing commands\
all info set, ready to spawn\
[grsacc20:09163] [[10809,0],0] plm:base:setup_job\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,2] STATE INIT_COMPLETE AT base/plm_base_launch_support.c:315\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,2] STATE INIT_COMPLETE PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,2] STATE PENDING ALLOCATION AT base/plm_base_launch_support.c:326\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,2] STATE PENDING ALLOCATION PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,2] STATE ALLOCATION COMPLETE AT base/ras_base_allocate.c:421\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,2] STATE ALLOCATION COMPLETE PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,2] STATE PENDING DAEMON LAUNCH AT base/plm_base_launch_support.c:182\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,2] STATE PENDING DAEMON LAUNCH PRI 4\
[grsacc20:09163] [[10809,0],0] plm:base:setup_vm\
[grsacc20:09163] [[10809,0],0] plm:base:setup_vm add new daemon [[10809,0],2]\
[grsacc20:09163] [[10809,0],0] plm:base:setup_vm assigning new daemon [[10809,0],2] to node grsacc18\
[grsacc20:09163] [[10809,0],0] plm:tm: launching vm\
[grsacc20:09163] [[10809,0],0] plm:tm: final top-level argv:\
orted -mca ess tm -mca orte_ess_jobid 708378624 -mca orte_ess_vpid <template> -mca orte_ess_num_procs 3 -mca orte_hnp_uri "708378624.0;tcp://192.168.222.20:33738" -mca state_base_verbose 10 -mca errmgr_base_verbose 10 -mca plm_base_verbose 5 -mca btl tcp,sm,self\
[grsacc20:09163] [[10809,0],0] plm:tm: launching on node grsacc19\
[grsacc20:09163] [[10809,0],0] plm:tm: executing:\
orted -mca ess tm -mca orte_ess_jobid 708378624 -mca orte_ess_vpid 1 -mca orte_ess_num_procs 3 -mca orte_hnp_uri "708378624.0;tcp://192.168.222.20:33738" -mca state_base_verbose 10 -mca errmgr_base_verbose 10 -mca plm_base_verbose 5 -mca btl tcp,sm,self\
[grsacc20:09163] [[10809,0],0] plm:tm: launching on node grsacc18\
[grsacc20:09163] [[10809,0],0] plm:tm: executing:\
orted -mca ess tm -mca orte_ess_jobid 708378624 -mca orte_ess_vpid 2 -mca orte_ess_num_procs 3 -mca orte_hnp_uri "708378624.0;tcp://192.168.222.20:33738" -mca state_base_verbose 10 -mca errmgr_base_verbose 10 -mca plm_base_verbose 5 -mca btl tcp,sm,self\
[grsacc20:09163] [[10809,0],0] plm:tm:launch: finished spawning orteds\
[grsacc19:00435] mca: base: components_register: registering state components\
[grsacc19:00435] mca: base: components_register: found loaded component app\
[grsacc19:00435] mca: base: components_register: component app has no register or open function\
[grsacc19:00435] mca: base: components_register: found loaded component hnp\
[grsacc19:00435] mca: base: components_register: component hnp has no register or open function\
[grsacc19:00435] mca: base: components_register: found loaded component novm\
[grsacc19:00435] mca: base: components_register: component novm register function successful\
[grsacc19:00435] mca: base: components_register: found loaded component orted\
[grsacc19:00435] mca: base: components_register: component orted has no register or open function\
[grsacc19:00435] mca: base: components_register: found loaded component staged_hnp\
[grsacc19:00435] mca: base: components_register: component staged_hnp has no register or open function\
[grsacc19:00435] mca: base: components_register: found loaded component staged_orted\
[grsacc19:00435] mca: base: components_register: component staged_orted has no register or open function\
[grsacc19:00435] mca: base: components_open: opening state components\
[grsacc19:00435] mca: base: components_open: found loaded component app\
[grsacc19:00435] mca: base: components_open: component app open function successful\
[grsacc19:00435] mca: base: components_open: found loaded component hnp\
[grsacc19:00435] mca: base: components_open: component hnp open function successful\
[grsacc19:00435] mca: base: components_open: found loaded component novm\
[grsacc19:00435] mca: base: components_open: component novm open function successful\
[grsacc19:00435] mca: base: components_open: found loaded component orted\
[grsacc19:00435] mca: base: components_open: component orted open function successful\
[grsacc19:00435] mca: base: components_open: found loaded component staged_hnp\
[grsacc19:00435] mca: base: components_open: component staged_hnp open function successful\
[grsacc19:00435] mca: base: components_open: found loaded component staged_orted\
[grsacc19:00435] mca: base: components_open: component staged_orted open function successful\
[grsacc19:00435] mca:base:select: Auto-selecting state components\
[grsacc19:00435] mca:base:select:(state) Querying component [app]\
[grsacc19:00435] mca:base:select:(state) Skipping component [app]. Query failed to return a module\
[grsacc19:00435] mca:base:select:(state) Querying component [hnp]\
[grsacc19:00435] mca:base:select:(state) Skipping component [hnp]. Query failed to return a module\
[grsacc19:00435] mca:base:select:(state) Querying component [novm]\
[grsacc19:00435] mca:base:select:(state) Skipping component [novm]. Query failed to return a module\
[grsacc19:00435] mca:base:select:(state) Querying component [orted]\
[grsacc19:00435] mca:base:select:(state) Query of component [orted] set priority to 100\
[grsacc19:00435] mca:base:select:(state) Querying component [staged_hnp]\
[grsacc19:00435] mca:base:select:(state) Skipping component [staged_hnp]. Query failed to return a module\
[grsacc19:00435] mca:base:select:(state) Querying component [staged_orted]\
[grsacc19:00435] mca:base:select:(state) Skipping component [staged_orted]. Query failed to return a module\
[grsacc19:00435] mca:base:select:(state) Selected component [orted]\
[grsacc19:00435] mca: base: close: component app closed\
[grsacc19:00435] mca: base: close: unloading component app\
[grsacc19:00435] mca: base: close: component hnp closed\
[grsacc19:00435] mca: base: close: unloading component hnp\
[grsacc19:00435] mca: base: close: component novm closed\
[grsacc19:00435] mca: base: close: unloading component novm\
[grsacc19:00435] mca: base: close: component staged_hnp closed\
[grsacc19:00435] mca: base: close: unloading component staged_hnp\
[grsacc19:00435] mca: base: close: component staged_orted closed\
[grsacc19:00435] mca: base: close: unloading component staged_orted\
[grsacc19:00435] ORTE_JOB_STATE_MACHINE:\
[grsacc19:00435] State: LOCAL LAUNCH COMPLETE cbfunc: DEFINED\
[grsacc19:00435] State: FORCED EXIT cbfunc: DEFINED\
[grsacc19:00435] State: DAEMONS TERMINATED cbfunc: DEFINED\
[grsacc19:00435] ORTE_PROC_STATE_MACHINE:\
[grsacc19:00435] State: RUNNING cbfunc: DEFINED\
[grsacc19:00435] State: SYNC REGISTERED cbfunc: DEFINED\
[grsacc19:00435] State: IOF COMPLETE cbfunc: DEFINED\
[grsacc19:00435] State: WAITPID FIRED cbfunc: DEFINED\
[grsacc19:00435] mca: base: components_register: registering errmgr components\
[grsacc19:00435] mca: base: components_register: found loaded component default_app\
[grsacc19:00435] mca: base: components_register: component default_app register function successful\
[grsacc19:00435] mca: base: components_register: found loaded component default_hnp\
[grsacc19:00435] mca: base: components_register: component default_hnp register function successful\
[grsacc19:00435] mca: base: components_register: found loaded component default_orted\
[grsacc19:00435] mca: base: components_register: component default_orted register function successful\
[grsacc19:00435] mca: base: components_open: opening errmgr components\
[grsacc19:00435] mca: base: components_open: found loaded component default_app\
[grsacc19:00435] mca: base: components_open: component default_app open function successful\
[grsacc19:00435] mca: base: components_open: found loaded component default_hnp\
[grsacc19:00435] mca: base: components_open: component default_hnp open function successful\
[grsacc19:00435] mca: base: components_open: found loaded component default_orted\
[grsacc19:00435] mca: base: components_open: component default_orted open function successful\
[grsacc19:00435] mca:base:select:( plm) Querying component [rsh]\
[grsacc19:00435] [[10809,0],1] plm:rsh_lookup on agent ssh : rsh path NULL\
[grsacc19:00435] mca:base:select:( plm) Query of component [rsh] set priority to 10\
[grsacc19:00435] mca:base:select:( plm) Selected component [rsh]\
[grsacc19:00435] mca:base:select: Auto-selecting errmgr components\
[grsacc19:00435] mca:base:select:(errmgr) Querying component [default_app]\
[grsacc19:00435] mca:base:select:(errmgr) Skipping component [default_app]. Query failed to return a module\
[grsacc19:00435] mca:base:select:(errmgr) Querying component [default_hnp]\
[grsacc19:00435] mca:base:select:(errmgr) Skipping component [default_hnp]. Query failed to return a module\
[grsacc19:00435] mca:base:select:(errmgr) Querying component [default_orted]\
[grsacc19:00435] mca:base:select:(errmgr) Query of component [default_orted] set priority to 1000\
[grsacc19:00435] mca:base:select:(errmgr) Selected component [default_orted]\
[grsacc19:00435] mca: base: close: component default_app closed\
[grsacc19:00435] mca: base: close: unloading component default_app\
[grsacc19:00435] mca: base: close: component default_hnp closed\
[grsacc19:00435] mca: base: close: unloading component default_hnp\
[grsacc19:00435] [[10809,0],1] plm:rsh_setup on agent ssh : rsh path NULL\
[grsacc19:00435] [[10809,0],1] plm:base:receive start comm\
[grsacc19:00432] [[10809,0],1] ACTIVATE PROC [[10809,0],0] STATE LIFELINE LOST AT oob_tcp_component.c:1102\
[grsacc19:00435] [[10809,0],1] FORCE-TERMINATE AT oob_tcp_sendrecv.c:430\
[grsacc19:00435] [[10809,0],1] ACTIVATE JOB NULL STATE FORCED EXIT AT oob_tcp_sendrecv.c:430\
[grsacc19:00435] [[10809,0],1] ACTIVATING JOB NULL STATE FORCED EXIT PRI 0\
[grsacc19:00432] [[10809,0],1] ACTIVATING PROC [[10809,0],0] STATE LIFELINE LOST PRI 0\
[grsacc19:00432] [[10809,0],1] errmgr:default_orted:proc_errors process [[10809,0],0] error state LIFELINE LOST\
[grsacc19:00432] [[10809,0],1] errmgr:orted lifeline lost - exiting\
[grsacc19:00435] mca: base: close: component default_orted closed\
[grsacc19:00435] mca: base: close: unloading component default_orted\
[grsacc19:00435] [[10809,0],1] plm:base:receive stop comm\
[grsacc19:00435] mca: base: close: component orted closed\
[grsacc19:00435] mca: base: close: unloading component orted\
[grsacc18:20715] mca: base: components_register: registering state components\
[grsacc18:20715] mca: base: components_register: found loaded component app\
[grsacc18:20715] mca: base: components_register: component app has no register or open function\
[grsacc18:20715] mca: base: components_register: found loaded component hnp\
[grsacc18:20715] mca: base: components_register: component hnp has no register or open function\
[grsacc18:20715] mca: base: components_register: found loaded component novm\
[grsacc18:20715] mca: base: components_register: component novm register function successful\
[grsacc19:00432] [[10809,0],1] FORCE-TERMINATE AT errmgr_default_orted.c:259\
[grsacc18:20715] mca: base: components_register: found loaded component orted\
[grsacc18:20715] mca: base: components_register: component orted has no register or open function\
[grsacc18:20715] mca: base: components_register: found loaded component staged_hnp\
[grsacc19:00432] [[10809,0],1] ACTIVATE JOB NULL STATE FORCED EXIT AT errmgr_default_orted.c:259\
[grsacc19:00432] [[10809,0],1] ACTIVATING JOB NULL STATE FORCED EXIT PRI 0\
[grsacc19:00432] mca: base: close: component default_orted closed\
[grsacc18:20715] mca: base: components_register: component staged_hnp has no register or open function\
[grsacc18:20715] mca: base: components_register: found loaded component staged_orted\
[grsacc18:20715] mca: base: components_register: component staged_orted has no register or open function\
[grsacc19:00432] mca: base: close: unloading component default_orted\
[grsacc19:00432] [[10809,0],1] plm:base:receive stop comm\
[grsacc19:00432] mca: base: close: component orted closed\
[grsacc19:00432] mca: base: close: unloading component orted\
[grsacc18:20715] mca: base: components_open: opening state components\
[grsacc18:20715] mca: base: components_open: found loaded component app\
[grsacc18:20715] mca: base: components_open: component app open function successful\
[grsacc18:20715] mca: base: components_open: found loaded component hnp\
[grsacc18:20715] mca: base: components_open: component hnp open function successful\
[grsacc18:20715] mca: base: components_open: found loaded component novm\
[grsacc18:20715] mca: base: components_open: component novm open function successful\
[grsacc18:20715] mca: base: components_open: found loaded component orted\
[grsacc18:20715] mca: base: components_open: component orted open function successful\
[grsacc18:20715] mca: base: components_open: found loaded component staged_hnp\
[grsacc18:20715] mca: base: components_open: component staged_hnp open function successful\
[grsacc18:20715] mca: base: components_open: found loaded component staged_orted\
[grsacc18:20715] mca: base: components_open: component staged_orted open function successful\
[grsacc18:20715] mca:base:select: Auto-selecting state components\
[grsacc18:20715] mca:base:select:(state) Querying component [app]\
[grsacc18:20715] mca:base:select:(state) Skipping component [app]. Query failed to return a module\
[grsacc18:20715] mca:base:select:(state) Querying component [hnp]\
[grsacc18:20715] mca:base:select:(state) Skipping component [hnp]. Query failed to return a module\
[grsacc18:20715] mca:base:select:(state) Querying component [novm]\
[grsacc18:20715] mca:base:select:(state) Skipping component [novm]. Query failed to return a module\
[grsacc18:20715] mca:base:select:(state) Querying component [orted]\
[grsacc18:20715] mca:base:select:(state) Query of component [orted] set priority to 100\
[grsacc18:20715] mca:base:select:(state) Querying component [staged_hnp]\
[grsacc18:20715] mca:base:select:(state) Skipping component [staged_hnp]. Query failed to return a module\
[grsacc18:20715] mca:base:select:(state) Querying component [staged_orted]\
[grsacc18:20715] mca:base:select:(state) Skipping component [staged_orted]. Query failed to return a module\
[grsacc18:20715] mca:base:select:(state) Selected component [orted]\
[grsacc18:20715] mca: base: close: component app closed\
[grsacc18:20715] mca: base: close: unloading component app\
[grsacc18:20715] mca: base: close: component hnp closed\
[grsacc18:20715] mca: base: close: unloading component hnp\
[grsacc18:20715] mca: base: close: component novm closed\
[grsacc18:20715] mca: base: close: unloading component novm\
[grsacc18:20715] mca: base: close: component staged_hnp closed\
[grsacc18:20715] mca: base: close: unloading component staged_hnp\
[grsacc18:20715] mca: base: close: component staged_orted closed\
[grsacc18:20715] mca: base: close: unloading component staged_orted\
[grsacc18:20715] ORTE_JOB_STATE_MACHINE:\
[grsacc18:20715] State: LOCAL LAUNCH COMPLETE cbfunc: DEFINED\
[grsacc18:20715] State: FORCED EXIT cbfunc: DEFINED\
[grsacc18:20715] State: DAEMONS TERMINATED cbfunc: DEFINED\
[grsacc18:20715] ORTE_PROC_STATE_MACHINE:\
[grsacc18:20715] State: RUNNING cbfunc: DEFINED\
[grsacc18:20715] State: SYNC REGISTERED cbfunc: DEFINED\
[grsacc18:20715] State: IOF COMPLETE cbfunc: DEFINED\
[grsacc18:20715] State: WAITPID FIRED cbfunc: DEFINED\
[grsacc18:20715] mca: base: components_register: registering errmgr components\
[grsacc18:20715] mca: base: components_register: found loaded component default_app\
[grsacc18:20715] mca: base: components_register: component default_app register function successful\
[grsacc18:20715] mca: base: components_register: found loaded component default_hnp\
[grsacc18:20715] mca: base: components_register: component default_hnp register function successful\
[grsacc18:20715] mca: base: components_register: found loaded component default_orted\
[grsacc18:20715] mca: base: components_register: component default_orted register function successful\
[grsacc18:20715] mca: base: components_open: opening errmgr components\
[grsacc18:20715] mca: base: components_open: found loaded component default_app\
[grsacc18:20715] mca: base: components_open: component default_app open function successful\
[grsacc18:20715] mca: base: components_open: found loaded component default_hnp\
[grsacc18:20715] mca: base: components_open: component default_hnp open function successful\
[grsacc18:20715] mca: base: components_open: found loaded component default_orted\
[grsacc18:20715] mca: base: components_open: component default_orted open function successful\
[grsacc18:20715] mca:base:select:( plm) Querying component [rsh]\
[grsacc18:20715] [[10809,0],2] plm:rsh_lookup on agent ssh : rsh path NULL\
[grsacc18:20715] mca:base:select:( plm) Query of component [rsh] set priority to 10\
[grsacc18:20715] mca:base:select:( plm) Selected component [rsh]\
[grsacc18:20715] mca:base:select: Auto-selecting errmgr components\
[grsacc18:20715] mca:base:select:(errmgr) Querying component [default_app]\
[grsacc18:20715] mca:base:select:(errmgr) Skipping component [default_app]. Query failed to return a module\
[grsacc18:20715] mca:base:select:(errmgr) Querying component [default_hnp]\
[grsacc18:20715] mca:base:select:(errmgr) Skipping component [default_hnp]. Query failed to return a module\
[grsacc18:20715] mca:base:select:(errmgr) Querying component [default_orted]\
[grsacc18:20715] mca:base:select:(errmgr) Query of component [default_orted] set priority to 1000\
[grsacc18:20715] mca:base:select:(errmgr) Selected component [default_orted]\
[grsacc18:20715] mca: base: close: component default_app closed\
[grsacc18:20715] mca: base: close: unloading component default_app\
[grsacc18:20715] mca: base: close: component default_hnp closed\
[grsacc18:20715] mca: base: close: unloading component default_hnp\
[grsacc18:20715] [[10809,0],2] plm:rsh_setup on agent ssh : rsh path NULL\
[grsacc18:20715] [[10809,0],2] plm:base:receive start comm\
[grsacc20:09163] [[10809,0],0] plm:base:orted_report_launch from daemon [[10809,0],2]\
[grsacc20:09163] [[10809,0],0] plm:base:orted_report_launch from daemon [[10809,0],2] on node grsacc18\
[grsacc20:09163] [[10809,0],0] plm:base:orted_report_launch completed for daemon [[10809,0],2] at contact 708378624.2;tcp://192.168.222.18:41110\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,2] STATE ALL DAEMONS REPORTED AT base/plm_base_launch_support.c:842\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,2] STATE ALL DAEMONS REPORTED PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,2] STATE VM READY AT base/plm_base_launch_support.c:170\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,2] STATE VM READY PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,2] STATE PENDING MAPPING AT base/plm_base_launch_support.c:207\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,2] STATE PENDING MAPPING PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,2] STATE MAP COMPLETE AT base/rmaps_base_map_job.c:316\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,2] STATE MAP COMPLETE PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,2] STATE PENDING FINAL SYSTEM PREP AT base/plm_base_launch_support.c:233\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,2] STATE PENDING FINAL SYSTEM PREP PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,2] STATE PENDING APP LAUNCH AT base/plm_base_launch_support.c:410\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,2] STATE PENDING APP LAUNCH PRI 4\
[grsacc20:09163] [[10809,0],0] plm:base:launch_apps for job [10809,2]\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB [10809,2] STATE LOCAL LAUNCH COMPLETE AT base/odls_base_default_fns.c:1593\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB [10809,2] STATE LOCAL LAUNCH COMPLETE PRI 4\
[grsacc20:09163] [[10809,0],0] ACTIVATE PROC [[10809,0],1] STATE COMMUNICATION FAILURE AT oob_tcp_component.c:1104\
[grsacc20:09163] [[10809,0],0] ACTIVATING PROC [[10809,0],1] STATE COMMUNICATION FAILURE PRI 0\
[grsacc20:09163] [[10809,0],0] errmgr:default_hnp: for proc [[10809,0],1] state COMMUNICATION FAILURE\
[grsacc20:09163] [[10809,0],0] Comm failure: daemon [[10809,0],1] - aborting\
[grsacc20:09163] [[10809,0],0] errmgr:default_hnp: abort called on job [10809,0]\
[grsacc20:09163] [[10809,0],0] errmgr:default_hnp: ordering orted termination\
[grsacc20:09163] [[10809,0],0] plm:base:orted_cmd sending orted_exit commands\
[grsacc20:09163] [[10809,0],0] ACTIVATE JOB NULL STATE DAEMONS TERMINATED AT orted/orted_comm.c:465\
[grsacc20:09163] [[10809,0],0] ACTIVATING JOB NULL STATE DAEMONS TERMINATED PRI 0\
[grsacc20:09163] [[10809,0],0] plm:base:receive stop comm\
[grsacc20:09163] mca: base: close: component default_hnp closed\
[grsacc20:09163] mca: base: close: unloading component default_hnp\
[grsacc20:09163] mca: base: close: component hnp closed\
[grsacc20:09163] mca: base: close: unloading component hnp\
-bash-4.1$\
[grsacc18:20715] [[10809,0],2] ACTIVATE PROC [[10809,2],0] STATE RUNNING AT base/odls_base_default_fns.c:1545\
[grsacc18:20715] [[10809,0],2] ACTIVATING PROC [[10809,2],0] STATE RUNNING PRI 4\
[grsacc18:20715] [[10809,0],2] ACTIVATE JOB [10809,2] STATE LOCAL LAUNCH COMPLETE AT base/odls_base_default_fns.c:1593\
[grsacc18:20715] [[10809,0],2] ACTIVATING JOB [10809,2] STATE LOCAL LAUNCH COMPLETE PRI 4\
[grsacc18:20715] [[10809,0],2] state:orted:track_procs called for proc [[10809,2],0] state RUNNING\
[grsacc18:20715] [[10809,0],2] state:orted:track_jobs sending local launch complete for job [10809,2]\
[grsacc18:20715] [[10809,0],2] ACTIVATE PROC [[10809,0],0] STATE LIFELINE LOST AT oob_tcp_component.c:1102\
[grsacc18:20715] [[10809,0],2] ACTIVATING PROC [[10809,0],0] STATE LIFELINE LOST PRI 0\
[grsacc18:20715] [[10809,0],2] errmgr:default_orted:proc_errors process [[10809,0],0] error state LIFELINE LOST\
[grsacc18:20715] [[10809,0],2] errmgr:orted lifeline lost - exiting\
[grsacc18:20715] [[10809,0],2] FORCE-TERMINATE AT errmgr_default_orted.c:259\
[grsacc18:20715] [[10809,0],2] ACTIVATE JOB NULL STATE FORCED EXIT AT errmgr_default_orted.c:259\
[grsacc18:20715] [[10809,0],2] ACTIVATING JOB NULL STATE FORCED EXIT PRI 0\
[grsacc18:20715] mca: base: close: component default_orted closed\
[grsacc18:20715] mca: base: close: unloading component default_orted\
[grsacc18:20715] [[10809,0],2] plm:base:receive stop comm\
[grsacc18:20715] mca: base: close: component orted closed\
[grsacc18:20715] mca: base: close: unloading component orted}
{\rtf1\ansi\ansicpg1252\cocoartf1038\cocoasubrtf360
{\fonttbl\f0\fswiss\fcharset0 Helvetica;}
{\colortbl;\red255\green255\blue255;}
\paperw11900\paperh16840\margl1440\margr1440\vieww9000\viewh8400\viewkind0
\pard\tx566\tx1133\tx1700\tx2267\tx2834\tx3401\tx3968\tx4535\tx5102\tx5669\tx6236\tx6803\ql\qnatural\pardirnatural
\f0\fs24 \cf0 [grsacc20:09171] mca: base: components_register: registering state components\
[grsacc20:09171] mca: base: components_register: found loaded component app\
[grsacc20:09171] mca: base: components_register: component app has no register or open function\
[grsacc20:09171] mca: base: components_register: found loaded component hnp\
[grsacc20:09171] mca: base: components_register: component hnp has no register or open function\
[grsacc20:09171] mca: base: components_register: found loaded component novm\
[grsacc20:09171] mca: base: components_register: component novm register function successful\
[grsacc20:09171] mca: base: components_register: found loaded component orted\
[grsacc20:09171] mca: base: components_register: component orted has no register or open function\
[grsacc20:09171] mca: base: components_register: found loaded component staged_hnp\
[grsacc20:09171] mca: base: components_register: component staged_hnp has no register or open function\
[grsacc20:09171] mca: base: components_register: found loaded component staged_orted\
[grsacc20:09171] mca: base: components_register: component staged_orted has no register or open function\
[grsacc20:09171] mca: base: components_open: opening state components\
[grsacc20:09171] mca: base: components_open: found loaded component app\
[grsacc20:09171] mca: base: components_open: component app open function successful\
[grsacc20:09171] mca: base: components_open: found loaded component hnp\
[grsacc20:09171] mca: base: components_open: component hnp open function successful\
[grsacc20:09171] mca: base: components_open: found loaded component novm\
[grsacc20:09171] mca: base: components_open: component novm open function successful\
[grsacc20:09171] mca: base: components_open: found loaded component orted\
[grsacc20:09171] mca: base: components_open: component orted open function successful\
[grsacc20:09171] mca: base: components_open: found loaded component staged_hnp\
[grsacc20:09171] mca: base: components_open: component staged_hnp open function successful\
[grsacc20:09171] mca: base: components_open: found loaded component staged_orted\
[grsacc20:09171] mca: base: components_open: component staged_orted open function successful\
[grsacc20:09171] mca:base:select: Auto-selecting state components\
[grsacc20:09171] mca:base:select:(state) Querying component [app]\
[grsacc20:09171] mca:base:select:(state) Skipping component [app]. Query failed to return a module\
[grsacc20:09171] mca:base:select:(state) Querying component [hnp]\
[grsacc20:09171] mca:base:select:(state) Query of component [hnp] set priority to 60\
[grsacc20:09171] mca:base:select:(state) Querying component [novm]\
[grsacc20:09171] mca:base:select:(state) Query of component [novm] set priority to 1000\
[grsacc20:09171] mca:base:select:(state) Querying component [orted]\
[grsacc20:09171] mca:base:select:(state) Skipping component [orted]. Query failed to return a module\
[grsacc20:09171] mca:base:select:(state) Querying component [staged_hnp]\
[grsacc20:09171] mca:base:select:(state) Skipping component [staged_hnp]. Query failed to return a module\
[grsacc20:09171] mca:base:select:(state) Querying component [staged_orted]\
[grsacc20:09171] mca:base:select:(state) Skipping component [staged_orted]. Query failed to return a module\
[grsacc20:09171] mca:base:select:(state) Selected component [novm]\
[grsacc20:09171] mca: base: close: component app closed\
[grsacc20:09171] mca: base: close: unloading component app\
[grsacc20:09171] mca: base: close: component hnp closed\
[grsacc20:09171] mca: base: close: unloading component hnp\
[grsacc20:09171] mca: base: close: component orted closed\
[grsacc20:09171] mca: base: close: unloading component orted\
[grsacc20:09171] mca: base: close: component staged_hnp closed\
[grsacc20:09171] mca: base: close: unloading component staged_hnp\
[grsacc20:09171] mca: base: close: component staged_orted closed\
[grsacc20:09171] mca: base: close: unloading component staged_orted\
[grsacc20:09171] ORTE_JOB_STATE_MACHINE:\
[grsacc20:09171] State: PENDING INIT cbfunc: DEFINED\
[grsacc20:09171] State: INIT_COMPLETE cbfunc: DEFINED\
[grsacc20:09171] State: PENDING ALLOCATION cbfunc: DEFINED\
[grsacc20:09171] State: ALLOCATION COMPLETE cbfunc: DEFINED\
[grsacc20:09171] State: DAEMONS LAUNCHED cbfunc: DEFINED\
[grsacc20:09171] State: ALL DAEMONS REPORTED cbfunc: DEFINED\
[grsacc20:09171] State: VM READY cbfunc: DEFINED\
[grsacc20:09171] State: PENDING MAPPING cbfunc: DEFINED\
[grsacc20:09171] State: MAP COMPLETE cbfunc: DEFINED\
[grsacc20:09171] State: PENDING FINAL SYSTEM PREP cbfunc: DEFINED\
[grsacc20:09171] State: PENDING APP LAUNCH cbfunc: DEFINED\
[grsacc20:09171] State: LOCAL LAUNCH COMPLETE cbfunc: DEFINED\
[grsacc20:09171] State: RUNNING cbfunc: DEFINED\
[grsacc20:09171] State: SYNC REGISTERED cbfunc: DEFINED\
[grsacc20:09171] State: NORMALLY TERMINATED cbfunc: DEFINED\
[grsacc20:09171] State: NOTIFY COMPLETED cbfunc: DEFINED\
[grsacc20:09171] State: ALL JOBS COMPLETE cbfunc: DEFINED\
[grsacc20:09171] State: DAEMONS TERMINATED cbfunc: DEFINED\
[grsacc20:09171] State: FORCED EXIT cbfunc: DEFINED\
[grsacc20:09171] State: REPORT PROGRESS cbfunc: DEFINED\
[grsacc20:09171] ORTE_PROC_STATE_MACHINE:\
[grsacc20:09171] State: RUNNING cbfunc: DEFINED\
[grsacc20:09171] State: SYNC REGISTERED cbfunc: DEFINED\
[grsacc20:09171] State: IOF COMPLETE cbfunc: DEFINED\
[grsacc20:09171] State: WAITPID FIRED cbfunc: DEFINED\
[grsacc20:09171] State: NORMALLY TERMINATED cbfunc: DEFINED\
[grsacc20:09171] mca: base: components_register: registering errmgr components\
[grsacc20:09171] mca: base: components_register: found loaded component default_app\
[grsacc20:09171] mca: base: components_register: component default_app register function successful\
[grsacc20:09171] mca: base: components_register: found loaded component default_hnp\
[grsacc20:09171] mca: base: components_register: component default_hnp register function successful\
[grsacc20:09171] mca: base: components_register: found loaded component default_orted\
[grsacc20:09171] mca: base: components_register: component default_orted register function successful\
[grsacc20:09171] mca: base: components_open: opening errmgr components\
[grsacc20:09171] mca: base: components_open: found loaded component default_app\
[grsacc20:09171] mca: base: components_open: component default_app open function successful\
[grsacc20:09171] mca: base: components_open: found loaded component default_hnp\
[grsacc20:09171] mca: base: components_open: component default_hnp open function successful\
[grsacc20:09171] mca: base: components_open: found loaded component default_orted\
[grsacc20:09171] mca: base: components_open: component default_orted open function successful\
[grsacc20:09171] mca:base:select:( plm) Querying component [rsh]\
[grsacc20:09171] [[INVALID],INVALID] plm:rsh_lookup on agent ssh : rsh path NULL\
[grsacc20:09171] mca:base:select:( plm) Query of component [rsh] set priority to 10\
[grsacc20:09171] mca:base:select:( plm) Querying component [slurm]\
[grsacc20:09171] mca:base:select:( plm) Skipping component [slurm]. Query failed to return a module\
[grsacc20:09171] mca:base:select:( plm) Querying component [tm]\
[grsacc20:09171] mca:base:select:( plm) Query of component [tm] set priority to 75\
[grsacc20:09171] mca:base:select:( plm) Selected component [tm]\
[grsacc20:09171] plm:base:set_hnp_name: initial bias 9171 nodename hash 2203159203\
[grsacc20:09171] plm:base:set_hnp_name: final jobfam 10785\
[grsacc20:09171] mca:base:select: Auto-selecting errmgr components\
[grsacc20:09171] mca:base:select:(errmgr) Querying component [default_app]\
[grsacc20:09171] mca:base:select:(errmgr) Skipping component [default_app]. Query failed to return a module\
[grsacc20:09171] mca:base:select:(errmgr) Querying component [default_hnp]\
[grsacc20:09171] mca:base:select:(errmgr) Query of component [default_hnp] set priority to 1000\
[grsacc20:09171] mca:base:select:(errmgr) Querying component [default_orted]\
[grsacc20:09171] mca:base:select:(errmgr) Skipping component [default_orted]. Query failed to return a module\
[grsacc20:09171] mca:base:select:(errmgr) Selected component [default_hnp]\
[grsacc20:09171] mca: base: close: component default_app closed\
[grsacc20:09171] mca: base: close: unloading component default_app\
[grsacc20:09171] mca: base: close: component default_orted closed\
[grsacc20:09171] mca: base: close: unloading component default_orted\
[grsacc20:09171] [[10785,0],0] plm:base:receive start comm\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [INVALID] STATE PENDING INIT AT plm_tm_module.c:157\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [INVALID] STATE PENDING INIT PRI 4\
[grsacc20:09171] [[10785,0],0] plm:base:setup_job\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,1] STATE INIT_COMPLETE AT base/plm_base_launch_support.c:315\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,1] STATE INIT_COMPLETE PRI 4\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,1] STATE PENDING ALLOCATION AT base/plm_base_launch_support.c:326\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,1] STATE PENDING ALLOCATION PRI 4\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,1] STATE ALLOCATION COMPLETE AT base/ras_base_allocate.c:421\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,1] STATE ALLOCATION COMPLETE PRI 4\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,1] STATE PENDING MAPPING AT state_novm.c:235\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,1] STATE PENDING MAPPING PRI 4\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,1] STATE MAP COMPLETE AT base/rmaps_base_map_job.c:316\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,1] STATE MAP COMPLETE PRI 4\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,1] STATE PENDING DAEMON LAUNCH AT state_novm.c:250\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,1] STATE PENDING DAEMON LAUNCH PRI 4\
[grsacc20:09171] [[10785,0],0] plm:base:setup_vm\
[grsacc20:09171] [[10785,0],0] plm:base:setup_vm creating map\
[grsacc20:09171] [[10785,0],0] plm:base:setup_vm add new daemon [[10785,0],1]\
[grsacc20:09171] [[10785,0],0] plm:base:setup_vm assigning new daemon [[10785,0],1] to node grsacc19\
[grsacc20:09171] [[10785,0],0] plm:tm: launching vm\
[grsacc20:09171] [[10785,0],0] plm:tm: final top-level argv:\
orted -mca ess tm -mca orte_ess_jobid 706805760 -mca orte_ess_vpid <template> -mca orte_ess_num_procs 2 -mca orte_hnp_uri "706805760.0;tcp://192.168.222.20:46019" -mca state_base_verbose 10 -mca errmgr_base_verbose 10 -mca plm_base_verbose 5 -mca btl tcp,sm,self\
[grsacc20:09171] [[10785,0],0] plm:tm: launching on node grsacc19\
[grsacc20:09171] [[10785,0],0] plm:tm: executing:\
orted -mca ess tm -mca orte_ess_jobid 706805760 -mca orte_ess_vpid 1 -mca orte_ess_num_procs 2 -mca orte_hnp_uri "706805760.0;tcp://192.168.222.20:46019" -mca state_base_verbose 10 -mca errmgr_base_verbose 10 -mca plm_base_verbose 5 -mca btl tcp,sm,self\
[grsacc20:09171] [[10785,0],0] plm:tm:launch: finished spawning orteds\
[grsacc19:00462] mca: base: components_register: registering state components\
[grsacc19:00462] mca: base: components_register: found loaded component app\
[grsacc19:00462] mca: base: components_register: component app has no register or open function\
[grsacc19:00462] mca: base: components_register: found loaded component hnp\
[grsacc19:00462] mca: base: components_register: component hnp has no register or open function\
[grsacc19:00462] mca: base: components_register: found loaded component novm\
[grsacc19:00462] mca: base: components_register: component novm register function successful\
[grsacc19:00462] mca: base: components_register: found loaded component orted\
[grsacc19:00462] mca: base: components_register: component orted has no register or open function\
[grsacc19:00462] mca: base: components_register: found loaded component staged_hnp\
[grsacc19:00462] mca: base: components_register: component staged_hnp has no register or open function\
[grsacc19:00462] mca: base: components_register: found loaded component staged_orted\
[grsacc19:00462] mca: base: components_register: component staged_orted has no register or open function\
[grsacc19:00462] mca: base: components_open: opening state components\
[grsacc19:00462] mca: base: components_open: found loaded component app\
[grsacc19:00462] mca: base: components_open: component app open function successful\
[grsacc19:00462] mca: base: components_open: found loaded component hnp\
[grsacc19:00462] mca: base: components_open: component hnp open function successful\
[grsacc19:00462] mca: base: components_open: found loaded component novm\
[grsacc19:00462] mca: base: components_open: component novm open function successful\
[grsacc19:00462] mca: base: components_open: found loaded component orted\
[grsacc19:00462] mca: base: components_open: component orted open function successful\
[grsacc19:00462] mca: base: components_open: found loaded component staged_hnp\
[grsacc19:00462] mca: base: components_open: component staged_hnp open function successful\
[grsacc19:00462] mca: base: components_open: found loaded component staged_orted\
[grsacc19:00462] mca: base: components_open: component staged_orted open function successful\
[grsacc19:00462] mca:base:select: Auto-selecting state components\
[grsacc19:00462] mca:base:select:(state) Querying component [app]\
[grsacc19:00462] mca:base:select:(state) Skipping component [app]. Query failed to return a module\
[grsacc19:00462] mca:base:select:(state) Querying component [hnp]\
[grsacc19:00462] mca:base:select:(state) Skipping component [hnp]. Query failed to return a module\
[grsacc19:00462] mca:base:select:(state) Querying component [novm]\
[grsacc19:00462] mca:base:select:(state) Skipping component [novm]. Query failed to return a module\
[grsacc19:00462] mca:base:select:(state) Querying component [orted]\
[grsacc19:00462] mca:base:select:(state) Query of component [orted] set priority to 100\
[grsacc19:00462] mca:base:select:(state) Querying component [staged_hnp]\
[grsacc19:00462] mca:base:select:(state) Skipping component [staged_hnp]. Query failed to return a module\
[grsacc19:00462] mca:base:select:(state) Querying component [staged_orted]\
[grsacc19:00462] mca:base:select:(state) Skipping component [staged_orted]. Query failed to return a module\
[grsacc19:00462] mca:base:select:(state) Selected component [orted]\
[grsacc19:00462] mca: base: close: component app closed\
[grsacc19:00462] mca: base: close: unloading component app\
[grsacc19:00462] mca: base: close: component hnp closed\
[grsacc19:00462] mca: base: close: unloading component hnp\
[grsacc19:00462] mca: base: close: component novm closed\
[grsacc19:00462] mca: base: close: unloading component novm\
[grsacc19:00462] mca: base: close: component staged_hnp closed\
[grsacc19:00462] mca: base: close: unloading component staged_hnp\
[grsacc19:00462] mca: base: close: component staged_orted closed\
[grsacc19:00462] mca: base: close: unloading component staged_orted\
[grsacc19:00462] ORTE_JOB_STATE_MACHINE:\
[grsacc19:00462] State: LOCAL LAUNCH COMPLETE cbfunc: DEFINED\
[grsacc19:00462] State: FORCED EXIT cbfunc: DEFINED\
[grsacc19:00462] State: DAEMONS TERMINATED cbfunc: DEFINED\
[grsacc19:00462] ORTE_PROC_STATE_MACHINE:\
[grsacc19:00462] State: RUNNING cbfunc: DEFINED\
[grsacc19:00462] State: SYNC REGISTERED cbfunc: DEFINED\
[grsacc19:00462] State: IOF COMPLETE cbfunc: DEFINED\
[grsacc19:00462] State: WAITPID FIRED cbfunc: DEFINED\
[grsacc19:00462] mca: base: components_register: registering errmgr components\
[grsacc19:00462] mca: base: components_register: found loaded component default_app\
[grsacc19:00462] mca: base: components_register: component default_app register function successful\
[grsacc19:00462] mca: base: components_register: found loaded component default_hnp\
[grsacc19:00462] mca: base: components_register: component default_hnp register function successful\
[grsacc19:00462] mca: base: components_register: found loaded component default_orted\
[grsacc19:00462] mca: base: components_register: component default_orted register function successful\
[grsacc19:00462] mca: base: components_open: opening errmgr components\
[grsacc19:00462] mca: base: components_open: found loaded component default_app\
[grsacc19:00462] mca: base: components_open: component default_app open function successful\
[grsacc19:00462] mca: base: components_open: found loaded component default_hnp\
[grsacc19:00462] mca: base: components_open: component default_hnp open function successful\
[grsacc19:00462] mca: base: components_open: found loaded component default_orted\
[grsacc19:00462] mca: base: components_open: component default_orted open function successful\
[grsacc19:00462] mca:base:select:( plm) Querying component [rsh]\
[grsacc19:00462] [[10785,0],1] plm:rsh_lookup on agent ssh : rsh path NULL\
[grsacc19:00462] mca:base:select:( plm) Query of component [rsh] set priority to 10\
[grsacc19:00462] mca:base:select:( plm) Selected component [rsh]\
[grsacc19:00462] mca:base:select: Auto-selecting errmgr components\
[grsacc19:00462] mca:base:select:(errmgr) Querying component [default_app]\
[grsacc19:00462] mca:base:select:(errmgr) Skipping component [default_app]. Query failed to return a module\
[grsacc19:00462] mca:base:select:(errmgr) Querying component [default_hnp]\
[grsacc19:00462] mca:base:select:(errmgr) Skipping component [default_hnp]. Query failed to return a module\
[grsacc19:00462] mca:base:select:(errmgr) Querying component [default_orted]\
[grsacc19:00462] mca:base:select:(errmgr) Query of component [default_orted] set priority to 1000\
[grsacc19:00462] mca:base:select:(errmgr) Selected component [default_orted]\
[grsacc19:00462] mca: base: close: component default_app closed\
[grsacc19:00462] mca: base: close: unloading component default_app\
[grsacc19:00462] mca: base: close: component default_hnp closed\
[grsacc19:00462] mca: base: close: unloading component default_hnp\
[grsacc19:00462] [[10785,0],1] plm:rsh_setup on agent ssh : rsh path NULL\
[grsacc19:00462] [[10785,0],1] plm:base:receive start comm\
[grsacc20:09171] [[10785,0],0] plm:base:orted_report_launch from daemon [[10785,0],1]\
[grsacc20:09171] [[10785,0],0] plm:base:orted_report_launch from daemon [[10785,0],1] on node grsacc19\
[grsacc20:09171] [[10785,0],0] RECEIVED TOPOLOGY FROM NODE grsacc19\
[grsacc20:09171] [[10785,0],0] NEW TOPOLOGY - ADDING\
[grsacc20:09171] [[10785,0],0] plm:base:orted_report_launch completed for daemon [[10785,0],1] at contact 706805760.1;tcp://192.168.222.19:36342\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,1] STATE ALL DAEMONS REPORTED AT base/plm_base_launch_support.c:842\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,1] STATE ALL DAEMONS REPORTED PRI 4\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,1] STATE VM READY AT base/plm_base_launch_support.c:170\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,1] STATE VM READY PRI 4\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,1] STATE PENDING FINAL SYSTEM PREP AT state_novm.c:265\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,1] STATE PENDING FINAL SYSTEM PREP PRI 4\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,1] STATE PENDING APP LAUNCH AT base/plm_base_launch_support.c:410\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,1] STATE PENDING APP LAUNCH PRI 4\
[grsacc20:09171] [[10785,0],0] plm:base:launch_apps for job [10785,1]\
[grsacc20:09171] [[10785,0],0] ACTIVATE PROC [[10785,1],0] STATE RUNNING AT base/odls_base_default_fns.c:1545\
[grsacc20:09171] [[10785,0],0] ACTIVATING PROC [[10785,1],0] STATE RUNNING PRI 4\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,1] STATE LOCAL LAUNCH COMPLETE AT base/odls_base_default_fns.c:1593\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,1] STATE LOCAL LAUNCH COMPLETE PRI 4\
[grsacc20:09171] [[10785,0],0] state:base:track_procs called for proc [[10785,1],0] state RUNNING\
[grsacc20:09173] mca: base: components_register: registering state components\
[grsacc20:09173] mca: base: components_register: found loaded component app\
[grsacc20:09173] mca: base: components_register: component app has no register or open function\
[grsacc20:09173] mca: base: components_register: found loaded component hnp\
[grsacc20:09173] mca: base: components_register: component hnp has no register or open function\
[grsacc20:09173] mca: base: components_register: found loaded component novm\
[grsacc20:09173] mca: base: components_register: component novm register function successful\
[grsacc20:09173] mca: base: components_register: found loaded component orted\
[grsacc20:09173] mca: base: components_register: component orted has no register or open function\
[grsacc20:09173] mca: base: components_register: found loaded component staged_hnp\
[grsacc20:09173] mca: base: components_register: component staged_hnp has no register or open function\
[grsacc20:09173] mca: base: components_register: found loaded component staged_orted\
[grsacc20:09173] mca: base: components_register: component staged_orted has no register or open function\
[grsacc20:09173] mca: base: components_open: opening state components\
[grsacc20:09173] mca: base: components_open: found loaded component app\
[grsacc20:09173] mca: base: components_open: component app open function successful\
[grsacc20:09173] mca: base: components_open: found loaded component hnp\
[grsacc20:09173] mca: base: components_open: component hnp open function successful\
[grsacc20:09173] mca: base: components_open: found loaded component novm\
[grsacc20:09173] mca: base: components_open: component novm open function successful\
[grsacc20:09173] mca: base: components_open: found loaded component orted\
[grsacc20:09173] mca: base: components_open: component orted open function successful\
[grsacc20:09173] mca: base: components_open: found loaded component staged_hnp\
[grsacc20:09173] mca: base: components_open: component staged_hnp open function successful\
[grsacc20:09173] mca: base: components_open: found loaded component staged_orted\
[grsacc20:09173] mca: base: components_open: component staged_orted open function successful\
[grsacc20:09173] mca:base:select: Auto-selecting state components\
[grsacc20:09173] mca:base:select:(state) Querying component [app]\
[grsacc20:09173] mca:base:select:(state) Query of component [app] set priority to 1000\
[grsacc20:09173] mca:base:select:(state) Querying component [hnp]\
[grsacc20:09173] mca:base:select:(state) Skipping component [hnp]. Query failed to return a module\
[grsacc20:09173] mca:base:select:(state) Querying component [novm]\
[grsacc20:09173] mca:base:select:(state) Skipping component [novm]. Query failed to return a module\
[grsacc20:09173] mca:base:select:(state) Querying component [orted]\
[grsacc20:09173] mca:base:select:(state) Skipping component [orted]. Query failed to return a module\
[grsacc20:09173] mca:base:select:(state) Querying component [staged_hnp]\
[grsacc20:09173] mca:base:select:(state) Skipping component [staged_hnp]. Query failed to return a module\
[grsacc20:09173] mca:base:select:(state) Querying component [staged_orted]\
[grsacc20:09173] mca:base:select:(state) Skipping component [staged_orted]. Query failed to return a module\
[grsacc20:09173] mca:base:select:(state) Selected component [app]\
[grsacc20:09173] mca: base: close: component hnp closed\
[grsacc20:09173] mca: base: close: unloading component hnp\
[grsacc20:09173] mca: base: close: component novm closed\
[grsacc20:09173] mca: base: close: unloading component novm\
[grsacc20:09173] mca: base: close: component orted closed\
[grsacc20:09173] mca: base: close: unloading component orted\
[grsacc20:09173] mca: base: close: component staged_hnp closed\
[grsacc20:09173] mca: base: close: unloading component staged_hnp\
[grsacc20:09173] mca: base: close: component staged_orted closed\
[grsacc20:09173] mca: base: close: unloading component staged_orted\
[grsacc20:09173] mca: base: components_register: registering errmgr components\
[grsacc20:09173] mca: base: components_register: found loaded component default_app\
[grsacc20:09173] mca: base: components_register: component default_app register function successful\
[grsacc20:09173] mca: base: components_register: found loaded component default_hnp\
[grsacc20:09173] mca: base: components_register: component default_hnp register function successful\
[grsacc20:09173] mca: base: components_register: found loaded component default_orted\
[grsacc20:09173] mca: base: components_register: component default_orted register function successful\
[grsacc20:09173] mca: base: components_open: opening errmgr components\
[grsacc20:09173] mca: base: components_open: found loaded component default_app\
[grsacc20:09173] mca: base: components_open: component default_app open function successful\
[grsacc20:09173] mca: base: components_open: found loaded component default_hnp\
[grsacc20:09173] mca: base: components_open: component default_hnp open function successful\
[grsacc20:09173] mca: base: components_open: found loaded component default_orted\
[grsacc20:09173] mca: base: components_open: component default_orted open function successful\
[grsacc20:09173] mca:base:select: Auto-selecting errmgr components\
[grsacc20:09173] mca:base:select:(errmgr) Querying component [default_app]\
[grsacc20:09173] mca:base:select:(errmgr) Query of component [default_app] set priority to 1000\
[grsacc20:09173] mca:base:select:(errmgr) Querying component [default_hnp]\
[grsacc20:09173] mca:base:select:(errmgr) Skipping component [default_hnp]. Query failed to return a module\
[grsacc20:09173] mca:base:select:(errmgr) Querying component [default_orted]\
[grsacc20:09173] mca:base:select:(errmgr) Skipping component [default_orted]. Query failed to return a module\
[grsacc20:09173] mca:base:select:(errmgr) Selected component [default_app]\
[grsacc20:09173] mca: base: close: component default_hnp closed\
[grsacc20:09173] mca: base: close: unloading component default_hnp\
[grsacc20:09173] mca: base: close: component default_orted closed\
[grsacc20:09173] mca: base: close: unloading component default_orted\
[grsacc19:00462] [[10785,0],1] ACTIVATE PROC [[10785,1],1] STATE RUNNING AT base/odls_base_default_fns.c:1545\
[grsacc19:00462] [[10785,0],1] ACTIVATING PROC [[10785,1],1] STATE RUNNING PRI 4\
[grsacc19:00462] [[10785,0],1] ACTIVATE JOB [10785,1] STATE LOCAL LAUNCH COMPLETE AT base/odls_base_default_fns.c:1593\
[grsacc19:00462] [[10785,0],1] ACTIVATING JOB [10785,1] STATE LOCAL LAUNCH COMPLETE PRI 4\
[grsacc19:00462] [[10785,0],1] state:orted:track_procs called for proc [[10785,1],1] state RUNNING\
[grsacc19:00462] [[10785,0],1] state:orted:track_jobs sending local launch complete for job [10785,1]\
[grsacc20:09171] [[10785,0],0] plm:base:receive processing msg\
[grsacc20:09171] [[10785,0],0] plm:base:receive update proc state command from [[10785,0],1]\
[grsacc20:09171] [[10785,0],0] plm:base:receive got update_proc_state for job [10785,1]\
[grsacc20:09171] [[10785,0],0] plm:base:receive got update_proc_state for vpid 1 state RUNNING exit_code 0\
[grsacc20:09171] [[10785,0],0] ACTIVATE PROC [[10785,1],1] STATE RUNNING AT base/plm_base_receive.c:296\
[grsacc20:09171] [[10785,0],0] ACTIVATING PROC [[10785,1],1] STATE RUNNING PRI 4\
[grsacc20:09171] [[10785,0],0] plm:base:receive done processing commands\
[grsacc19:00463] mca: base: components_register: registering state components\
[grsacc19:00463] mca: base: components_register: found loaded component app\
[grsacc19:00463] mca: base: components_register: component app has no register or open function\
[grsacc19:00463] mca: base: components_register: found loaded component hnp\
[grsacc19:00463] mca: base: components_register: component hnp has no register or open function\
[grsacc19:00463] mca: base: components_register: found loaded component novm\
[grsacc19:00463] mca: base: components_register: component novm register function successful\
[grsacc19:00463] mca: base: components_register: found loaded component orted\
[grsacc19:00463] mca: base: components_register: component orted has no register or open function\
[grsacc19:00463] mca: base: components_register: found loaded component staged_hnp\
[grsacc19:00463] mca: base: components_register: component staged_hnp has no register or open function\
[grsacc19:00463] mca: base: components_register: found loaded component staged_orted\
[grsacc19:00463] mca: base: components_register: component staged_orted has no register or open function\
[grsacc19:00463] mca: base: components_open: opening state components\
[grsacc19:00463] mca: base: components_open: found loaded component app\
[grsacc19:00463] mca: base: components_open: component app open function successful\
[grsacc19:00463] mca: base: components_open: found loaded component hnp\
[grsacc19:00463] mca: base: components_open: component hnp open function successful\
[grsacc19:00463] mca: base: components_open: found loaded component novm\
[grsacc19:00463] mca: base: components_open: component novm open function successful\
[grsacc19:00463] mca: base: components_open: found loaded component orted\
[grsacc19:00463] mca: base: components_open: component orted open function successful\
[grsacc19:00463] mca: base: components_open: found loaded component staged_hnp\
[grsacc19:00463] mca: base: components_open: component staged_hnp open function successful\
[grsacc19:00463] mca: base: components_open: found loaded component staged_orted\
[grsacc19:00463] mca: base: components_open: component staged_orted open function successful\
[grsacc19:00463] mca:base:select: Auto-selecting state components\
[grsacc19:00463] mca:base:select:(state) Querying component [app]\
[grsacc19:00463] mca:base:select:(state) Query of component [app] set priority to 1000\
[grsacc19:00463] mca:base:select:(state) Querying component [hnp]\
[grsacc19:00463] mca:base:select:(state) Skipping component [hnp]. Query failed to return a module\
[grsacc19:00463] mca:base:select:(state) Querying component [novm]\
[grsacc19:00463] mca:base:select:(state) Skipping component [novm]. Query failed to return a module\
[grsacc19:00463] mca:base:select:(state) Querying component [orted]\
[grsacc19:00463] mca:base:select:(state) Skipping component [orted]. Query failed to return a module\
[grsacc19:00463] mca:base:select:(state) Querying component [staged_hnp]\
[grsacc19:00463] mca:base:select:(state) Skipping component [staged_hnp]. Query failed to return a module\
[grsacc19:00463] mca:base:select:(state) Querying component [staged_orted]\
[grsacc19:00463] mca:base:select:(state) Skipping component [staged_orted]. Query failed to return a module\
[grsacc19:00463] mca:base:select:(state) Selected component [app]\
[grsacc19:00463] mca: base: close: component hnp closed\
[grsacc19:00463] mca: base: close: unloading component hnp\
[grsacc19:00463] mca: base: close: component novm closed\
[grsacc19:00463] mca: base: close: unloading component novm\
[grsacc19:00463] mca: base: close: component orted closed\
[grsacc19:00463] mca: base: close: unloading component orted\
[grsacc19:00463] mca: base: close: component staged_hnp closed\
[grsacc19:00463] mca: base: close: unloading component staged_hnp\
[grsacc19:00463] mca: base: close: component staged_orted closed\
[grsacc19:00463] mca: base: close: unloading component staged_orted\
[grsacc20:09171] [[10785,0],0] state:base:track_procs called for proc [[10785,1],1] state RUNNING\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,1] STATE RUNNING AT base/state_base_fns.c:482\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,1] STATE RUNNING PRI 4\
[grsacc20:09171] [[10785,0],0] plm:base:launch wiring up iof for job [10785,1]\
[grsacc19:00463] mca: base: components_register: registering errmgr components\
[grsacc19:00463] mca: base: components_register: found loaded component default_app\
[grsacc19:00463] mca: base: components_register: component default_app register function successful\
[grsacc19:00463] mca: base: components_register: found loaded component default_hnp\
[grsacc19:00463] mca: base: components_register: component default_hnp register function successful\
[grsacc19:00463] mca: base: components_register: found loaded component default_orted\
[grsacc19:00463] mca: base: components_register: component default_orted register function successful\
[grsacc19:00463] mca: base: components_open: opening errmgr components\
[grsacc19:00463] mca: base: components_open: found loaded component default_app\
[grsacc19:00463] mca: base: components_open: component default_app open function successful\
[grsacc19:00463] mca: base: components_open: found loaded component default_hnp\
[grsacc19:00463] mca: base: components_open: component default_hnp open function successful\
[grsacc19:00463] mca: base: components_open: found loaded component default_orted\
[grsacc19:00463] mca: base: components_open: component default_orted open function successful\
[grsacc20:09171] [[10785,0],0] ACTIVATE PROC [[10785,1],0] STATE SYNC REGISTERED AT base/odls_base_default_fns.c:1836\
[grsacc20:09171] [[10785,0],0] ACTIVATING PROC [[10785,1],0] STATE SYNC REGISTERED PRI 4\
[grsacc20:09171] [[10785,0],0] state:base:track_procs called for proc [[10785,1],0] state SYNC REGISTERED\
[grsacc19:00463] mca:base:select: Auto-selecting errmgr components\
[grsacc19:00463] mca:base:select:(errmgr) Querying component [default_app]\
[grsacc19:00463] mca:base:select:(errmgr) Query of component [default_app] set priority to 1000\
[grsacc19:00463] mca:base:select:(errmgr) Querying component [default_hnp]\
[grsacc19:00463] mca:base:select:(errmgr) Skipping component [default_hnp]. Query failed to return a module\
[grsacc19:00463] mca:base:select:(errmgr) Querying component [default_orted]\
[grsacc19:00463] mca:base:select:(errmgr) Skipping component [default_orted]. Query failed to return a module\
[grsacc19:00463] mca:base:select:(errmgr) Selected component [default_app]\
[grsacc19:00463] mca: base: close: component default_hnp closed\
[grsacc19:00463] mca: base: close: unloading component default_hnp\
[grsacc19:00463] mca: base: close: component default_orted closed\
[grsacc19:00463] mca: base: close: unloading component default_orted\
[grsacc19:00462] [[10785,0],1] ACTIVATE PROC [[10785,1],1] STATE SYNC REGISTERED AT base/odls_base_default_fns.c:1836\
[grsacc20:09171] [[10785,0],0] plm:base:receive processing msg\
[grsacc20:09171] [[10785,0],0] ACTIVATE PROC [[10785,1],1] STATE SYNC REGISTERED AT base/plm_base_receive.c:354\
[grsacc20:09171] [[10785,0],0] ACTIVATING PROC [[10785,1],1] STATE SYNC REGISTERED PRI 4\
[grsacc20:09171] [[10785,0],0] plm:base:receive done processing commands\
[grsacc20:09171] [[10785,0],0] state:base:track_procs called for proc [[10785,1],1] state SYNC REGISTERED\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,1] STATE SYNC REGISTERED AT base/state_base_fns.c:490\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,1] STATE SYNC REGISTERED PRI 4\
[grsacc20:09171] [[10785,0],0] plm:base:launch registered event\
[grsacc20:09171] [[10785,0],0] plm:base:launch job [10785,1] is not a dynamic spawn\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,1] STATE READY FOR DEBUGGERS AT base/plm_base_launch_support.c:609\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,1] STATE READY FOR DEBUGGERS PRI 4\
[grsacc19:00462] [[10785,0],1] ACTIVATING PROC [[10785,1],1] STATE SYNC REGISTERED PRI 4\
[grsacc19:00462] [[10785,0],1] state:orted:track_procs called for proc [[10785,1],1] state SYNC REGISTERED\
[grsacc19:00462] [[10785,0],1] state:orted: sending contact info to HNP\
[grsacc20:09171] [[10785,0],0] plm:base:receive processing msg\
[grsacc20:09171] [[10785,0],0] plm:base:receive job launch command from [[10785,1],0]\
[grsacc20:09171] [[10785,0],0] plm:base:receive adding hosts\
[grsacc20:09171] [[10785,0],0] plm:base:receive calling spawn\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [INVALID] STATE PENDING INIT AT plm_tm_module.c:157\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [INVALID] STATE PENDING INIT PRI 4\
[grsacc20:09171] [[10785,0],0] plm:base:receive done processing commands\
all info set, ready to spawn\
[grsacc20:09171] [[10785,0],0] plm:base:setup_job\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,2] STATE INIT_COMPLETE AT base/plm_base_launch_support.c:315\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,2] STATE INIT_COMPLETE PRI 4\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,2] STATE PENDING ALLOCATION AT base/plm_base_launch_support.c:326\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,2] STATE PENDING ALLOCATION PRI 4\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,2] STATE ALLOCATION COMPLETE AT base/ras_base_allocate.c:421\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,2] STATE ALLOCATION COMPLETE PRI 4\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,2] STATE PENDING MAPPING AT state_novm.c:235\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,2] STATE PENDING MAPPING PRI 4\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,2] STATE MAP COMPLETE AT base/rmaps_base_map_job.c:316\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,2] STATE MAP COMPLETE PRI 4\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,2] STATE PENDING DAEMON LAUNCH AT state_novm.c:250\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,2] STATE PENDING DAEMON LAUNCH PRI 4\
[grsacc20:09171] [[10785,0],0] plm:base:setup_vm\
[grsacc20:09171] [[10785,0],0] plm:base:setup_vm add new daemon [[10785,0],2]\
[grsacc20:09171] [[10785,0],0] plm:base:setup_vm assigning new daemon [[10785,0],2] to node grsacc18\
[grsacc20:09171] [[10785,0],0] plm:tm: launching vm\
[grsacc20:09171] [[10785,0],0] plm:tm: final top-level argv:\
orted -mca ess tm -mca orte_ess_jobid 706805760 -mca orte_ess_vpid <template> -mca orte_ess_num_procs 3 -mca orte_hnp_uri "706805760.0;tcp://192.168.222.20:46019" -mca state_base_verbose 10 -mca errmgr_base_verbose 10 -mca plm_base_verbose 5 -mca btl tcp,sm,self\
[grsacc20:09171] [[10785,0],0] plm:tm: launching on node grsacc19\
[grsacc20:09171] [[10785,0],0] plm:tm: executing:\
orted -mca ess tm -mca orte_ess_jobid 706805760 -mca orte_ess_vpid 1 -mca orte_ess_num_procs 3 -mca orte_hnp_uri "706805760.0;tcp://192.168.222.20:46019" -mca state_base_verbose 10 -mca errmgr_base_verbose 10 -mca plm_base_verbose 5 -mca btl tcp,sm,self\
[grsacc20:09171] [[10785,0],0] plm:tm: launching on node grsacc18\
[grsacc20:09171] [[10785,0],0] plm:tm: executing:\
orted -mca ess tm -mca orte_ess_jobid 706805760 -mca orte_ess_vpid 2 -mca orte_ess_num_procs 3 -mca orte_hnp_uri "706805760.0;tcp://192.168.222.20:46019" -mca state_base_verbose 10 -mca errmgr_base_verbose 10 -mca plm_base_verbose 5 -mca btl tcp,sm,self\
[grsacc20:09171] [[10785,0],0] plm:tm:launch: finished spawning orteds\
[grsacc19:00465] mca: base: components_register: registering state components\
[grsacc19:00465] mca: base: components_register: found loaded component app\
[grsacc19:00465] mca: base: components_register: component app has no register or open function\
[grsacc19:00465] mca: base: components_register: found loaded component hnp\
[grsacc19:00465] mca: base: components_register: component hnp has no register or open function\
[grsacc19:00465] mca: base: components_register: found loaded component novm\
[grsacc19:00465] mca: base: components_register: component novm register function successful\
[grsacc19:00465] mca: base: components_register: found loaded component orted\
[grsacc19:00465] mca: base: components_register: component orted has no register or open function\
[grsacc19:00465] mca: base: components_register: found loaded component staged_hnp\
[grsacc19:00465] mca: base: components_register: component staged_hnp has no register or open function\
[grsacc19:00465] mca: base: components_register: found loaded component staged_orted\
[grsacc19:00465] mca: base: components_register: component staged_orted has no register or open function\
[grsacc19:00465] mca: base: components_open: opening state components\
[grsacc19:00465] mca: base: components_open: found loaded component app\
[grsacc19:00465] mca: base: components_open: component app open function successful\
[grsacc19:00465] mca: base: components_open: found loaded component hnp\
[grsacc19:00465] mca: base: components_open: component hnp open function successful\
[grsacc19:00465] mca: base: components_open: found loaded component novm\
[grsacc19:00465] mca: base: components_open: component novm open function successful\
[grsacc19:00465] mca: base: components_open: found loaded component orted\
[grsacc19:00465] mca: base: components_open: component orted open function successful\
[grsacc19:00465] mca: base: components_open: found loaded component staged_hnp\
[grsacc19:00465] mca: base: components_open: component staged_hnp open function successful\
[grsacc19:00465] mca: base: components_open: found loaded component staged_orted\
[grsacc19:00465] mca: base: components_open: component staged_orted open function successful\
[grsacc19:00465] mca:base:select: Auto-selecting state components\
[grsacc19:00465] mca:base:select:(state) Querying component [app]\
[grsacc19:00465] mca:base:select:(state) Skipping component [app]. Query failed to return a module\
[grsacc19:00465] mca:base:select:(state) Querying component [hnp]\
[grsacc19:00465] mca:base:select:(state) Skipping component [hnp]. Query failed to return a module\
[grsacc19:00465] mca:base:select:(state) Querying component [novm]\
[grsacc19:00465] mca:base:select:(state) Skipping component [novm]. Query failed to return a module\
[grsacc19:00465] mca:base:select:(state) Querying component [orted]\
[grsacc19:00465] mca:base:select:(state) Query of component [orted] set priority to 100\
[grsacc19:00465] mca:base:select:(state) Querying component [staged_hnp]\
[grsacc19:00465] mca:base:select:(state) Skipping component [staged_hnp]. Query failed to return a module\
[grsacc19:00465] mca:base:select:(state) Querying component [staged_orted]\
[grsacc19:00465] mca:base:select:(state) Skipping component [staged_orted]. Query failed to return a module\
[grsacc19:00465] mca:base:select:(state) Selected component [orted]\
[grsacc19:00465] mca: base: close: component app closed\
[grsacc19:00465] mca: base: close: unloading component app\
[grsacc19:00465] mca: base: close: component hnp closed\
[grsacc19:00465] mca: base: close: unloading component hnp\
[grsacc19:00465] mca: base: close: component novm closed\
[grsacc19:00465] mca: base: close: unloading component novm\
[grsacc19:00465] mca: base: close: component staged_hnp closed\
[grsacc19:00465] mca: base: close: unloading component staged_hnp\
[grsacc19:00465] mca: base: close: component staged_orted closed\
[grsacc19:00465] mca: base: close: unloading component staged_orted\
[grsacc19:00465] ORTE_JOB_STATE_MACHINE:\
[grsacc19:00465] State: LOCAL LAUNCH COMPLETE cbfunc: DEFINED\
[grsacc19:00465] State: FORCED EXIT cbfunc: DEFINED\
[grsacc19:00465] State: DAEMONS TERMINATED cbfunc: DEFINED\
[grsacc19:00465] ORTE_PROC_STATE_MACHINE:\
[grsacc19:00465] State: RUNNING cbfunc: DEFINED\
[grsacc19:00465] State: SYNC REGISTERED cbfunc: DEFINED\
[grsacc19:00465] State: IOF COMPLETE cbfunc: DEFINED\
[grsacc19:00465] State: WAITPID FIRED cbfunc: DEFINED\
[grsacc19:00465] mca: base: components_register: registering errmgr components\
[grsacc19:00465] mca: base: components_register: found loaded component default_app\
[grsacc19:00465] mca: base: components_register: component default_app register function successful\
[grsacc19:00465] mca: base: components_register: found loaded component default_hnp\
[grsacc19:00465] mca: base: components_register: component default_hnp register function successful\
[grsacc19:00465] mca: base: components_register: found loaded component default_orted\
[grsacc19:00465] mca: base: components_register: component default_orted register function successful\
[grsacc19:00465] mca: base: components_open: opening errmgr components\
[grsacc19:00465] mca: base: components_open: found loaded component default_app\
[grsacc19:00465] mca: base: components_open: component default_app open function successful\
[grsacc19:00465] mca: base: components_open: found loaded component default_hnp\
[grsacc19:00465] mca: base: components_open: component default_hnp open function successful\
[grsacc19:00465] mca: base: components_open: found loaded component default_orted\
[grsacc19:00465] mca: base: components_open: component default_orted open function successful\
[grsacc19:00465] mca:base:select:( plm) Querying component [rsh]\
[grsacc19:00465] [[10785,0],1] plm:rsh_lookup on agent ssh : rsh path NULL\
[grsacc19:00465] mca:base:select:( plm) Query of component [rsh] set priority to 10\
[grsacc19:00465] mca:base:select:( plm) Selected component [rsh]\
[grsacc19:00465] mca:base:select: Auto-selecting errmgr components\
[grsacc19:00465] mca:base:select:(errmgr) Querying component [default_app]\
[grsacc19:00465] mca:base:select:(errmgr) Skipping component [default_app]. Query failed to return a module\
[grsacc19:00465] mca:base:select:(errmgr) Querying component [default_hnp]\
[grsacc19:00465] mca:base:select:(errmgr) Skipping component [default_hnp]. Query failed to return a module\
[grsacc19:00465] mca:base:select:(errmgr) Querying component [default_orted]\
[grsacc19:00465] mca:base:select:(errmgr) Query of component [default_orted] set priority to 1000\
[grsacc19:00465] mca:base:select:(errmgr) Selected component [default_orted]\
[grsacc19:00465] mca: base: close: component default_app closed\
[grsacc19:00465] mca: base: close: unloading component default_app\
[grsacc19:00465] mca: base: close: component default_hnp closed\
[grsacc19:00465] mca: base: close: unloading component default_hnp\
[grsacc19:00465] [[10785,0],1] plm:rsh_setup on agent ssh : rsh path NULL\
[grsacc19:00465] [[10785,0],1] plm:base:receive start comm\
all info set, ready to spawn\
[grsacc19:00462] [[10785,0],1] ACTIVATE PROC [[10785,0],0] STATE LIFELINE LOST AT oob_tcp_component.c:1102\
[grsacc19:00462] [[10785,0],1] ACTIVATING PROC [[10785,0],0] STATE LIFELINE LOST PRI 0\
[grsacc19:00462] [[10785,0],1] errmgr:default_orted:proc_errors process [[10785,0],0] error state LIFELINE LOST\
[grsacc19:00462] [[10785,0],1] errmgr:orted lifeline lost - exiting\
[grsacc19:00465] [[10785,0],1] FORCE-TERMINATE AT oob_tcp_sendrecv.c:430\
[grsacc19:00465] [[10785,0],1] ACTIVATE JOB NULL STATE FORCED EXIT AT oob_tcp_sendrecv.c:430\
[grsacc19:00465] [[10785,0],1] ACTIVATING JOB NULL STATE FORCED EXIT PRI 0\
[grsacc19:00465] mca: base: close: component default_orted closed\
[grsacc18:20721] mca: base: components_register: registering state components\
[grsacc19:00465] mca: base: close: unloading component default_orted\
[grsacc19:00465] [[10785,0],1] plm:base:receive stop comm\
[grsacc19:00465] mca: base: close: component orted closed\
[grsacc19:00465] mca: base: close: unloading component orted\
[grsacc18:20721] mca: base: components_register: found loaded component app\
[grsacc18:20721] mca: base: components_register: component app has no register or open function\
[grsacc18:20721] mca: base: components_register: found loaded component hnp\
[grsacc18:20721] mca: base: components_register: component hnp has no register or open function\
[grsacc18:20721] mca: base: components_register: found loaded component novm\
[grsacc18:20721] mca: base: components_register: component novm register function successful\
[grsacc18:20721] mca: base: components_register: found loaded component orted\
[grsacc18:20721] mca: base: components_register: component orted has no register or open function\
[grsacc18:20721] mca: base: components_register: found loaded component staged_hnp\
[grsacc18:20721] mca: base: components_register: component staged_hnp has no register or open function\
[grsacc18:20721] mca: base: components_register: found loaded component staged_orted\
[grsacc18:20721] mca: base: components_register: component staged_orted has no register or open function\
[grsacc18:20721] mca: base: components_open: opening state components\
[grsacc18:20721] mca: base: components_open: found loaded component app\
[grsacc18:20721] mca: base: components_open: component app open function successful\
[grsacc18:20721] mca: base: components_open: found loaded component hnp\
[grsacc18:20721] mca: base: components_open: component hnp open function successful\
[grsacc18:20721] mca: base: components_open: found loaded component novm\
[grsacc18:20721] mca: base: components_open: component novm open function successful\
[grsacc18:20721] mca: base: components_open: found loaded component orted\
[grsacc18:20721] mca: base: components_open: component orted open function successful\
[grsacc18:20721] mca: base: components_open: found loaded component staged_hnp\
[grsacc18:20721] mca: base: components_open: component staged_hnp open function successful\
[grsacc18:20721] mca: base: components_open: found loaded component staged_orted\
[grsacc18:20721] mca: base: components_open: component staged_orted open function successful\
[grsacc18:20721] mca:base:select: Auto-selecting state components\
[grsacc18:20721] mca:base:select:(state) Querying component [app]\
[grsacc18:20721] mca:base:select:(state) Skipping component [app]. Query failed to return a module\
[grsacc18:20721] mca:base:select:(state) Querying component [hnp]\
[grsacc18:20721] mca:base:select:(state) Skipping component [hnp]. Query failed to return a module\
[grsacc18:20721] mca:base:select:(state) Querying component [novm]\
[grsacc18:20721] mca:base:select:(state) Skipping component [novm]. Query failed to return a module\
[grsacc18:20721] mca:base:select:(state) Querying component [orted]\
[grsacc18:20721] mca:base:select:(state) Query of component [orted] set priority to 100\
[grsacc18:20721] mca:base:select:(state) Querying component [staged_hnp]\
[grsacc18:20721] mca:base:select:(state) Skipping component [staged_hnp]. Query failed to return a module\
[grsacc18:20721] mca:base:select:(state) Querying component [staged_orted]\
[grsacc18:20721] mca:base:select:(state) Skipping component [staged_orted]. Query failed to return a module\
[grsacc18:20721] mca:base:select:(state) Selected component [orted]\
[grsacc18:20721] mca: base: close: component app closed\
[grsacc18:20721] mca: base: close: unloading component app\
[grsacc18:20721] mca: base: close: component hnp closed\
[grsacc18:20721] mca: base: close: unloading component hnp\
[grsacc18:20721] mca: base: close: component novm closed\
[grsacc18:20721] mca: base: close: unloading component novm\
[grsacc18:20721] mca: base: close: component staged_hnp closed\
[grsacc18:20721] mca: base: close: unloading component staged_hnp\
[grsacc18:20721] mca: base: close: component staged_orted closed\
[grsacc18:20721] mca: base: close: unloading component staged_orted\
[grsacc18:20721] ORTE_JOB_STATE_MACHINE:\
[grsacc18:20721] State: LOCAL LAUNCH COMPLETE cbfunc: DEFINED\
[grsacc18:20721] State: FORCED EXIT cbfunc: DEFINED\
[grsacc18:20721] State: DAEMONS TERMINATED cbfunc: DEFINED\
[grsacc18:20721] ORTE_PROC_STATE_MACHINE:\
[grsacc18:20721] State: RUNNING cbfunc: DEFINED\
[grsacc18:20721] State: SYNC REGISTERED cbfunc: DEFINED\
[grsacc18:20721] State: IOF COMPLETE cbfunc: DEFINED\
[grsacc18:20721] State: WAITPID FIRED cbfunc: DEFINED\
[grsacc18:20721] mca: base: components_register: registering errmgr components\
[grsacc18:20721] mca: base: components_register: found loaded component default_app\
[grsacc18:20721] mca: base: components_register: component default_app register function successful\
[grsacc18:20721] mca: base: components_register: found loaded component default_hnp\
[grsacc18:20721] mca: base: components_register: component default_hnp register function successful\
[grsacc18:20721] mca: base: components_register: found loaded component default_orted\
[grsacc18:20721] mca: base: components_register: component default_orted register function successful\
[grsacc18:20721] mca: base: components_open: opening errmgr components\
[grsacc18:20721] mca: base: components_open: found loaded component default_app\
[grsacc18:20721] mca: base: components_open: component default_app open function successful\
[grsacc18:20721] mca: base: components_open: found loaded component default_hnp\
[grsacc18:20721] mca: base: components_open: component default_hnp open function successful\
[grsacc18:20721] mca: base: components_open: found loaded component default_orted\
[grsacc18:20721] mca: base: components_open: component default_orted open function successful\
[grsacc18:20721] mca:base:select:( plm) Querying component [rsh]\
[grsacc18:20721] [[10785,0],2] plm:rsh_lookup on agent ssh : rsh path NULL\
[grsacc18:20721] mca:base:select:( plm) Query of component [rsh] set priority to 10\
[grsacc18:20721] mca:base:select:( plm) Selected component [rsh]\
[grsacc18:20721] mca:base:select: Auto-selecting errmgr components\
[grsacc18:20721] mca:base:select:(errmgr) Querying component [default_app]\
[grsacc18:20721] mca:base:select:(errmgr) Skipping component [default_app]. Query failed to return a module\
[grsacc18:20721] mca:base:select:(errmgr) Querying component [default_hnp]\
[grsacc18:20721] mca:base:select:(errmgr) Skipping component [default_hnp]. Query failed to return a module\
[grsacc18:20721] mca:base:select:(errmgr) Querying component [default_orted]\
[grsacc18:20721] mca:base:select:(errmgr) Query of component [default_orted] set priority to 1000\
[grsacc18:20721] mca:base:select:(errmgr) Selected component [default_orted]\
[grsacc18:20721] mca: base: close: component default_app closed\
[grsacc18:20721] mca: base: close: unloading component default_app\
[grsacc18:20721] mca: base: close: component default_hnp closed\
[grsacc18:20721] mca: base: close: unloading component default_hnp\
[grsacc18:20721] [[10785,0],2] plm:rsh_setup on agent ssh : rsh path NULL\
[grsacc18:20721] [[10785,0],2] plm:base:receive start comm\
[grsacc20:09171] [[10785,0],0] plm:base:orted_report_launch from daemon [[10785,0],2]\
[grsacc20:09171] [[10785,0],0] plm:base:orted_report_launch from daemon [[10785,0],2] on node grsacc18\
[grsacc20:09171] [[10785,0],0] plm:base:orted_report_launch completed for daemon [[10785,0],2] at contact 706805760.2;tcp://192.168.222.18:51115\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,2] STATE ALL DAEMONS REPORTED AT base/plm_base_launch_support.c:842\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,2] STATE ALL DAEMONS REPORTED PRI 4\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,2] STATE VM READY AT base/plm_base_launch_support.c:170\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,2] STATE VM READY PRI 4\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,2] STATE PENDING FINAL SYSTEM PREP AT state_novm.c:265\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,2] STATE PENDING FINAL SYSTEM PREP PRI 4\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,2] STATE PENDING APP LAUNCH AT base/plm_base_launch_support.c:410\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,2] STATE PENDING APP LAUNCH PRI 4\
[grsacc20:09171] [[10785,0],0] plm:base:launch_apps for job [10785,2]\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,2] STATE LOCAL LAUNCH COMPLETE AT base/odls_base_default_fns.c:1593\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,2] STATE LOCAL LAUNCH COMPLETE PRI 4\
[grsacc18:20721] [[10785,0],2] ACTIVATE PROC [[10785,2],0] STATE RUNNING AT base/odls_base_default_fns.c:1545\
[grsacc18:20721] [[10785,0],2] ACTIVATING PROC [[10785,2],0] STATE RUNNING PRI 4\
[grsacc18:20721] [[10785,0],2] ACTIVATE JOB [10785,2] STATE LOCAL LAUNCH COMPLETE AT base/odls_base_default_fns.c:1593\
[grsacc18:20721] [[10785,0],2] ACTIVATING JOB [10785,2] STATE LOCAL LAUNCH COMPLETE PRI 4\
[grsacc18:20721] [[10785,0],2] state:orted:track_procs called for proc [[10785,2],0] state RUNNING\
[grsacc18:20721] [[10785,0],2] state:orted:track_jobs sending local launch complete for job [10785,2]\
[grsacc20:09171] [[10785,0],0] plm:base:receive processing msg\
[grsacc20:09171] [[10785,0],0] plm:base:receive update proc state command from [[10785,0],2]\
[grsacc20:09171] [[10785,0],0] plm:base:receive got update_proc_state for job [10785,2]\
[grsacc20:09171] [[10785,0],0] plm:base:receive got update_proc_state for vpid 0 state RUNNING exit_code 0\
[grsacc20:09171] [[10785,0],0] ACTIVATE PROC [[10785,2],0] STATE RUNNING AT base/plm_base_receive.c:296\
[grsacc20:09171] [[10785,0],0] ACTIVATING PROC [[10785,2],0] STATE RUNNING PRI 4\
[grsacc20:09171] [[10785,0],0] plm:base:receive done processing commands\
[grsacc18:20723] mca: base: components_register: registering state components\
[grsacc18:20723] mca: base: components_register: found loaded component app\
[grsacc18:20723] mca: base: components_register: component app has no register or open function\
[grsacc18:20723] mca: base: components_register: found loaded component hnp\
[grsacc18:20723] mca: base: components_register: component hnp has no register or open function\
[grsacc18:20723] mca: base: components_register: found loaded component novm\
[grsacc18:20723] mca: base: components_register: component novm register function successful\
[grsacc18:20723] mca: base: components_register: found loaded component orted\
[grsacc18:20723] mca: base: components_register: component orted has no register or open function\
[grsacc18:20723] mca: base: components_register: found loaded component staged_hnp\
[grsacc18:20723] mca: base: components_register: component staged_hnp has no register or open function\
[grsacc18:20723] mca: base: components_register: found loaded component staged_orted\
[grsacc18:20723] mca: base: components_register: component staged_orted has no register or open function\
[grsacc18:20723] mca: base: components_open: opening state components\
[grsacc18:20723] mca: base: components_open: found loaded component app\
[grsacc18:20723] mca: base: components_open: component app open function successful\
[grsacc18:20723] mca: base: components_open: found loaded component hnp\
[grsacc18:20723] mca: base: components_open: component hnp open function successful\
[grsacc18:20723] mca: base: components_open: found loaded component novm\
[grsacc18:20723] mca: base: components_open: component novm open function successful\
[grsacc18:20723] mca: base: components_open: found loaded component orted\
[grsacc18:20723] mca: base: components_open: component orted open function successful\
[grsacc18:20723] mca: base: components_open: found loaded component staged_hnp\
[grsacc18:20723] mca: base: components_open: component staged_hnp open function successful\
[grsacc18:20723] mca: base: components_open: found loaded component staged_orted\
[grsacc18:20723] mca: base: components_open: component staged_orted open function successful\
[grsacc18:20723] mca:base:select: Auto-selecting state components\
[grsacc18:20723] mca:base:select:(state) Querying component [app]\
[grsacc18:20723] mca:base:select:(state) Query of component [app] set priority to 1000\
[grsacc18:20723] mca:base:select:(state) Querying component [hnp]\
[grsacc18:20723] mca:base:select:(state) Skipping component [hnp]. Query failed to return a module\
[grsacc18:20723] mca:base:select:(state) Querying component [novm]\
[grsacc18:20723] mca:base:select:(state) Skipping component [novm]. Query failed to return a module\
[grsacc18:20723] mca:base:select:(state) Querying component [orted]\
[grsacc18:20723] mca:base:select:(state) Skipping component [orted]. Query failed to return a module\
[grsacc18:20723] mca:base:select:(state) Querying component [staged_hnp]\
[grsacc18:20723] mca:base:select:(state) Skipping component [staged_hnp]. Query failed to return a module\
[grsacc18:20723] mca:base:select:(state) Querying component [staged_orted]\
[grsacc18:20723] mca:base:select:(state) Skipping component [staged_orted]. Query failed to return a module\
[grsacc18:20723] mca:base:select:(state) Selected component [app]\
[grsacc18:20723] mca: base: close: component hnp closed\
[grsacc18:20723] mca: base: close: unloading component hnp\
[grsacc18:20723] mca: base: close: component novm closed\
[grsacc18:20723] mca: base: close: unloading component novm\
[grsacc18:20723] mca: base: close: component orted closed\
[grsacc18:20723] mca: base: close: unloading component orted\
[grsacc18:20723] mca: base: close: component staged_hnp closed\
[grsacc18:20723] mca: base: close: unloading component staged_hnp\
[grsacc18:20723] mca: base: close: component staged_orted closed\
[grsacc18:20723] mca: base: close: unloading component staged_orted\
[grsacc20:09171] [[10785,0],0] state:base:track_procs called for proc [[10785,2],0] state RUNNING\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,2] STATE RUNNING AT base/state_base_fns.c:482\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,2] STATE RUNNING PRI 4\
[grsacc20:09171] [[10785,0],0] plm:base:launch wiring up iof for job [10785,2]\
[grsacc18:20723] mca: base: components_register: registering errmgr components\
[grsacc18:20723] mca: base: components_register: found loaded component default_app\
[grsacc18:20723] mca: base: components_register: component default_app register function successful\
[grsacc18:20723] mca: base: components_register: found loaded component default_hnp\
[grsacc18:20723] mca: base: components_register: component default_hnp register function successful\
[grsacc18:20723] mca: base: components_register: found loaded component default_orted\
[grsacc18:20723] mca: base: components_register: component default_orted register function successful\
[grsacc18:20723] mca: base: components_open: opening errmgr components\
[grsacc18:20723] mca: base: components_open: found loaded component default_app\
[grsacc18:20723] mca: base: components_open: component default_app open function successful\
[grsacc18:20723] mca: base: components_open: found loaded component default_hnp\
[grsacc18:20723] mca: base: components_open: component default_hnp open function successful\
[grsacc18:20723] mca: base: components_open: found loaded component default_orted\
[grsacc18:20723] mca: base: components_open: component default_orted open function successful\
[grsacc18:20723] mca:base:select: Auto-selecting errmgr components\
[grsacc18:20723] mca:base:select:(errmgr) Querying component [default_app]\
[grsacc18:20723] mca:base:select:(errmgr) Query of component [default_app] set priority to 1000\
[grsacc18:20723] mca:base:select:(errmgr) Querying component [default_hnp]\
[grsacc18:20723] mca:base:select:(errmgr) Skipping component [default_hnp]. Query failed to return a module\
[grsacc18:20723] mca:base:select:(errmgr) Querying component [default_orted]\
[grsacc18:20723] mca:base:select:(errmgr) Skipping component [default_orted]. Query failed to return a module\
[grsacc18:20723] mca:base:select:(errmgr) Selected component [default_app]\
[grsacc18:20723] mca: base: close: component default_hnp closed\
[grsacc18:20723] mca: base: close: unloading component default_hnp\
[grsacc18:20723] mca: base: close: component default_orted closed\
[grsacc18:20723] mca: base: close: unloading component default_orted\
[grsacc18:20721] [[10785,0],2] ACTIVATE PROC [[10785,2],0] STATE SYNC REGISTERED AT base/odls_base_default_fns.c:1836\
[grsacc20:09171] [[10785,0],0] plm:base:receive processing msg\
[grsacc18:20721] [[10785,0],2] ACTIVATING PROC [[10785,2],0] STATE SYNC REGISTERED PRI 4\
[grsacc20:09171] [[10785,0],0] ACTIVATE PROC [[10785,2],0] STATE SYNC REGISTERED AT base/plm_base_receive.c:354\
[grsacc20:09171] [[10785,0],0] ACTIVATING PROC [[10785,2],0] STATE SYNC REGISTERED PRI 4\
[grsacc20:09171] [[10785,0],0] plm:base:receive done processing commands\
[grsacc20:09171] [[10785,0],0] state:base:track_procs called for proc [[10785,2],0] state SYNC REGISTERED\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,2] STATE SYNC REGISTERED AT base/state_base_fns.c:490\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,2] STATE SYNC REGISTERED PRI 4\
[grsacc20:09171] [[10785,0],0] plm:base:launch registered event\
[grsacc20:09171] [[10785,0],0] plm:base:launch sending dyn release of job [10785,2] to [[10785,1],0]\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB [10785,2] STATE READY FOR DEBUGGERS AT base/plm_base_launch_support.c:609\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB [10785,2] STATE READY FOR DEBUGGERS PRI 4\
[grsacc18:20721] [[10785,0],2] state:orted:track_procs called for proc [[10785,2],0] state SYNC REGISTERED\
[grsacc18:20721] [[10785,0],2] state:orted: sending contact info to HNP\
[grsacc19:00462] [[10785,0],1] FORCE-TERMINATE AT errmgr_default_orted.c:259\
[grsacc19:00462] [[10785,0],1] ACTIVATE JOB NULL STATE FORCED EXIT AT errmgr_default_orted.c:259\
[grsacc19:00462] [[10785,0],1] ACTIVATING JOB NULL STATE FORCED EXIT PRI 0\
[grsacc19:00462] mca: base: close: component default_orted closed\
[grsacc19:00462] mca: base: close: unloading component default_orted\
[grsacc19:00462] [[10785,0],1] plm:base:receive stop comm\
[grsacc19:00462] mca: base: close: component orted closed\
[grsacc19:00462] mca: base: close: unloading component orted\
[grsacc20:09171] [[10785,0],0] FORCE-TERMINATE AT oob_tcp_sendrecv.c:430\
[grsacc20:09171] [[10785,0],0] ACTIVATE JOB NULL STATE FORCED EXIT AT oob_tcp_sendrecv.c:430\
[grsacc20:09171] [[10785,0],0] ACTIVATING JOB NULL STATE FORCED EXIT PRI 0\
[grsacc20:09171] [[10785,0],0] plm:base:receive stop comm\
[grsacc20:09171] mca: base: close: component default_hnp closed\
[grsacc20:09171] mca: base: close: unloading component default_hnp\
[grsacc20:09171] mca: base: close: component novm closed\
[grsacc20:09171] mca: base: close: unloading component novm\
-bash-4.1$ [grsacc18:20721] [[10785,0],2] ACTIVATE PROC [[10785,0],0] STATE LIFELINE LOST AT oob_tcp_component.c:1102\
[grsacc18:20721] [[10785,0],2] ACTIVATING PROC [[10785,0],0] STATE LIFELINE LOST PRI 0\
[grsacc18:20721] [[10785,0],2] errmgr:default_orted:proc_errors process [[10785,0],0] error state LIFELINE LOST\
[grsacc18:20721] [[10785,0],2] errmgr:orted lifeline lost - exiting\
[grsacc18:20721] [[10785,0],2] FORCE-TERMINATE AT errmgr_default_orted.c:259\
[grsacc18:20721] [[10785,0],2] ACTIVATE JOB NULL STATE FORCED EXIT AT errmgr_default_orted.c:259\
[grsacc18:20721] [[10785,0],2] ACTIVATING JOB NULL STATE FORCED EXIT PRI 0\
[grsacc18:20721] mca: base: close: component default_orted closed\
[grsacc18:20721] mca: base: close: unloading component default_orted\
[grsacc18:20721] [[10785,0],2] plm:base:receive stop comm\
[grsacc18:20721] mca: base: close: component orted closed\
[grsacc18:20721] mca: base: close: unloading component orted}
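To make the mismatch in the -novm log above easier to spot, here is a minimal Python sketch that compares the nodes plm:base:setup_vm assigned new daemons to with the nodes plm:tm actually launched on. It is not part of Open MPI: the function name `unexpected_launches` and the embedded `SAMPLE_LOG` are only for illustration, and the sample lines are copied from the log above with the RTF line-break backslashes removed.

```python
import re

# Four lines copied from the -novm log above (RTF backslashes removed).
SAMPLE_LOG = """\
[grsacc20:09171] [[10785,0],0] plm:base:setup_vm add new daemon [[10785,0],2]
[grsacc20:09171] [[10785,0],0] plm:base:setup_vm assigning new daemon [[10785,0],2] to node grsacc18
[grsacc20:09171] [[10785,0],0] plm:tm: launching on node grsacc19
[grsacc20:09171] [[10785,0],0] plm:tm: launching on node grsacc18
"""

# Node the HNP assigned a *new* daemon to during setup_vm.
ASSIGNED = re.compile(r"plm:base:setup_vm assigning new daemon \S+ to node ([\w.-]+)")
# Node the Torque launcher reported launching a daemon on.
LAUNCHED = re.compile(r"plm:tm: launching on node ([\w.-]+)")


def unexpected_launches(log_text):
    """Return nodes plm:tm launched on that setup_vm did not assign a new daemon to."""
    assigned = set(ASSIGNED.findall(log_text))
    launched = LAUNCHED.findall(log_text)
    return [node for node in launched if node not in assigned]


if __name__ == "__main__":
    print(unexpected_launches(SAMPLE_LOG))  # prints ['grsacc19']
```

On the sample lines this reports grsacc19, which is the node where a second orted (pid 00465, vpid 1) was started even though setup_vm only added daemon [[10785,0],2] for grsacc18.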
On Sep 24, 2013, at 8:24 PM, Ralph Castain wrote:
> What I find puzzling is that I don't see any output indicating that you went
> thru the Torque launcher to launch the daemons - not a peep of debug output.
> This makes me suspicious that something else is going on. Are you sure you
> sent me all the output?
>
> Try adding -novm to your mpirun cmd line and let's see if that mode works
>
> On Sep 24, 2013, at 9:06 AM, Suraj Prabhakaran <suraj.prabhaka...@gmail.com>
> wrote:
>
>> Hi Ralph,
>>
>> So here is what I do. I spawn just a "single" process on a new node which is
>> basically not in the $PBS_NODEFILE list.
>> My $PBS_NODEFILE list contains
>> grsacc20
>> grsacc19
>>
>> I then start the app with just 2 processes. So one host gets one process and
>> they are successfully spawned through Torque (via tm_spawn()). MPI
>> would have stored grsacc20 and grsacc19 in its list of hosts with launchid 0
>> and 1, respectively.
>> I then use the add-host info and spawn ONE new process on a new host
>> "grsacc18" through MPI_Comm_spawn. From what I saw in the code, the launchid
>> of this new host is -1 since openmpi does not know about this and it is not
>> available in the $PBS_NODEFILE. Since without the launchid, torque would
>> not know where to spawn, I just retrieve the correct launchid of this host
>> from a file just before tm_spawn() and use this launchid. This is the only
>> modification that I made to openmpi.
>> So, the host "grsacc18" will have a new launchid = 2 and will be used to
>> spawn the process through torque. This worked perfectly until 1.6.5.
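
The launchid lookup described above might be sketched as follows. This is only an illustration, not the actual modification: the helper name `lookup_launchid`, the map-file path, and the one-pair-per-line "<hostname> <launchid>" file format are all assumptions, since the message does not say how the file is laid out.

```c
#include <stdio.h>
#include <string.h>

/* Hypothetical helper (not part of Open MPI or Torque): look up the launch id
 * for `host` in a plain-text map file holding one "<hostname> <launchid>"
 * pair per line. Returns the launch id, or -1 if the host is not listed or
 * the file cannot be opened. Reading stops at the first malformed line. */
int lookup_launchid(const char *map_path, const char *host)
{
    FILE *fp = fopen(map_path, "r");
    char name[256];
    int id;
    int found = -1;

    if (fp == NULL) {
        return -1;
    }
    while (fscanf(fp, "%255s %d", name, &id) == 2) {
        if (strcmp(name, host) == 0) {
            found = id;
            break;
        }
    }
    fclose(fp);
    return found;
}
```

With a map file listing grsacc20 as 0, grsacc19 as 1 and grsacc18 as 2, a lookup for "grsacc18" would yield 2, which would then replace the -1 before the tm_spawn() call.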
>>
>> As we see here from the outputs, although I spawn only a single process on
>> grsacc18, I too have no clue why openmpi tries to spawn something on
>> grsacc19. Of course, without pbs/torque involved, everything works fine.
>> I have attached the simple test code. Please modify hostnames and executable
>> path before use.
>>
>> Best,
>> Suraj
>>
>> <addhosttest.c>
>>
>>
>> On Sep 24, 2013, at 4:59 PM, Ralph Castain wrote:
>>
>>> I'm going to need a little help here. The problem is that you launch two
>>> new daemons, and one of them exits immediately because it thinks it lost
>>> the connection back to mpirun - before it even gets a chance to create it.
>>>
>>> Can you give me a little more info as to exactly what you are doing?
>>> Perhaps send me your test code?
>>>
>>> On Sep 24, 2013, at 7:48 AM, Suraj Prabhakaran
>>> <suraj.prabhaka...@gmail.com> wrote:
>>>
>>>> Hi Ralph,
>>>>
>>>> Output attached in a file.
>>>> Thanks a lot!
>>>>
>>>> Best,
>>>> Suraj
>>>>
>>>> <output.rtf>
>>>>
>>>> On Sep 24, 2013, at 4:11 PM, Ralph Castain wrote:
>>>>
>>>>> Afraid I don't see the problem offhand - can you add the following to
>>>>> your cmd line?
>>>>>
>>>>> -mca state_base_verbose 10 -mca errmgr_base_verbose 10
>>>>>
>>>>> Thanks
>>>>> Ralph
>>>>>
>>>>> On Sep 24, 2013, at 6:35 AM, Suraj Prabhakaran
>>>>> <suraj.prabhaka...@gmail.com> wrote:
>>>>>
>>>>>> Hi Ralph,
>>>>>>
>>>>>> I always got this output from any MPI job that ran on our nodes. There
>>>>>> seems to be a problem somewhere but it never stopped the applications
>>>>>> from running. But anyway, I ran it again now with only tcp and excluded
>>>>>> the infiniband and I get the same output again. Except that this time,
>>>>>> the error related to this openib is not there anymore. Printing out the
>>>>>> log again.
>>>>>>
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:receive processing msg
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:receive job launch command from
>>>>>> [[6160,1],0]
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:receive adding hosts
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:receive calling spawn
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:receive done processing commands
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:setup_job
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:setup_vm
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:setup_vm add new daemon
>>>>>> [[6160,0],2]
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:setup_vm assigning new daemon
>>>>>> [[6160,0],2] to node grsacc18
>>>>>> [grsacc20:04578] [[6160,0],0] plm:tm: launching vm
>>>>>> [grsacc20:04578] [[6160,0],0] plm:tm: final top-level argv:
>>>>>> orted -mca ess tm -mca orte_ess_jobid 403701760 -mca orte_ess_vpid
>>>>>> <template> -mca orte_ess_num_procs 3 -mca orte_hnp_uri
>>>>>> "403701760.0;tcp://192.168.222.20:35163" -mca plm_base_verbose 5 -mca
>>>>>> btl tcp,sm,self
>>>>>> [grsacc20:04578] [[6160,0],0] plm:tm: launching on node grsacc19
>>>>>> [grsacc20:04578] [[6160,0],0] plm:tm: executing:
>>>>>> orted -mca ess tm -mca orte_ess_jobid 403701760 -mca orte_ess_vpid 1
>>>>>> -mca orte_ess_num_procs 3 -mca orte_hnp_uri
>>>>>> "403701760.0;tcp://192.168.222.20:35163" -mca plm_base_verbose 5 -mca
>>>>>> btl tcp,sm,self
>>>>>> [grsacc20:04578] [[6160,0],0] plm:tm: launching on node grsacc18
>>>>>> [grsacc20:04578] [[6160,0],0] plm:tm: executing:
>>>>>> orted -mca ess tm -mca orte_ess_jobid 403701760 -mca orte_ess_vpid 2
>>>>>> -mca orte_ess_num_procs 3 -mca orte_hnp_uri
>>>>>> "403701760.0;tcp://192.168.222.20:35163" -mca plm_base_verbose 5 -mca
>>>>>> btl tcp,sm,self
>>>>>> [grsacc20:04578] [[6160,0],0] plm:tm:launch: finished spawning orteds
>>>>>> [grsacc19:28821] mca:base:select:( plm) Querying component [rsh]
>>>>>> [grsacc19:28821] [[6160,0],1] plm:rsh_lookup on agent ssh : rsh path NULL
>>>>>> [grsacc19:28821] mca:base:select:( plm) Query of component [rsh] set
>>>>>> priority to 10
>>>>>> [grsacc19:28821] mca:base:select:( plm) Selected component [rsh]
>>>>>> [grsacc19:28821] [[6160,0],1] plm:rsh_setup on agent ssh : rsh path NULL
>>>>>> [grsacc19:28821] [[6160,0],1] plm:base:receive start comm
>>>>>> [grsacc19:28821] [[6160,0],1] plm:base:receive stop comm
>>>>>> [grsacc18:16717] mca:base:select:( plm) Querying component [rsh]
>>>>>> [grsacc18:16717] [[6160,0],2] plm:rsh_lookup on agent ssh : rsh path NULL
>>>>>> [grsacc18:16717] mca:base:select:( plm) Query of component [rsh] set
>>>>>> priority to 10
>>>>>> [grsacc18:16717] mca:base:select:( plm) Selected component [rsh]
>>>>>> [grsacc18:16717] [[6160,0],2] plm:rsh_setup on agent ssh : rsh path NULL
>>>>>> [grsacc18:16717] [[6160,0],2] plm:base:receive start comm
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:orted_report_launch from daemon
>>>>>> [[6160,0],2]
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:orted_report_launch from daemon
>>>>>> [[6160,0],2] on node grsacc18
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:orted_report_launch completed for
>>>>>> daemon [[6160,0],2] at contact 403701760.2;tcp://192.168.222.18:44229
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:launch_apps for job [6160,2]
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:receive processing msg
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:receive update proc state command
>>>>>> from [[6160,0],2]
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:receive got update_proc_state for
>>>>>> job [6160,2]
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:receive got update_proc_state for
>>>>>> vpid 0 state RUNNING exit_code 0
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:receive done processing commands
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:launch wiring up iof for job
>>>>>> [6160,2]
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:receive processing msg
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:receive done processing commands
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:launch registered event
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:launch sending dyn release of job
>>>>>> [6160,2] to [[6160,1],0]
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:orted_cmd sending orted_exit
>>>>>> commands
>>>>>> [grsacc19:28815] [[6160,0],1] plm:base:receive stop comm
>>>>>> [grsacc20:04578] [[6160,0],0] plm:base:receive stop comm
>>>>>> -bash-4.1$ [grsacc18:16717] [[6160,0],2] plm:base:receive stop comm
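
For anyone matching the numeric ids in this log against the [[family,job],vpid] names, here is a small sketch. It assumes the 32-bit jobid carries the job family in its upper 16 bits and the local job number in its lower 16 bits, which is consistent with the values printed above (403701760 corresponds to family 6160). The function names are illustrative, not ORTE's own.

```c
#include <stdint.h>
#include <stdio.h>

/* Assumed layout, consistent with the log above: upper 16 bits of the jobid
 * are the job family, lower 16 bits are the local job number. */
static uint32_t job_family(uint32_t jobid)  { return (jobid >> 16) & 0xffffu; }
static uint32_t local_jobid(uint32_t jobid) { return jobid & 0xffffu; }

/* Parse the leading "<jobid>.<vpid>;" part of a contact string such as
 * "403701760.0;tcp://192.168.222.20:35163".
 * Returns 0 on success, -1 if the string does not start with that pattern. */
int parse_hnp_name(const char *uri, uint32_t *jobid, uint32_t *vpid)
{
    unsigned long j, v;

    if (sscanf(uri, "%lu.%lu;", &j, &v) != 2) {
        return -1;
    }
    *jobid = (uint32_t)j;
    *vpid = (uint32_t)v;
    return 0;
}
```

Applied to the contact string reported for the new daemon, "403701760.2;tcp://192.168.222.18:44229", this gives family 6160, local job 0 and vpid 2, i.e. the daemon logged as [[6160,0],2] on grsacc18.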
>>>>>>
>>>>>> Best,
>>>>>> Suraj
>>>>>> On Sep 24, 2013, at 3:24 PM, Ralph Castain wrote:
>>>>>>
>>>>>>> Your output shows that it launched your apps, but they exited. The
>>>>>>> error is reported here, though it appears we aren't flushing the
>>>>>>> message out before exiting due to a race condition:
>>>>>>>
>>>>>>>> [grsacc20:04511] 1 more process has sent help message
>>>>>>>> help-mpi-btl-openib.txt / no active ports found
>>>>>>>
>>>>>>> Here is the full text:
>>>>>>> [no active ports found]
>>>>>>> WARNING: There is at least non-excluded one OpenFabrics device found,
>>>>>>> but there are no active ports detected (or Open MPI was unable to use
>>>>>>> them). This is most certainly not what you wanted. Check your
>>>>>>> cables, subnet manager configuration, etc. The openib BTL will be
>>>>>>> ignored for this job.
>>>>>>>
>>>>>>> Local host: %s
>>>>>>>
>>>>>>> Looks like at least one node being used doesn't have an active
>>>>>>> Infiniband port on it?
>>>>>>>
>>>>>>>
>>>>>>> On Sep 24, 2013, at 6:11 AM, Suraj Prabhakaran
>>>>>>> <suraj.prabhaka...@gmail.com> wrote:
>>>>>>>
>>>>>>>> Hi Ralph,
>>>>>>>>
>>>>>>>> I tested it with the trunk r29228. I still have the following problem.
>>>>>>>> Now, it even spawns the daemon on the new node through torque but then
>>>>>>>> suddenly quits. The following is the output. Can you please have a
>>>>>>>> look?
>>>>>>>>
>>>>>>>> Thanks
>>>>>>>> Suraj
>>>>>>>>
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:receive processing msg
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:receive job launch command from
>>>>>>>> [[6253,1],0]
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:receive adding hosts
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:receive calling spawn
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:receive done processing commands
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:setup_job
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:setup_vm
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:setup_vm add new daemon
>>>>>>>> [[6253,0],2]
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:setup_vm assigning new daemon
>>>>>>>> [[6253,0],2] to node grsacc18
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:tm: launching vm
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:tm: final top-level argv:
>>>>>>>> orted -mca ess tm -mca orte_ess_jobid 409796608 -mca
>>>>>>>> orte_ess_vpid <template> -mca orte_ess_num_procs 3 -mca orte_hnp_uri
>>>>>>>> "409796608.0;tcp://192.168.222.20:53097" -mca plm_base_verbose 6
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:tm: launching on node grsacc19
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:tm: executing:
>>>>>>>> orted -mca ess tm -mca orte_ess_jobid 409796608 -mca
>>>>>>>> orte_ess_vpid 1 -mca orte_ess_num_procs 3 -mca orte_hnp_uri
>>>>>>>> "409796608.0;tcp://192.168.222.20:53097" -mca plm_base_verbose 6
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:tm: launching on node grsacc18
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:tm: executing:
>>>>>>>> orted -mca ess tm -mca orte_ess_jobid 409796608 -mca
>>>>>>>> orte_ess_vpid 2 -mca orte_ess_num_procs 3 -mca orte_hnp_uri
>>>>>>>> "409796608.0;tcp://192.168.222.20:53097" -mca plm_base_verbose 6
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:tm:launch: finished spawning orteds
>>>>>>>> [grsacc19:28754] mca:base:select:( plm) Querying component [rsh]
>>>>>>>> [grsacc19:28754] [[6253,0],1] plm:rsh_lookup on agent ssh : rsh path
>>>>>>>> NULL
>>>>>>>> [grsacc19:28754] mca:base:select:( plm) Query of component [rsh] set
>>>>>>>> priority to 10
>>>>>>>> [grsacc19:28754] mca:base:select:( plm) Selected component [rsh]
>>>>>>>> [grsacc19:28754] [[6253,0],1] plm:rsh_setup on agent ssh : rsh path
>>>>>>>> NULL
>>>>>>>> [grsacc19:28754] [[6253,0],1] plm:base:receive start comm
>>>>>>>> [grsacc19:28754] [[6253,0],1] plm:base:receive stop comm
>>>>>>>> [grsacc18:16648] mca:base:select:( plm) Querying component [rsh]
>>>>>>>> [grsacc18:16648] [[6253,0],2] plm:rsh_lookup on agent ssh : rsh path
>>>>>>>> NULL
>>>>>>>> [grsacc18:16648] mca:base:select:( plm) Query of component [rsh] set
>>>>>>>> priority to 10
>>>>>>>> [grsacc18:16648] mca:base:select:( plm) Selected component [rsh]
>>>>>>>> [grsacc18:16648] [[6253,0],2] plm:rsh_setup on agent ssh : rsh path
>>>>>>>> NULL
>>>>>>>> [grsacc18:16648] [[6253,0],2] plm:base:receive start comm
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:orted_report_launch from daemon
>>>>>>>> [[6253,0],2]
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:orted_report_launch from daemon
>>>>>>>> [[6253,0],2] on node grsacc18
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:orted_report_launch completed
>>>>>>>> for daemon [[6253,0],2] at contact
>>>>>>>> 409796608.2;tcp://192.168.222.18:47974
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:launch_apps for job [6253,2]
>>>>>>>> [grsacc20:04511] 1 more process has sent help message
>>>>>>>> help-mpi-btl-openib.txt / no active ports found
>>>>>>>> [grsacc20:04511] Set MCA parameter "orte_base_help_aggregate" to 0 to
>>>>>>>> see all help / error messages
>>>>>>>> [grsacc20:04511] 1 more process has sent help message
>>>>>>>> help-mpi-btl-base.txt / btl:no-nics
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:receive processing msg
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:receive update proc state
>>>>>>>> command from [[6253,0],2]
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:receive got update_proc_state
>>>>>>>> for job [6253,2]
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:receive got update_proc_state
>>>>>>>> for vpid 0 state RUNNING exit_code 0
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:receive done processing commands
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:launch wiring up iof for job
>>>>>>>> [6253,2]
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:receive processing msg
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:receive done processing commands
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:launch registered event
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:launch sending dyn release of
>>>>>>>> job [6253,2] to [[6253,1],0]
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:orted_cmd sending orted_exit
>>>>>>>> commands
>>>>>>>> [grsacc19:28747] [[6253,0],1] plm:base:receive stop comm
>>>>>>>> [grsacc20:04511] [[6253,0],0] plm:base:receive stop comm
>>>>>>>> -bash-4.1$ [grsacc18:16648] [[6253,0],2] plm:base:receive stop comm
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> On Sep 23, 2013, at 1:55 AM, Ralph Castain wrote:
>>>>>>>>
>>>>>>>>> Found a bug in the Torque support - we were trying to connect to the
>>>>>>>>> MOM again, which would hang (I imagine). I pushed a fix to the trunk
>>>>>>>>> (r29227) and scheduled it to come to 1.7.3 if you want to try it
>>>>>>>>> again.
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> On Sep 22, 2013, at 4:21 PM, Suraj Prabhakaran
>>>>>>>>> <suraj.prabhaka...@gmail.com> wrote:
>>>>>>>>>
>>>>>>>>>> Dear Ralph,
>>>>>>>>>>
>>>>>>>>>> This is the output I get when I execute with the verbose option.
>>>>>>>>>>
>>>>>>>>>> [grsacc20:21012] [[23526,0],0] plm:base:receive processing msg
>>>>>>>>>> [grsacc20:21012] [[23526,0],0] plm:base:receive job launch command
>>>>>>>>>> from [[23526,1],0]
>>>>>>>>>> [grsacc20:21012] [[23526,0],0] plm:base:receive adding hosts
>>>>>>>>>> [grsacc20:21012] [[23526,0],0] plm:base:receive calling spawn
>>>>>>>>>> [grsacc20:21012] [[23526,0],0] plm:base:receive done processing
>>>>>>>>>> commands
>>>>>>>>>> [grsacc20:21012] [[23526,0],0] plm:base:setup_job
>>>>>>>>>> [grsacc20:21012] [[23526,0],0] plm:base:setup_vm
>>>>>>>>>> [grsacc20:21012] [[23526,0],0] plm:base:setup_vm add new daemon
>>>>>>>>>> [[23526,0],2]
>>>>>>>>>> [grsacc20:21012] [[23526,0],0] plm:base:setup_vm assigning new
>>>>>>>>>> daemon [[23526,0],2] to node grsacc17/1-4
>>>>>>>>>> [grsacc20:21012] [[23526,0],0] plm:base:setup_vm add new daemon
>>>>>>>>>> [[23526,0],3]
>>>>>>>>>> [grsacc20:21012] [[23526,0],0] plm:base:setup_vm assigning new
>>>>>>>>>> daemon [[23526,0],3] to node grsacc17/0-5
>>>>>>>>>> [grsacc20:21012] [[23526,0],0] plm:tm: launching vm
>>>>>>>>>> [grsacc20:21012] [[23526,0],0] plm:tm: final top-level argv:
>>>>>>>>>> orted -mca ess tm -mca orte_ess_jobid 1541799936 -mca
>>>>>>>>>> orte_ess_vpid <template> -mca orte_ess_num_procs 4 -mca orte_hnp_uri
>>>>>>>>>> "1541799936.0;tcp://192.168.222.20:49049" -mca plm_base_verbose 5
>>>>>>>>>> [warn] opal_libevent2021_event_base_loop: reentrant invocation.
>>>>>>>>>> Only one event_base_loop can run on each event_base at once.
>>>>>>>>>> [grsacc20:21012] [[23526,0],0] plm:base:orted_cmd sending orted_exit
>>>>>>>>>> commands
>>>>>>>>>> [grsacc20:21012] [[23526,0],0] plm:base:receive stop comm
>>>>>>>>>>
>>>>>>>>>> Does this tell you anything?
>>>>>>>>>>
>>>>>>>>>> Best,
>>>>>>>>>> Suraj
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> On Sep 22, 2013, at 9:45 PM, Ralph Castain wrote:
>>>>>>>>>>
>>>>>>>>>>> I'll still need to look at the intercomm_create issue, but I just
>>>>>>>>>>> tested both the trunk and current 1.7.3 branch for "add-host" and
>>>>>>>>>>> both worked just fine. This was on my little test cluster which
>>>>>>>>>>> only has rsh available - no Torque.
>>>>>>>>>>>
>>>>>>>>>>> You might add "-mca plm_base_verbose 5" to your cmd line to get
>>>>>>>>>>> some debug output as to the problem.
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>> On Sep 21, 2013, at 5:48 PM, Ralph Castain <r...@open-mpi.org>
>>>>>>>>>>> wrote:
>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>> On Sep 21, 2013, at 4:54 PM, Suraj Prabhakaran
>>>>>>>>>>>> <suraj.prabhaka...@gmail.com> wrote:
>>>>>>>>>>>>
>>>>>>>>>>>>> Dear all,
>>>>>>>>>>>>>
>>>>>>>>>>>>> Thanks a lot for your efforts. I also downloaded the trunk
>>>>>>>>>>>>> to check if it works for my case and as of revision 29215, it
>>>>>>>>>>>>> works for the original case I reported. Although it works, I
>>>>>>>>>>>>> still see the following in the output. Does it mean anything?
>>>>>>>>>>>>> [grsacc17][[13611,1],0][btl_openib_proc.c:157:mca_btl_openib_proc_create]
>>>>>>>>>>>>> [btl_openib_proc.c:157] ompi_modex_recv failed for peer
>>>>>>>>>>>>> [[13611,2],0]
>>>>>>>>>>>>
>>>>>>>>>>>> Yes - it means we don't quite have this right yet :-(
>>>>>>>>>>>>
>>>>>>>>>>>>>
>>>>>>>>>>>>> However, on another topic relevant to my use case, I have another
>>>>>>>>>>>>> problem to report. I am having problems using the "add-host" info
>>>>>>>>>>>>> to the MPI_Comm_spawn() when MPI is compiled with support for
>>>>>>>>>>>>> Torque resource manager. This problem is totally new in the 1.7
>>>>>>>>>>>>> series and it worked perfectly until 1.6.5
>>>>>>>>>>>>>
>>>>>>>>>>>>> Basically, I am working on implementing dynamic resource
>>>>>>>>>>>>> management facilities in the Torque/Maui batch system. Through a
>>>>>>>>>>>>> new tm call, an application can get new resources for a job.
>>>>>>>>>>>>
>>>>>>>>>>>> FWIW: you'll find that we added an API to the orte RAS framework
>>>>>>>>>>>> to support precisely that operation. It allows an application to
>>>>>>>>>>>> request that we dynamically obtain additional resources during
>>>>>>>>>>>> execution (e.g., as part of a Comm_spawn call via an info_key). We
>>>>>>>>>>>> originally implemented this with Slurm, but you could add the
>>>>>>>>>>>> calls into the Torque component as well if you like.
>>>>>>>>>>>>
>>>>>>>>>>>> This is in the trunk now - will come over to 1.7.4
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>>> I want to use MPI_Comm_spawn() to spawn new processes in the new
>>>>>>>>>>>>> hosts. With my extended torque/maui batch system, I was able to
>>>>>>>>>>>>> perfectly use the "add-host" info argument to MPI_Comm_spawn() to
>>>>>>>>>>>>> spawn new processes on these hosts. Since MPI and Torque refer to
>>>>>>>>>>>>> the hosts through the nodeids, I made sure that OpenMPI uses the
>>>>>>>>>>>>> correct nodeid's for these new hosts.
>>>>>>>>>>>>> Until 1.6.5, this worked perfectly fine, except that due to the
>>>>>>>>>>>>> Intercomm_merge problem, I could not really run a real
>>>>>>>>>>>>> application to its completion.
>>>>>>>>>>>>>
>>>>>>>>>>>>> While this is now fixed in the trunk, I found that when using the
>>>>>>>>>>>>> "add-host" info argument, everything collapses after printing out
>>>>>>>>>>>>> the following error.
>>>>>>>>>>>>>
>>>>>>>>>>>>> [warn] opal_libevent2021_event_base_loop: reentrant invocation.
>>>>>>>>>>>>> Only one event_base_loop can run on each event_base at once.
>>>>>>>>>>>>
>>>>>>>>>>>> I'll take a look - probably some stale code that hasn't been
>>>>>>>>>>>> updated yet for async ORTE operations
>>>>>>>>>>>>
>>>>>>>>>>>>>
>>>>>>>>>>>>> And due to this, I am still not really able to run my
>>>>>>>>>>>>> application! I also compiled the MPI without any Torque/PBS
>>>>>>>>>>>>> support and just used the "add-host" argument normally. Again,
>>>>>>>>>>>>> this worked perfectly in 1.6.5. In the 1.7 series it still works,
>>>>>>>>>>>>> but only after printing out the following error.
>>>>>>>>>>>>>
>>>>>>>>>>>>> [grsacc17][[13731,1],0][btl_openib_proc.c:157:mca_btl_openib_proc_create]
>>>>>>>>>>>>> [btl_openib_proc.c:157] ompi_modex_recv failed for peer
>>>>>>>>>>>>> [[13731,2],0]
>>>>>>>>>>>>> [grsacc17][[13731,1],1][btl_openib_proc.c:157:mca_btl_openib_proc_create]
>>>>>>>>>>>>> [btl_openib_proc.c:157] ompi_modex_recv failed for peer
>>>>>>>>>>>>> [[13731,2],0]
>>>>>>>>>>>>
>>>>>>>>>>>> Yeah, the 1.7 series doesn't have the reentrant test in it - so we
>>>>>>>>>>>> "illegally" re-enter libevent. The error again means we don't have
>>>>>>>>>>>> Intercomm_create correct just yet.
>>>>>>>>>>>>
>>>>>>>>>>>> I'll see what I can do about this and get back to you
>>>>>>>>>>>>
>>>>>>>>>>>>>
>>>>>>>>>>>>> In short, with pbs/torque support it fails, and without
>>>>>>>>>>>>> pbs/torque support it runs after printing the above lines.
>>>>>>>>>>>>>
>>>>>>>>>>>>> I would really appreciate some help on this, since I need these
>>>>>>>>>>>>> features to actually test my case and (at least in my short
>>>>>>>>>>>>> experience) no other MPI implementation seems friendly to such
>>>>>>>>>>>>> dynamic scenarios.
>>>>>>>>>>>>>
>>>>>>>>>>>>> Thanks a lot!
>>>>>>>>>>>>>
>>>>>>>>>>>>> Best,
>>>>>>>>>>>>> Suraj
>>>>>>>>>>>>>
>>>>>>>>>>>>>
>>>>>>>>>>>>>
>>>>>>>>>>>>> On Sep 20, 2013, at 4:58 PM, Jeff Squyres (jsquyres) wrote:
>>>>>>>>>>>>>
>>>>>>>>>>>>>> Just to close my end of this loop: as of trunk r29213, it all
>>>>>>>>>>>>>> works for me. Thanks!
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> On Sep 18, 2013, at 12:52 PM, Ralph Castain <r...@open-mpi.org>
>>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Thanks George - much appreciated
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> On Sep 18, 2013, at 9:49 AM, George Bosilca
>>>>>>>>>>>>>>> <bosi...@icl.utk.edu> wrote:
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> The test case was broken. I just pushed a fix.
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> George.
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> On Sep 18, 2013, at 16:49 , Ralph Castain <r...@open-mpi.org>
>>>>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> Hangs with any np > 1
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> However, I'm not sure if that's an issue with the test vs the
>>>>>>>>>>>>>>>>> underlying implementation
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> On Sep 18, 2013, at 7:40 AM, "Jeff Squyres (jsquyres)"
>>>>>>>>>>>>>>>>> <jsquy...@cisco.com> wrote:
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> Does it hang when you run with -np 4?
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> Sent from my phone. No type good.
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> On Sep 18, 2013, at 4:10 PM, "Ralph Castain"
>>>>>>>>>>>>>>>>>> <r...@open-mpi.org> wrote:
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>> Strange - it works fine for me on my Mac. However, I see
>>>>>>>>>>>>>>>>>>> one difference - I only run it with np=1
>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>> On Sep 18, 2013, at 2:22 AM, Jeff Squyres (jsquyres)
>>>>>>>>>>>>>>>>>>> <jsquy...@cisco.com> wrote:
>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> On Sep 18, 2013, at 9:33 AM, George Bosilca
>>>>>>>>>>>>>>>>>>>> <bosi...@icl.utk.edu> wrote:
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>>> 1. sm doesn't work between spawned processes. So you must
>>>>>>>>>>>>>>>>>>>>> have another network enabled.
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> I know :-). I have tcp available as well (OMPI will abort
>>>>>>>>>>>>>>>>>>>> if you only run with sm,self because the comm_spawn will
>>>>>>>>>>>>>>>>>>>> fail with unreachable errors -- I just tested/proved this
>>>>>>>>>>>>>>>>>>>> to myself).
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>>> 2. Don't use the test case attached to my email, I left
>>>>>>>>>>>>>>>>>>>>> an xterm based spawn and the debugging. It can't work
>>>>>>>>>>>>>>>>>>>>> without xterm support. Instead try using the test case
>>>>>>>>>>>>>>>>>>>>> from the trunk, the one committed by Ralph.
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> I didn't see any "xterm" strings in there, but ok. :-) I
>>>>>>>>>>>>>>>>>>>> ran with orte/test/mpi/intercomm_create.c, and that hangs
>>>>>>>>>>>>>>>>>>>> for me as well:
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> -----
>>>>>>>>>>>>>>>>>>>> ❯❯❯ mpicc intercomm_create.c -o intercomm_create
>>>>>>>>>>>>>>>>>>>> ❯❯❯ mpirun -np 4 intercomm_create
>>>>>>>>>>>>>>>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL,
>>>>>>>>>>>>>>>>>>>> 201, &inter) [rank 4]
>>>>>>>>>>>>>>>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL,
>>>>>>>>>>>>>>>>>>>> 201, &inter) [rank 5]
>>>>>>>>>>>>>>>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL,
>>>>>>>>>>>>>>>>>>>> 201, &inter) [rank 6]
>>>>>>>>>>>>>>>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL,
>>>>>>>>>>>>>>>>>>>> 201, &inter) [rank 7]
>>>>>>>>>>>>>>>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201,
>>>>>>>>>>>>>>>>>>>> &inter) [rank 4]
>>>>>>>>>>>>>>>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201,
>>>>>>>>>>>>>>>>>>>> &inter) [rank 5]
>>>>>>>>>>>>>>>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201,
>>>>>>>>>>>>>>>>>>>> &inter) [rank 6]
>>>>>>>>>>>>>>>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201,
>>>>>>>>>>>>>>>>>>>> &inter) [rank 7]
>>>>>>>>>>>>>>>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201,
>>>>>>>>>>>>>>>>>>>> &inter) (0)
>>>>>>>>>>>>>>>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201,
>>>>>>>>>>>>>>>>>>>> &inter) (0)
>>>>>>>>>>>>>>>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201,
>>>>>>>>>>>>>>>>>>>> &inter) (0)
>>>>>>>>>>>>>>>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201,
>>>>>>>>>>>>>>>>>>>> &inter) (0)
>>>>>>>>>>>>>>>>>>>> [hang]
>>>>>>>>>>>>>>>>>>>> -----
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> Similarly, on my Mac, it hangs with no output:
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> -----
>>>>>>>>>>>>>>>>>>>> ❯❯❯ mpicc intercomm_create.c -o intercomm_create
>>>>>>>>>>>>>>>>>>>> ❯❯❯ mpirun -np 4 intercomm_create
>>>>>>>>>>>>>>>>>>>> [hang]
>>>>>>>>>>>>>>>>>>>> -----
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>>> George.
>>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>>> On Sep 18, 2013, at 07:53 , "Jeff Squyres (jsquyres)"
>>>>>>>>>>>>>>>>>>>>> <jsquy...@cisco.com> wrote:
>>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>>>> George --
>>>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>>>> When I build the SVN trunk (r29201) on 64 bit linux,
>>>>>>>>>>>>>>>>>>>>>> your attached test case hangs:
>>>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>>>> -----
>>>>>>>>>>>>>>>>>>>>>> ❯❯❯ mpicc intercomm_create.c -o intercomm_create
>>>>>>>>>>>>>>>>>>>>>> ❯❯❯ mpirun -np 4 intercomm_create
>>>>>>>>>>>>>>>>>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL,
>>>>>>>>>>>>>>>>>>>>>> 201, &inter) [rank 4]
>>>>>>>>>>>>>>>>>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL,
>>>>>>>>>>>>>>>>>>>>>> 201, &inter) [rank 5]
>>>>>>>>>>>>>>>>>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL,
>>>>>>>>>>>>>>>>>>>>>> 201, &inter) [rank 6]
>>>>>>>>>>>>>>>>>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL,
>>>>>>>>>>>>>>>>>>>>>> 201, &inter) [rank 7]
>>>>>>>>>>>>>>>>>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201,
>>>>>>>>>>>>>>>>>>>>>> &inter) (0)
>>>>>>>>>>>>>>>>>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201,
>>>>>>>>>>>>>>>>>>>>>> &inter) (0)
>>>>>>>>>>>>>>>>>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201,
>>>>>>>>>>>>>>>>>>>>>> &inter) (0)
>>>>>>>>>>>>>>>>>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201,
>>>>>>>>>>>>>>>>>>>>>> &inter) (0)
>>>>>>>>>>>>>>>>>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0,
>>>>>>>>>>>>>>>>>>>>>> 201, &inter) [rank 4]
>>>>>>>>>>>>>>>>>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0,
>>>>>>>>>>>>>>>>>>>>>> 201, &inter) [rank 5]
>>>>>>>>>>>>>>>>>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0,
>>>>>>>>>>>>>>>>>>>>>> 201, &inter) [rank 6]
>>>>>>>>>>>>>>>>>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0,
>>>>>>>>>>>>>>>>>>>>>> 201, &inter) [rank 7]
>>>>>>>>>>>>>>>>>>>>>> [hang]
>>>>>>>>>>>>>>>>>>>>>> -----
>>>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>>>> On my Mac, it hangs without printing anything:
>>>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>>>> -----
>>>>>>>>>>>>>>>>>>>>>> ❯❯❯ mpicc intercomm_create.c -o intercomm_create
>>>>>>>>>>>>>>>>>>>>>> ❯❯❯ mpirun -np 4 intercomm_create
>>>>>>>>>>>>>>>>>>>>>> [hang]
>>>>>>>>>>>>>>>>>>>>>> -----
>>>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>>>> On Sep 18, 2013, at 1:48 AM, George Bosilca
>>>>>>>>>>>>>>>>>>>>>> <bosi...@icl.utk.edu> wrote:
>>>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>>>>> Here is a quick (and definitely not the cleanest)
>>>>>>>>>>>>>>>>>>>>>>> patch that addresses the MPI_Intercomm issue at the MPI
>>>>>>>>>>>>>>>>>>>>>>> level. It should be applied after removal of 29166.
>>>>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>>>>> I also added the corrected test case stressing the
>>>>>>>>>>>>>>>>>>>>>>> corner cases by doing barriers at every inter-comm
>>>>>>>>>>>>>>>>>>>>>>> creation and doing a clean disconnect.
>>>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>>>> --
>>>>>>>>>>>>>>>>>>>>>> Jeff Squyres
>>>>>>>>>>>>>>>>>>>>>> jsquy...@cisco.com
>>>>>>>>>>>>>>>>>>>>>> For corporate legal information go to:
>>>>>>>>>>>>>>>>>>>>>> http://www.cisco.com/web/about/doing_business/legal/cri/
>>>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>>>> _______________________________________________
>>>>>>>>>>>>>>>>>>>>>> devel mailing list
>>>>>>>>>>>>>>>>>>>>>> de...@open-mpi.org
>>>>>>>>>>>>>>>>>>>>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel