On Mon, 2006-11-27 at 21:11 -0500, George Bosilca wrote: > Which version of Open MPI are you using ? We can figure out what's > wrong if we have the output of "ompi_info" and "ompi_info --param all > all".
Forgot the "ompi_info --param all all". It's attached. - Matt > > I wonder if some of the memory is not related to the size of the > shared memory file. The default way to compute the size of the shared > memory file is defined by the MCA parameter mpool_sm_per_peer_size. > By default is set to 128MB for each local peer. Therefore using 2048 > procs on 256 nodes lead to using 8 procs by node i.e. at least 1GB > only for the SM file. The problem right now with the SM file is that > we're not reusing the buffers multiple times, instead we're using a > new fragment each time we send a message, forcing the OS to map the > entire file at one point. > > george. > > On Nov 27, 2006, at 8:21 PM, Matt Leininger wrote: > > > On Mon, 2006-11-27 at 16:45 -0800, Matt Leininger wrote: > >> Has anyone testing OMPI's alltoall at > 2000 MPI tasks? I'm > >> seeing each > >> MPI task eat up > 1GB of memory (just for OMPI - not the app). > > > > I gathered some more data using the alltoall benchmark in mpiBench. > > mpiBench is pretty smart about how large its buffers are. I set it to > > use <= 100MB. > > > > num nodes num MPI tasks system mem mpibench buffer mem > > 128 1024 1 GB 65 MB > > 160 1280 1.2 GB 82 MB > > 192 1536 1.4 GB 98 MB > > 224 1792 1.6 GB 57 MB > > 256 2048 1.6-1.8 GB < 100 MB > > > > The 256 node run was killed by the OOM for using too much memory. For > > all these tests the OMPI alltoall is using 1 GB or more of system > > memory. I know LANL is looking into optimized alltoall, but is anyone > > looking into the scalability of the memory footprint? > > > > Thanks, > > > > - Matt > > > >> > >> Thanks, > >> > >> - Matt > >> > >> > >> > >> _______________________________________________ > >> devel mailing list > >> de...@open-mpi.org > >> http://www.open-mpi.org/mailman/listinfo.cgi/devel > >> > > > > > > _______________________________________________ > > devel mailing list > > de...@open-mpi.org > > http://www.open-mpi.org/mailman/listinfo.cgi/devel > > _______________________________________________ > devel mailing list > de...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/devel >
MCA mca: parameter "mca_param_files" (current value: "/g/g12/mlleinin/.openmpi/mca-params.conf:/g/g12/mlleinin/src/ompi-v1.2b-112506-gcc/etc/openmpi-mca-params.conf") Path for MCA configuration files containing default parameter values MCA mca: parameter "mca_component_path" (current value: "/g/g12/mlleinin/src/ompi-v1.2b-112506-gcc/lib/openmpi:/g/g12/mlleinin/.openmpi/components") Path where to look for Open MPI and ORTE components MCA mca: parameter "mca_verbose" (current value: <none>) Top-level verbosity parameter MCA mca: parameter "mca_component_show_load_errors" (current value: "1") Whether to show errors for components that failed to load or not MCA mca: parameter "mca_component_disable_dlopen" (current value: "0") Whether to attempt to disable opening dynamic components or not MCA mpi: parameter "mpi_param_check" (current value: "1") Whether you want MPI API parameters checked at run-time or not. Possible values are 0 (no checking) and 1 (perform checking at run-time) MCA mpi: parameter "mpi_yield_when_idle" (current value: "0") Yield the processor when waiting for MPI communication (for MPI processes, will default to 1 when oversubscribing nodes) MCA mpi: parameter "mpi_event_tick_rate" (current value: "-1") How often to progress TCP communications (0 = never, otherwise specified in microseconds) MCA mpi: parameter "mpi_show_handle_leaks" (current value: "0") Whether MPI_FINALIZE shows all MPI handles that were not freed or not MCA mpi: parameter "mpi_no_free_handles" (current value: "0") Whether to actually free MPI objects when their handles are freed MCA mpi: parameter "mpi_show_mca_params" (current value: "0") Whether to show all MCA parameter value during MPI_INIT or not (good for reproducability of MPI jobs) MCA mpi: parameter "mpi_show_mca_params_file" (current value: <none>) If mpi_show_mca_params is true, setting this string to a valid filename tells Open MPI to dump all the MCA parameter values into a file suitable for reading via the mca_param_files parameter (good for reproducability of MPI jobs) MCA mpi: parameter "mpi_paffinity_alone" (current value: "0") If nonzero, assume that this job is the only (set of) process(es) running on each node and bind processes to processors, starting with processor ID 0 MCA mpi: parameter "mpi_keep_peer_hostnames" (current value: "1") If nonzero, save the string hostnames of all MPI peer processes (mostly for error / debugging output messages). This can add quite a bit of memory usage to each MPI process. MCA mpi: parameter "mpi_abort_delay" (current value: "0") If nonzero, print out an identifying message when MPI_ABORT is invoked (hostname, PID of the process that called MPI_ABORT) and delay for that many seconds before exiting (a negative delay value means to never abort). This allows attaching of a debugger before quitting the job. MCA mpi: parameter "mpi_abort_print_stack" (current value: "0") If nonzero, print out a stack trace when MPI_ABORT is invoked MCA mpi: parameter "mpi_preconnect_all" (current value: "0") Whether to force MPI processes to create connections / warmup with *all* peers during MPI_INIT (vs. making connections lazily -- upon the first MPI traffic between each process peer pair) MCA mpi: parameter "mpi_ddt_unpack_debug" (current value: "0") Whether to output debugging information in the ddt unpack functions (nonzero = enabled) MCA mpi: parameter "mpi_ddt_pack_debug" (current value: "0") Whether to output debugging information in the ddt pack functions (nonzero = enabled) MCA mpi: parameter "mpi_ddt_position_debug" (current value: "0") Non zero lead to output generated by the datatype position functions MCA mpi: parameter "mpi_ddt_copy_debug" (current value: "0") Whether to output debugging information in the ddt copy functions (nonzero = enabled) MCA mpi: parameter "mpi_leave_pinned" (current value: "0") leave_pinned MCA mpi: parameter "mpi_leave_pinned_pipeline" (current value: "0") leave_pinned_pipeline MCA orte: parameter "orte_base_user_debugger" (current value: "totalview @mpirun@ -a @mpirun_args@ : fxp @mpirun@ -a @mpirun_args@") Sequence of user-level debuggers to search for in orterun MCA orte: parameter "orte_debug" (current value: "0") Whether or not to enable debugging output for all ORTE components (0 or 1) MCA orte: parameter "orte_debug_daemons" (current value: "0") Whether or not to enable debugging of daemons (0 or 1) MCA orte: parameter "orte_timing" (current value: "0") Request that critical timing loops be measured MCA opal: parameter "opal_signal" (current value: "6,7,8,11") If a signal is received, display the stack trace frame MCA backtrace: parameter "backtrace" (current value: <none>) Default selection set of components for the backtrace framework (<none> means "use all components that can be found") MCA backtrace: parameter "backtrace_base_verbose" (current value: "0") Verbosity level for the backtrace framework (0 = no verbosity) MCA backtrace: parameter "backtrace_execinfo_priority" (current value: "0") MCA memory: parameter "memory" (current value: <none>) Default selection set of components for the memory framework (<none> means "use all components that can be found") MCA memory: parameter "memory_base_verbose" (current value: "0") Verbosity level for the memory framework (0 = no verbosity) MCA memory: parameter "memory_ptmalloc2_priority" (current value: "0") MCA paffinity: parameter "paffinity" (current value: <none>) Default selection set of components for the paffinity framework (<none> means "use all components that can be found") MCA paffinity: parameter "paffinity_linux_priority" (current value: "10") Priority of the linux paffinity component MCA paffinity: information "paffinity_linux_have_cpu_set_t" (value: "1") Whether this component was compiled on a system with the type cpu_set_t or not (1 = yes, 0 = no) MCA paffinity: information "paffinity_linux_CPU_ZERO_ok" (value: "1") Whether this component was compiled on a system where CPU_ZERO() is functional or broken (1 = functional, 0 = broken/not available) MCA paffinity: information "paffinity_linux_sched_setaffinity_num_params" (value: "3") The number of parameters that sched_set_affinity() takes on the machine where this component was compiled MCA maffinity: parameter "maffinity" (current value: <none>) Default selection set of components for the maffinity framework (<none> means "use all components that can be found") MCA maffinity: parameter "maffinity_first_use_priority" (current value: "10") Priority of the first_use maffinity component MCA timer: parameter "timer" (current value: <none>) Default selection set of components for the timer framework (<none> means "use all components that can be found") MCA timer: parameter "timer_base_verbose" (current value: "0") Verbosity level for the timer framework (0 = no verbosity) MCA timer: parameter "timer_linux_priority" (current value: "0") MCA allocator: parameter "allocator" (current value: <none>) Default selection set of components for the allocator framework (<none> means "use all components that can be found") MCA allocator: parameter "allocator_base_verbose" (current value: "0") Verbosity level for the allocator framework (0 = no verbosity) MCA allocator: parameter "allocator_basic_priority" (current value: "0") MCA allocator: parameter "allocator_bucket_num_buckets" (current value: "30") MCA allocator: parameter "allocator_bucket_priority" (current value: "0") MCA coll: parameter "coll" (current value: <none>) Default selection set of components for the coll framework (<none> means "use all components that can be found") MCA coll: parameter "coll_base_verbose" (current value: "0") Verbosity level for the coll framework (0 = no verbosity) MCA coll: parameter "coll_basic_priority" (current value: "10") Priority of the basic coll component MCA coll: parameter "coll_basic_crossover" (current value: "4") Minimum number of processes in a communicator before using the logarithmic algorithms MCA coll: parameter "coll_self_priority" (current value: "75") MCA coll: parameter "coll_sm_priority" (current value: "0") Priority of the sm coll component MCA coll: parameter "coll_sm_control_size" (current value: "4096") Length of the control data -- should usually be either the length of a cache line on most SMPs, or the size of a page on machines that support direct memory affinity page placement (in bytes) MCA coll: parameter "coll_sm_bootstrap_filename" (current value: "shared_mem_sm_bootstrap") Filename (in the Open MPI session directory) of the coll sm component bootstrap rendezvous mmap file MCA coll: parameter "coll_sm_bootstrap_num_segments" (current value: "8") Number of segments in the bootstrap file MCA coll: parameter "coll_sm_fragment_size" (current value: "8192") Fragment size (in bytes) used for passing data through shared memory (will be rounded up to the nearest control_size size) MCA coll: parameter "coll_sm_mpool" (current value: "sm") Name of the mpool component to use MCA coll: parameter "coll_sm_comm_in_use_flags" (current value: "2") Number of "in use" flags, used to mark a message passing area segment as currently being used or not (must be >= 2 and <= comm_num_segments) MCA coll: parameter "coll_sm_comm_num_segments" (current value: "8") Number of segments in each communicator's shared memory message passing area (must be >= 2, and must be a multiple of comm_in_use_flags) MCA coll: parameter "coll_sm_tree_degree" (current value: "4") Degree of the tree for tree-based operations (must be => 1 and <= min(control_size, 255)) MCA coll: information "coll_sm_shared_mem_used_bootstrap" (value: "216") Amount of shared memory used in the shared memory bootstrap area (in bytes) MCA coll: parameter "coll_sm_info_num_procs" (current value: "4") Number of processes to use for the calculation of the shared_mem_size MCA information parameter (must be => 2) MCA coll: information "coll_sm_shared_mem_used_data" (value: "548864") Amount of shared memory used in the shared memory data area for info_num_procs processes (in bytes) MCA coll: parameter "coll_tuned_priority" (current value: "30") Priority of the tuned coll component MCA coll: parameter "coll_tuned_pre_allocate_memory_comm_size_limit" (current value: "32768") Size of communicator were we stop pre-allocating memory for the fixed internal buffer used for message requests etc that is hung off the communicator data segment. I.e. if you have a 100'000 nodes you might not want to pre-allocate 200'000 request handle slots per communicator instance! MCA coll: parameter "coll_tuned_use_dynamic_rules" (current value: "0") Switch used to decide if we use static (compiled/if statements) or dynamic (built at runtime) decision function rules MCA coll: parameter "coll_tuned_init_tree_fanout" (current value: "4") Inital fanout used in the tree topologies for each communicator. This is only an initial guess, if a tuned collective needs a different fanout for an operation, it build it dynamically. This parameter is only for the first guess and might save a little time MCA coll: parameter "coll_tuned_init_chain_fanout" (current value: "4") Inital fanout used in the chain (fanout followed by pipeline) topologies for each communicator. This is only an initial guess, if a tuned collective needs a different fanout for an operation, it build it dynamically. This parameter is only for the first guess and might save a little time MCA io: parameter "io_base_freelist_initial_size" (current value: "16") Initial MPI-2 IO request freelist size MCA io: parameter "io_base_freelist_max_size" (current value: "64") Max size of the MPI-2 IO request freelist MCA io: parameter "io_base_freelist_increment" (current value: "16") Increment size of the MPI-2 IO request freelist MCA io: parameter "io" (current value: <none>) Default selection set of components for the io framework (<none> means "use all components that can be found") MCA io: parameter "io_base_verbose" (current value: "0") Verbosity level for the io framework (0 = no verbosity) MCA io: parameter "io_romio_priority" (current value: "10") Priority of the io romio component MCA io: parameter "io_romio_delete_priority" (current value: "10") Delete priority of the io romio component MCA io: parameter "io_romio_enable_parallel_optimizations" (current value: "0") Enable set of Open MPI-added options to improve collective file i/o performance MCA mpool: parameter "mpool" (current value: <none>) Default selection set of components for the mpool framework (<none> means "use all components that can be found") MCA mpool: parameter "mpool_base_verbose" (current value: "0") Verbosity level for the mpool framework (0 = no verbosity) MCA mpool: parameter "mpool_openib_rcache_name" (current value: "rb") The name of the registration cache the mpool should use MCA mpool: parameter "mpool_openib_priority" (current value: "0") MCA mpool: parameter "mpool_sm_allocator" (current value: "bucket") Name of allocator component to use with sm mpool MCA mpool: parameter "mpool_sm_max_size" (current value: "536870912") Maximum size of the sm mpool shared memory file MCA mpool: parameter "mpool_sm_min_size" (current value: "134217728") Minimum size of the sm mpool shared memory file MCA mpool: parameter "mpool_sm_per_peer_size" (current value: "33554432") Size (in bytes) to allocate per local peer in the sm mpool shared memory file, bounded by min_size and max_size MCA mpool: parameter "mpool_sm_priority" (current value: "0") MCA mpool: parameter "mpool_udapl_priority" (current value: "0") MCA mpool: parameter "mpool_base_use_mem_hooks" (current value: "0") use memory hooks for deregistering freed memory MCA mpool: parameter "mpool_use_mem_hooks" (current value: "0") (deprecated, use mpool_base_use_mem_hooks) MCA pml: parameter "pml" (current value: "ob1") Default selection set of components for the pml framework (<none> means "use all components that can be found") MCA pml: parameter "pml_base_verbose" (current value: "0") Verbosity level for the pml framework (0 = no verbosity) MCA pml: parameter "pml_cm_free_list_num" (current value: "4") Initial size of request free lists MCA pml: parameter "pml_cm_free_list_max" (current value: "-1") Maximum size of request free lists MCA pml: parameter "pml_cm_free_list_inc" (current value: "64") Number of elements to add when growing request free lists MCA pml: parameter "pml_cm_priority" (current value: "1") CM PML selection priority MCA pml: parameter "pml_dr_free_list_num" (current value: "4") MCA pml: parameter "pml_dr_free_list_max" (current value: "-1") MCA pml: parameter "pml_dr_free_list_inc" (current value: "64") MCA pml: parameter "pml_dr_priority" (current value: "1") MCA pml: parameter "pml_dr_eager_limit" (current value: "131072") MCA pml: parameter "pml_dr_send_pipeline_depth" (current value: "3") MCA pml: parameter "pml_dr_wdog_timer_sec" (current value: "5") MCA pml: parameter "pml_dr_wdog_timer_usec" (current value: "0") MCA pml: parameter "pml_dr_wdog_timer_multiplier" (current value: "1") MCA pml: parameter "pml_dr_wdog_retry_max" (current value: "1") MCA pml: parameter "pml_dr_ack_timer_sec" (current value: "10") MCA pml: parameter "pml_dr_ack_timer_usec" (current value: "0") MCA pml: parameter "pml_dr_ack_timer_multiplier" (current value: "1") MCA pml: parameter "pml_dr_ack_retry_max" (current value: "3") MCA pml: parameter "pml_dr_enable_csum" (current value: "1") MCA pml: parameter "pml_ob1_free_list_num" (current value: "4") MCA pml: parameter "pml_ob1_free_list_max" (current value: "-1") MCA pml: parameter "pml_ob1_free_list_inc" (current value: "64") MCA pml: parameter "pml_ob1_priority" (current value: "1") MCA pml: parameter "pml_ob1_eager_limit" (current value: "131072") MCA pml: parameter "pml_ob1_send_pipeline_depth" (current value: "3") MCA pml: parameter "pml_ob1_recv_pipeline_depth" (current value: "4") MCA bml: parameter "bml" (current value: <none>) Default selection set of components for the bml framework (<none> means "use all components that can be found") MCA bml: parameter "bml_base_verbose" (current value: "0") Verbosity level for the bml framework (0 = no verbosity) MCA bml: parameter "bml_r2_show_unreach_errors" (current value: "1") Show error message when procs are unreachable MCA bml: parameter "bml_r2_priority" (current value: "0") MCA rcache: parameter "rcache" (current value: <none>) Default selection set of components for the rcache framework (<none> means "use all components that can be found") MCA rcache: parameter "rcache_base_verbose" (current value: "0") Verbosity level for the rcache framework (0 = no verbosity) MCA rcache: parameter "rcache_rb_priority" (current value: "0") MCA rcache: parameter "rcache_vma_mru_len" (current value: "256") The maximum size IN ENTRIES of the MRU (most recently used) rcache list MCA rcache: parameter "rcache_vma_mru_size" (current value: "1073741824") The maximum size IN BYTES of the MRU (most recently used) rcache list MCA rcache: parameter "rcache_vma_priority" (current value: "0") MCA btl: parameter "btl_base_debug" (current value: "0") If btl_base_debug is 1 standard debug is output, if > 1 verbose debug is output MCA btl: parameter "btl" (current value: <none>) Default selection set of components for the btl framework (<none> means "use all components that can be found") MCA btl: parameter "btl_base_verbose" (current value: "0") Verbosity level for the btl framework (0 = no verbosity) MCA btl: parameter "btl_openib_verbose" (current value: "0") Output some verbose OpenIB BTL information (0 = no output, nonzero = output) MCA btl: parameter "btl_openib_warn_no_hca_params_found" (current value: "1") Warn when no HCA-specific parameters are found in the INI file specified by the btl_openib_hca_param_files MCA parameter (0 = do not warn; any other value = warn) MCA btl: parameter "btl_openib_warn_default_gid_prefix" (current value: "1") Warn when there is more than one active ports and at least one of them connected to the network with only default GID prefix configured (0 = do not warn; any other value = warn) MCA btl: parameter "btl_openib_hca_param_files" (current value: "/g/g12/mlleinin/src/ompi-v1.2b-112506-gcc/share/openmpi/mca-btl-openib-hca-params.ini") Colon-delimited list of INI-style files that contain HCA vendor/part-specific parameters MCA btl: parameter "btl_openib_max_btls" (current value: "-1") Maximum number of HCA ports to use (-1 = use all available, otherwise must be >= 1) MCA btl: parameter "btl_openib_free_list_num" (current value: "8") Intial size of free lists (must be >= 1) MCA btl: parameter "btl_openib_free_list_max" (current value: "-1") Maximum size of free lists (-1 = infinite, otherwise must be >= 0) MCA btl: parameter "btl_openib_free_list_inc" (current value: "32") Increment size of free lists (must be >= 1) MCA btl: parameter "btl_openib_mpool" (current value: "openib") Name of the memory pool to be used (it is unlikely that you will ever want to change this MCA btl: parameter "btl_openib_reg_mru_len" (current value: "16") Length of the registration cache most recently used list (must be >= 1) MCA btl: parameter "btl_openib_ib_cq_size" (current value: "1000") Size of the IB completion queue (will automatically be set to a minimum of (2 * number_of_peers * btl_openib_rd_num)) MCA btl: parameter "btl_openib_ib_sg_list_size" (current value: "4") Size of IB segment list (must be >= 1) MCA btl: parameter "btl_openib_ib_pkey_ix" (current value: "0") InfiniBand pkey index (must be >= 0) MCA btl: parameter "btl_openib_ib_psn" (current value: "0") InfiniBand packet sequence starting number (must be >= 0) MCA btl: parameter "btl_openib_ib_qp_ous_rd_atom" (current value: "4") InfiniBand outstanding atomic reads (must be >= 0) MCA btl: parameter "btl_openib_ib_mtu" (current value: "3") IB MTU, in bytes (if not specified in INI files). Valid values are: 1=256 bytes, 2=512 bytes, 3=1024 bytes, 4=2048 bytes, 5=4096 bytes MCA btl: parameter "btl_openib_ib_min_rnr_timer" (current value: "5") InfiniBand minimum "receiver not ready" timer, in seconds (must be >= 1) MCA btl: parameter "btl_openib_ib_timeout" (current value: "10") InfiniBand transmit timeout, in seconds(must be >= 1) MCA btl: parameter "btl_openib_ib_retry_count" (current value: "7") InfiniBand transmit retry count (must be >= 1) MCA btl: parameter "btl_openib_ib_rnr_retry" (current value: "7") InfiniBand "receiver not ready" retry count (must be >= 1) MCA btl: parameter "btl_openib_ib_max_rdma_dst_ops" (current value: "4") InfiniBand maximum pending RDMA destination operations (must be >= 1) MCA btl: parameter "btl_openib_ib_service_level" (current value: "0") InfiniBand service level (must be >= 0) MCA btl: parameter "btl_openib_ib_static_rate" (current value: "0") InfiniBand static rate (must be >= 0; defulat: %d) MCA btl: parameter "btl_openib_exclusivity" (current value: "1024") OpenIB BTL exclusivity (must be >= 0) MCA btl: parameter "btl_openib_rd_num" (current value: "8") Number of receive descriptors to post to a queue pair (must be >= 1) MCA btl: parameter "btl_openib_rd_low" (current value: "6") Low water mark before reposting occurs (must be >= 1) MCA btl: parameter "btl_openib_rd_win" (current value: "4") Window size at which generate explicit credit message (must be >= 1) MCA btl: parameter "btl_openib_use_srq" (current value: "0") If nonzero, use the InfiniBand shared receive queue ("SRQ") MCA btl: parameter "btl_openib_srq_rd_max" (current value: "1000") Maxium number of receive descriptors posted per SRQ (only relevant if btl_openib_use_srq is true; must be >= 1) MCA btl: parameter "btl_openib_srq_rd_per_peer" (current value: "16") Number of receive descriptors posted per peer in the SRQ (only relevant if btl_openib_use_srq is true; must be >= 1) MCA btl: parameter "btl_openib_srq_sd_max" (current value: "8") Maximum number of send descriptors posted (only relevant if btl_openib_use_srq is true; must be >= 1) MCA btl: parameter "btl_openib_use_eager_rdma" (current value: "1") Use RDMA for eager messages MCA btl: parameter "btl_openib_eager_rdma_threshold" (current value: "16") Use RDMA for short messages after this number of messages are received from a given peer (must be >= 1) MCA btl: parameter "btl_openib_max_eager_rdma" (current value: "16") Maximum number of peers allowed to use RDMA for short messages (RDMA is used for all long messages, except if explicitly disabled, such as with the "dr" pml) (must be >= 0) MCA btl: parameter "btl_openib_eager_rdma_num" (current value: "16") Number of RDMA buffers to allocate for small messages(must be >= 1) MCA btl: parameter "btl_openib_btls_per_lid" (current value: "1") Number of BTLs to create for each InfiniBand LID (must be >= 1) MCA btl: parameter "btl_openib_max_lmc" (current value: "0") Maximum number of LIDs to use for each HCA port (must be >= 0, where 0 = use all available) MCA btl: parameter "btl_openib_buffer_alignment" (current value: "64") Prefered communication buffer alignment, in bytes (must be >= 0) MCA btl: parameter "btl_openib_eager_limit" (current value: "12288") Eager send limit, in bytes (must be >= 1) MCA btl: parameter "btl_openib_min_send_size" (current value: "32768") Minimum send size, in bytes (must be >= 1) MCA btl: parameter "btl_openib_max_send_size" (current value: "65536") Maximum send size, in bytes (must be >= 1) MCA btl: parameter "btl_openib_min_rdma_size" (current value: "1048576") Minimum RDMA size, in bytes (must be >= 1) MCA btl: parameter "btl_openib_max_rdma_size" (current value: "1048576") Maximium RDMA size, in bytes (must be >= 1) MCA btl: parameter "btl_openib_flags" (current value: "54") BTL flags, added together: SEND=1, PUT=2, GET=4 (cannot be 0) MCA btl: parameter "btl_openib_bandwidth" (current value: "800") Approximate maximum bandwidth of network (must be >= 1) MCA btl: parameter "btl_openib_priority" (current value: "0") MCA btl: parameter "btl_self_free_list_num" (current value: "0") Number of fragments by default MCA btl: parameter "btl_self_free_list_max" (current value: "-1") Maximum number of fragments MCA btl: parameter "btl_self_free_list_inc" (current value: "32") Increment by this number of fragments MCA btl: parameter "btl_self_eager_limit" (current value: "131072") Eager size fragmeng (before the rendez-vous ptotocol) MCA btl: parameter "btl_self_min_send_size" (current value: "262144") Minimum fragment size after the rendez-vous MCA btl: parameter "btl_self_max_send_size" (current value: "262144") Maximum fragment size after the rendez-vous MCA btl: parameter "btl_self_min_rdma_size" (current value: "2147483647") Maximum fragment size for the RDMA transfer MCA btl: parameter "btl_self_max_rdma_size" (current value: "2147483647") Maximum fragment size for the RDMA transfer MCA btl: parameter "btl_self_exclusivity" (current value: "65536") Device exclusivity MCA btl: parameter "btl_self_flags" (current value: "10") Active behavior flags MCA btl: parameter "btl_self_priority" (current value: "0") MCA btl: parameter "btl_sm_free_list_num" (current value: "8") MCA btl: parameter "btl_sm_free_list_max" (current value: "-1") MCA btl: parameter "btl_sm_free_list_inc" (current value: "64") MCA btl: parameter "btl_sm_exclusivity" (current value: "65535") MCA btl: parameter "btl_sm_latency" (current value: "100") MCA btl: parameter "btl_sm_max_procs" (current value: "-1") MCA btl: parameter "btl_sm_sm_extra_procs" (current value: "2") MCA btl: parameter "btl_sm_mpool" (current value: "sm") MCA btl: parameter "btl_sm_eager_limit" (current value: "4096") MCA btl: parameter "btl_sm_max_frag_size" (current value: "32768") MCA btl: parameter "btl_sm_size_of_cb_queue" (current value: "128") MCA btl: parameter "btl_sm_cb_lazy_free_freq" (current value: "120") MCA btl: parameter "btl_sm_priority" (current value: "0") MCA btl: parameter "btl_tcp_if_include" (current value: <none>) MCA btl: parameter "btl_tcp_if_exclude" (current value: "lo") MCA btl: parameter "btl_tcp_free_list_num" (current value: "8") MCA btl: parameter "btl_tcp_free_list_max" (current value: "-1") MCA btl: parameter "btl_tcp_free_list_inc" (current value: "32") MCA btl: parameter "btl_tcp_sndbuf" (current value: "131072") MCA btl: parameter "btl_tcp_rcvbuf" (current value: "131072") MCA btl: parameter "btl_tcp_endpoint_cache" (current value: "30720") MCA btl: parameter "btl_tcp_exclusivity" (current value: "0") MCA btl: parameter "btl_tcp_eager_limit" (current value: "65536") MCA btl: parameter "btl_tcp_min_send_size" (current value: "65536") MCA btl: parameter "btl_tcp_max_send_size" (current value: "131072") MCA btl: parameter "btl_tcp_min_rdma_size" (current value: "131072") MCA btl: parameter "btl_tcp_max_rdma_size" (current value: "2147483647") MCA btl: parameter "btl_tcp_flags" (current value: "58") MCA btl: parameter "btl_tcp_priority" (current value: "0") MCA btl: parameter "btl_udapl_free_list_num" (current value: "8") MCA btl: parameter "btl_udapl_free_list_max" (current value: "-1") MCA btl: parameter "btl_udapl_free_list_inc" (current value: "8") MCA btl: parameter "btl_udapl_mpool" (current value: "udapl") MCA btl: parameter "btl_udapl_max_modules" (current value: "8") MCA btl: parameter "btl_udapl_evd_qlen" (current value: "32") MCA btl: parameter "btl_udapl_num_recvs" (current value: "8") MCA btl: parameter "btl_udapl_num_sends" (current value: "8") MCA btl: parameter "btl_udapl_timeout" (current value: "10000000") MCA btl: parameter "btl_udapl_exclusivity" (current value: "1014") MCA btl: parameter "btl_udapl_eager_limit" (current value: "32768") MCA btl: parameter "btl_udapl_min_send_size" (current value: "16384") MCA btl: parameter "btl_udapl_max_send_size" (current value: "65536") MCA btl: parameter "btl_udapl_min_rdma_size" (current value: "524288") MCA btl: parameter "btl_udapl_max_rdma_size" (current value: "131072") MCA btl: parameter "btl_udapl_bandwidth" (current value: "225") MCA btl: parameter "btl_udapl_priority" (current value: "0") MCA btl: parameter "btl_base_include" (current value: <none>) MCA btl: parameter "btl_base_exclude" (current value: <none>) MCA btl: parameter "btl_base_warn_component_unused" (current value: "1") This parameter is used to turn on warning messages when certain NICs are not used MCA mtl: parameter "mtl" (current value: <none>) Default selection set of components for the mtl framework (<none> means "use all components that can be found") MCA mtl: parameter "mtl_base_verbose" (current value: "0") Verbosity level for the mtl framework (0 = no verbosity) MCA topo: parameter "topo" (current value: <none>) Default selection set of components for the topo framework (<none> means "use all components that can be found") MCA topo: parameter "topo_base_verbose" (current value: "0") Verbosity level for the topo framework (0 = no verbosity) MCA osc: parameter "osc" (current value: <none>) Default selection set of components for the osc framework (<none> means "use all components that can be found") MCA osc: parameter "osc_base_verbose" (current value: "0") Verbosity level for the osc framework (0 = no verbosity) MCA osc: parameter "osc_pt2pt_no_locks" (current value: "0") Enable optimizations available only if MPI_LOCK is not used. MCA osc: parameter "osc_pt2pt_eager_limit" (current value: "16384") Max size of eagerly sent data MCA osc: parameter "osc_pt2pt_priority" (current value: "0") MCA osc: parameter "osc_rdma_fence_sync_method" (current value: "reduce_scatter") How to synchronize fence: reduce_scatter, allreduce, alltoall MCA osc: parameter "osc_rdma_eager_send" (current value: "0") Attempt to start data movement during communication call, instead of at synchrnoization time. Info key of same name overrides this value, if info key given. MCA osc: parameter "osc_rdma_no_locks" (current value: "0") Enable optimizations available only if MPI_LOCK is not used. MCA osc: parameter "osc_rdma_priority" (current value: "0") MCA errmgr: parameter "errmgr" (current value: <none>) Default selection set of components for the errmgr framework (<none> means "use all components that can be found") MCA errmgr: parameter "errmgr_hnp_debug" (current value: "0") MCA errmgr: parameter "errmgr_hnp_priority" (current value: "0") MCA errmgr: parameter "errmgr_orted_debug" (current value: "0") MCA errmgr: parameter "errmgr_orted_priority" (current value: "0") MCA errmgr: parameter "errmgr_proxy_debug" (current value: "0") MCA errmgr: parameter "errmgr_proxy_priority" (current value: "0") MCA gpr: parameter "gpr_base_maxsize" (current value: "2147483647") MCA gpr: parameter "gpr_base_blocksize" (current value: "512") MCA gpr: parameter "gpr" (current value: <none>) Default selection set of components for the gpr framework (<none> means "use all components that can be found") MCA gpr: parameter "gpr_null_priority" (current value: "0") MCA gpr: parameter "gpr_proxy_debug" (current value: "0") MCA gpr: parameter "gpr_proxy_priority" (current value: "0") MCA gpr: parameter "gpr_replica_debug" (current value: "0") MCA gpr: parameter "gpr_replica_isolate" (current value: "0") MCA gpr: parameter "gpr_replica_priority" (current value: "0") MCA iof: parameter "iof_base_window_size" (current value: "4096") MCA iof: parameter "iof_base_service" (current value: "0.0.0") MCA iof: parameter "iof" (current value: <none>) Default selection set of components for the iof framework (<none> means "use all components that can be found") MCA iof: parameter "iof_proxy_debug" (current value: "1") MCA iof: parameter "iof_proxy_priority" (current value: "0") MCA iof: parameter "iof_svc_debug" (current value: "1") MCA iof: parameter "iof_svc_priority" (current value: "0") MCA ns: parameter "ns" (current value: <none>) Default selection set of components for the ns framework (<none> means "use all components that can be found") MCA ns: parameter "ns_proxy_debug" (current value: "0") MCA ns: parameter "ns_proxy_maxsize" (current value: "2147483647") MCA ns: parameter "ns_proxy_blocksize" (current value: "512") MCA ns: parameter "ns_proxy_priority" (current value: "0") MCA ns: parameter "ns_replica_debug" (current value: "0") MCA ns: parameter "ns_replica_isolate" (current value: "0") MCA ns: parameter "ns_replica_maxsize" (current value: "2147483647") MCA ns: parameter "ns_replica_blocksize" (current value: "512") MCA ns: parameter "ns_replica_priority" (current value: "0") MCA oob: parameter "oob" (current value: <none>) Default selection set of components for the oob framework (<none> means "use all components that can be found") MCA oob: parameter "oob_base_verbose" (current value: "0") Verbosity level for the oob framework (0 = no verbosity) MCA oob: parameter "oob_tcp_peer_limit" (current value: "-1") MCA oob: parameter "oob_tcp_peer_retries" (current value: "60") MCA oob: parameter "oob_tcp_debug" (current value: "0") MCA oob: parameter "oob_tcp_include" (current value: <none>) MCA oob: parameter "oob_tcp_exclude" (current value: <none>) MCA oob: parameter "oob_tcp_sndbuf" (current value: "131072") MCA oob: parameter "oob_tcp_rcvbuf" (current value: "131072") MCA oob: parameter "oob_tcp_connect_timeout" (current value: "10") connect() timeout in seconds, before trying next interface MCA oob: parameter "oob_tcp_listen_mode" (current value: "event") Mode for HNP to accept incoming connections: event, listen_thread MCA oob: parameter "oob_tcp_listen_thread_max_queue" (current value: "10") High water mark for queued accepted socket list size MCA oob: parameter "oob_tcp_listen_thread_max_time" (current value: "10") Maximum amount of time (in milliseconds) to wait between processing accepted socket list MCA oob: parameter "oob_tcp_accept_spin_count" (current value: "10") Number of times to let accept return EWOULDBLOCK before updating accepted socket list MCA oob: parameter "oob_tcp_priority" (current value: "0") MCA ras: parameter "ras" (current value: <none>) MCA ras: parameter "ras_dash_host_priority" (current value: "5") Selection priority for the dash_host RAS component MCA ras: parameter "ras_gridengine_debug" (current value: "0") Enable debugging output for the gridengine ras component MCA ras: parameter "ras_gridengine_priority" (current value: "100") Priority of the gridengine ras component MCA ras: parameter "ras_gridengine_verbose" (current value: "0") Enable verbose output for the gridengine ras component MCA ras: parameter "ras_gridengine_show_jobid" (current value: "0") Show the JOB_ID of the Grid Engine job MCA ras: parameter "ras_localhost_priority" (current value: "0") Selection priority for the localhost RAS component MCA ras: parameter "ras_slurm_priority" (current value: "75") Priority of the slurm ras component MCA rds: parameter "rds" (current value: <none>) MCA rds: parameter "rds_hostfile_debug" (current value: "0") Toggle debug output for hostfile RDS component MCA rds: parameter "rds_hostfile_path" (current value: "/g/g12/mlleinin/src/ompi-v1.2b-112506-gcc/etc/openmpi-default-hostfile") ORTE Host filename MCA rds: parameter "rds_hostfile_priority" (current value: "0") MCA rds: parameter "rds_proxy_priority" (current value: "0") MCA rds: parameter "rds_resfile_debug" (current value: "0") Toggle debug output for resfile RDS component MCA rds: parameter "rds_resfile_name" (current value: <none>) ORTE Resource filename MCA rds: parameter "rds_resfile_priority" (current value: "0") MCA rmaps: parameter "rmaps_base_verbose" (current value: "0") Verbosity level for the rmaps framework MCA rmaps: parameter "rmaps_base_schedule_policy" (current value: "slot") Scheduling Policy for RMAPS. [slot | node] MCA rmaps: parameter "rmaps_base_pernode" (current value: "0") Request one ppn if num procs not specified MCA rmaps: parameter "rmaps_base_schedule_local" (current value: "1") If nonzero, allow scheduling MPI applications on the same node as mpirun (default). If zero, do not schedule any MPI applications on the same node as mpirun MCA rmaps: parameter "rmaps_base_no_oversubscribe" (current value: "0") If nonzero, then do not allow oversubscription of nodes - mpirun will return an error if there aren't enough nodes to launch all processes without oversubscribing MCA rmaps: parameter "rmaps" (current value: <none>) Default selection set of components for the rmaps framework (<none> means "use all components that can be found") MCA rmaps: parameter "rmaps_proxy_debug" (current value: "0") MCA rmaps: parameter "rmaps_proxy_priority" (current value: "0") MCA rmaps: parameter "rmaps_round_robin_debug" (current value: "1") Toggle debug output for Round Robin RMAPS component MCA rmaps: parameter "rmaps_round_robin_priority" (current value: "1") Selection priority for Round Robin RMAPS component MCA rmgr: parameter "rmgr" (current value: <none>) Default selection set of components for the rmgr framework (<none> means "use all components that can be found") MCA rmgr: parameter "rmgr_proxy_priority" (current value: "0") MCA rmgr: parameter "rmgr_urm_priority" (current value: "0") MCA rml: parameter "rml" (current value: <none>) Default selection set of components for the rml framework (<none> means "use all components that can be found") MCA rml: parameter "rml_base_verbose" (current value: "0") Verbosity level for the rml framework (0 = no verbosity) MCA rml: parameter "rml_oob_priority" (current value: "0") MCA pls: parameter "pls" (current value: <none>) Default selection set of components for the pls framework (<none> means "use all components that can be found") MCA pls: parameter "pls_base_verbose" (current value: "0") Verbosity level for the pls framework (0 = no verbosity) MCA pls: parameter "pls_gridengine_debug" (current value: "0") Enable debugging of gridengine pls component MCA pls: parameter "pls_gridengine_verbose" (current value: "0") Enable verbose output of the gridengine qrsh -inherit command MCA pls: parameter "pls_gridengine_priority" (current value: "100") Priority of the gridengine pls component MCA pls: parameter "pls_gridengine_orted" (current value: "orted") The command name that the gridengine pls component will invoke for the ORTE daemon MCA pls: parameter "pls_proxy_priority" (current value: "0") MCA pls: parameter "pls_rsh_debug" (current value: "0") Whether or not to enable debugging output for the rsh pls component (0 or 1) MCA pls: parameter "pls_rsh_num_concurrent" (current value: "128") How many pls_rsh_agent instances to invoke concurrently (must be > 0) MCA pls: parameter "pls_rsh_orted" (current value: "orted") The command name that the rsh pls component will invoke for the ORTE daemon MCA pls: parameter "pls_rsh_priority" (current value: "10") Priority of the rsh pls component MCA pls: parameter "pls_rsh_delay" (current value: "1") Delay (in seconds) between invocations of the remote agent, but only used when the "debug" MCA parameter is true, or the top-level MCA debugging is enabled (otherwise this value is ignored) MCA pls: parameter "pls_rsh_reap" (current value: "1") If set to 1, wait for all the processes to complete before exiting. Otherwise, quit immediately -- without waiting for confirmation that all other processes in the job have completed. MCA pls: parameter "pls_rsh_assume_same_shell" (current value: "1") If set to 1, assume that the shell on the remote node is the same as the shell on the local node. Otherwise, probe for what the remote shell. MCA pls: parameter "pls_rsh_agent" (current value: "ssh : rsh") The command used to launch executables on remote nodes (typically either "ssh" or "rsh") MCA pls: parameter "pls_slurm_debug" (current value: "0") Enable debugging of slurm pls MCA pls: parameter "pls_slurm_priority" (current value: "75") Default selection priority MCA pls: parameter "pls_slurm_orted" (current value: "orted") Command to use to start proxy orted MCA pls: parameter "pls_slurm_args" (current value: <none>) Custom arguments to srun MCA sds: parameter "sds" (current value: <none>) Default selection set of components for the sds framework (<none> means "use all components that can be found") MCA sds: parameter "sds_base_verbose" (current value: "0") Verbosity level for the sds framework (0 = no verbosity) MCA sds: parameter "sds_env_priority" (current value: "0") MCA sds: parameter "sds_pipe_priority" (current value: "0") MCA sds: parameter "sds_seed_priority" (current value: "0") MCA sds: parameter "sds_singleton_priority" (current value: "0") MCA sds: parameter "sds_slurm_priority" (current value: "0")