On Mon, Jan 12, 2015 at 10:31 AM, Roy Stogner <royst...@ices.utexas.edu>
wrote:

The failure seems to be within a ParallelMesh-only code path.
>

OK, I see this with ParallelMesh enabled.

Environment:

Currently Loaded Modules:
  1) texlive/2013   3) asciidoc/8.6.9   5) subversion/1.7.17   7)
emacs/24.3       9) autotools/1.0  11) epd/7.3.2        13) gdb/7.6.2
15) paraview/4.1.0  17) mpich/3.0.4         19) vtk/5.10.1  21)
petsc/3.5.2-debug
  2) texinfo/5.2    4) git/1.8.5.3      6) svn2git/2.3.2       8) cmake/
2.8.12.2  10) doxygen/1.8.6  12) graphviz/2.36.0  14) valgrind/3.9.0  16)
gcc/4.8.2       18) openblas/0.2.9.rc1  20) tbb/4.2

configure line:

../../libMesh/configure
--prefix=/fry1/data/users/pbauman/software/libs/libmesh/master
--enable-everything --with-metis=PETSc --enable-parmesh

Building and running the installed introduction/ex4 as is in dbg mode gives
the attached dbg_output.txt

If I run the devel version, it hangs. Here's the stack trace attached to
one of the two processes:

(gdb) where
#0  0x00007f14008350f3 in MPIDI_CH3I_Progress () from
/fry1/data/users/pbauman/software/libs/mpich/3.0.4/gcc/4.8.2/lib/libmpich.so.10
#1  0x00007f14008279a5 in MPID_Probe () from
/fry1/data/users/pbauman/software/libs/mpich/3.0.4/gcc/4.8.2/lib/libmpich.so.10
#2  0x00007f14008de396 in PMPI_Probe () from
/fry1/data/users/pbauman/software/libs/mpich/3.0.4/gcc/4.8.2/lib/libmpich.so.10
#3  0x00007f14063c9818 in probe (tag=..., src_processor_id=1,
this=0x7fff31c58078) at ./include/libmesh/parallel_implementation.h:1931
#4  receive<unsigned int> (tag=..., type=..., buf=..., src_processor_id=1,
this=0x7fff31c58078) at ./include/libmesh/parallel_implementation.h:2504
#5  send_receive<unsigned int, unsigned int> (recv_tag=..., send_tag=...,
type2=..., recv=..., source_processor_id=1, type1=..., sendvec=...,
dest_processor_id=<optimized out>, this=0x7fff31c58078)
    at ./include/libmesh/parallel_implementation.h:2632
#6  libMesh::Parallel::Communicator::send_receive<unsigned int>
(this=0x7fff31c58078, dest_processor_id=<optimized out>, sendvec=...,
source_processor_id=1, recv=..., send_tag=..., recv_tag=...)
    at ./include/libmesh/parallel_implementation.h:2697
#7  0x00007f14063f1499 in libMesh::DofMap::scatter_constraints
(this=this@entry=0xa625d0, mesh=...) at
../../libMesh/src/base/dof_map_constraints.C:3402
#8  0x00007f14063f42d8 in libMesh::DofMap::process_constraints
(this=0xa625d0, mesh=...) at
../../libMesh/src/base/dof_map_constraints.C:3229
#9  0x00007f1406b263d7 in libMesh::System::init_data (this=this@entry=0xa62240)
at ../../libMesh/src/systems/system.C:268
#10 0x00007f1406ae8c52 in libMesh::ExplicitSystem::init_data
(this=this@entry=0xa62240) at ../../libMesh/src/systems/explicit_system.C:72
#11 0x00007f1406b0d8a4 in libMesh::ImplicitSystem::init_data
(this=0xa62240) at ../../libMesh/src/systems/implicit_system.C:89
#12 0x00007f1406b102c9 in libMesh::LinearImplicitSystem::init_data
(this=0xa62240) at ../../libMesh/src/systems/linear_implicit_system.C:77
#13 0x00007f1406b26819 in libMesh::System::init (this=0xa62240) at
../../libMesh/src/systems/system.C:235
#14 0x00007f1406ae03e7 in libMesh::EquationSystems::init
(this=this@entry=0x7fff31c58130)
at ../../libMesh/src/systems/equation_systems.C:121
#15 0x000000000042aa1c in main (argc=<optimized out>, argv=<optimized out>)
at introduction_ex4.C:287
(gdb) quit

opt also hangs.

Note I see the same behavior minus the PARMETIS ERROR is I switch to a
different partitioner (MetisPartitioner and LinearPartitioner both worked
in dbg and hung in devel and opt).

I'm going to try with OpenMPI and see if this is replicated or not.
Running ./example-dbg -d 2 -n 15 -ksp_monitor

PARMETIS ERROR: The sum of tpwgts for constraint #0 is not 1.0
PARMETIS ERROR: The sum of tpwgts for constraint #0 is not 1.0
 Mesh Information:
  mesh_dimension()=2
  spatial_dimension()=3
  n_nodes()=961
    n_local_nodes()=513
  n_elem()=225
    n_local_elem()=114
    n_active_elem()=225
  n_subdomains()=1
  n_partitions()=2
  n_processors()=2
  n_threads()=1
  processor_id()=0

 EquationSystems
  n_systems()=1
   System #0, "Poisson"
    Type "LinearImplicit"
    Variables="u" 
    Finite Element Types="LAGRANGE", "JACOBI_20_00" 
    Infinite Element Mapping="CARTESIAN" 
    Approximation Orders="SECOND", "THIRD" 
    n_dofs()=961
    n_local_dofs()=513
    n_constrained_dofs()=120
    n_local_constrained_dofs()=62
    n_vectors()=1
    n_matrices()=1
    DofMap Sparsity
      Average  On-Processor Bandwidth <= 14.7336
      Average Off-Processor Bandwidth <= 0.720083
      Maximum  On-Processor Bandwidth <= 26
      Maximum Off-Processor Bandwidth <= 15
    DofMap Constraints
      Number of DoF Constraints = 120
      Number of Heterogenous Constraints= 118
      Average DoF Constraint Length= 0
      Number of Node Constraints = 0

 Mesh Information:
  mesh_dimension()=2
  spatial_dimension()=3
  n_nodes()=961
    n_local_nodes()=513
  n_elem()=225
    n_local_elem()=114
    n_active_elem()=225
  n_subdomains()=1
  n_partitions()=2
  n_processors()=2
  n_threads()=1
  processor_id()=0


 
-------------------------------------------------------------------------------------------
| Processor id:   0                                                             
            |
| Num Processors: 2                                                             
            |
| Time:           Mon Jan 12 11:22:46 2015                                      
            |
| OS:             Linux                                                         
            |
| HostName:       fry.eng.buffalo.edu                                           
            |
| OS Release:     2.6.32-504.3.3.el6.x86_64                                     
            |
| OS Version:     #1 SMP Fri Dec 12 16:05:43 EST 2014                           
            |
| Machine:        x86_64                                                        
            |
| Username:       pbauman                                                       
            |
| Configuration:  ../../libMesh/configure  
'--prefix=/fry1/data/users/pbauman/software/libs/libmesh/master'|
|  '--enable-everything'                                                        
            |
|  '--with-metis=PETSc'                                                         
            |
|  '--enable-parmesh'                                                           
            |
|  'CXX=g++'                                                                    
            |
|  'CC=gcc'                                                                     
            |
|  'FC=gfortran'                                                                
            |
|  'F77=gfortran'                                                               
            |
|  'PETSC_DIR=/fry1/data/users/pbauman/software/libs/petsc/petsc-3.5.2'         
            |
|  'PETSC_ARCH=gcc-4.8.2-mpich-3.0.4-openblas-0.2.9.rc1-cxx-opt'                
            |
|  
'VTK_INCLUDE=/fry1/data/users/pbauman/software/libs/vtk/5.10.1/gcc/4.8.2/include/vtk-5.10'|
|  'VTK_DIR=/fry1/data/users/pbauman/software/libs/vtk/5.10.1/gcc/4.8.2'        
            |
 
-------------------------------------------------------------------------------------------
 
------------------------------------------------------------------------------------------------------------
| Matrix Assembly Performance: Alive time=0.032835, Active time=0.025345        
                             |
 
------------------------------------------------------------------------------------------------------------
| Event                         nCalls     Total Time  Avg Time    Total Time  
Avg Time    % of Active Time  |
|                                          w/o Sub     w/o Sub     With Sub    
With Sub    w/o S    With S   |
|------------------------------------------------------------------------------------------------------------|
|                                                                               
                             |
| Fe                            114        0.0012      0.000010    0.0012      
0.000010    4.65     4.65     |
| Ke                            114        0.0068      0.000059    0.0068      
0.000059    26.71    26.71    |
| elem init                     114        0.0167      0.000147    0.0167      
0.000147    66.06    66.06    |
| matrix insertion              114        0.0007      0.000006    0.0007      
0.000006    2.59     2.59     |
 
------------------------------------------------------------------------------------------------------------
| Totals:                       456        0.0253                               
           100.00            |
 
------------------------------------------------------------------------------------------------------------

  0 KSP Residual norm 6.281555363893e+00 
  1 KSP Residual norm 1.441578986136e+00 
  2 KSP Residual norm 8.657661935565e-01 
  3 KSP Residual norm 6.298483075757e-01 
  4 KSP Residual norm 4.584797396291e-01 
  5 KSP Residual norm 3.672004215548e-01 
  6 KSP Residual norm 2.854933103510e-01 
  7 KSP Residual norm 2.207387889462e-01 
  8 KSP Residual norm 1.507528183709e-01 
  9 KSP Residual norm 7.609993007381e-02 
 10 KSP Residual norm 3.870496969236e-02 
 11 KSP Residual norm 1.694470333105e-02 
 12 KSP Residual norm 1.056205718874e-02 
 13 KSP Residual norm 7.962944594936e-03 
 14 KSP Residual norm 6.247891445290e-03 
 15 KSP Residual norm 5.142557174263e-03 
 16 KSP Residual norm 4.064335778374e-03 
 17 KSP Residual norm 2.986839198661e-03 
 18 KSP Residual norm 2.212248591674e-03 
 19 KSP Residual norm 1.711006399209e-03 
 20 KSP Residual norm 1.423591127897e-03 
 21 KSP Residual norm 1.228658042495e-03 
 22 KSP Residual norm 1.021344123622e-03 
 23 KSP Residual norm 8.969047514369e-04 
 24 KSP Residual norm 7.995662891498e-04 
 25 KSP Residual norm 6.652930701941e-04 
 26 KSP Residual norm 5.304353279729e-04 
 27 KSP Residual norm 4.030242592245e-04 
 28 KSP Residual norm 2.730993578525e-04 
 29 KSP Residual norm 1.645810058886e-04 
 30 KSP Residual norm 1.103538427762e-04 
 31 KSP Residual norm 8.610454940155e-05 
 32 KSP Residual norm 6.498404359506e-05 
 33 KSP Residual norm 4.556983417930e-05 
 34 KSP Residual norm 3.231154166980e-05 
 35 KSP Residual norm 2.577303633035e-05 
 36 KSP Residual norm 2.113582798346e-05 
 37 KSP Residual norm 1.752446809300e-05 
 38 KSP Residual norm 1.484672477927e-05 
 39 KSP Residual norm 1.302629184089e-05 
 40 KSP Residual norm 1.172471062290e-05 
 41 KSP Residual norm 1.006971964980e-05 
 42 KSP Residual norm 8.230366349966e-06 
 43 KSP Residual norm 6.171634470124e-06 
 44 KSP Residual norm 4.117409795018e-06 
 45 KSP Residual norm 2.611987673662e-06 
 46 KSP Residual norm 1.745591647676e-06 
 47 KSP Residual norm 1.190676355730e-06 
 48 KSP Residual norm 9.028453572394e-07 
 49 KSP Residual norm 7.224248002419e-07 
 50 KSP Residual norm 5.339195753159e-07 
 51 KSP Residual norm 3.748508543720e-07 
 52 KSP Residual norm 2.677517396278e-07 
 53 KSP Residual norm 1.840889093657e-07 
 54 KSP Residual norm 1.207493202252e-07 
 55 KSP Residual norm 7.670458622698e-08 
 56 KSP Residual norm 4.744749826056e-08 
 57 KSP Residual norm 2.888924255267e-08 
 58 KSP Residual norm 1.855668558305e-08 
 59 KSP Residual norm 1.225261973163e-08 
 60 KSP Residual norm 7.274693375029e-09 
 61 KSP Residual norm 5.678840027999e-09 
 62 KSP Residual norm 4.272621003973e-09 
 63 KSP Residual norm 3.297977189053e-09 
 64 KSP Residual norm 2.507083460763e-09 
 65 KSP Residual norm 1.834020100457e-09 
 66 KSP Residual norm 1.337490896343e-09 
 67 KSP Residual norm 1.047810076369e-09 
 68 KSP Residual norm 7.828493744498e-10 
 69 KSP Residual norm 5.516829101298e-10 
 70 KSP Residual norm 3.616568657065e-10 
 71 KSP Residual norm 2.576313931394e-10 
 72 KSP Residual norm 2.089089652561e-10 
 73 KSP Residual norm 1.754670989786e-10 
 74 KSP Residual norm 1.516055264043e-10 
 75 KSP Residual norm 1.314547789622e-10 
 76 KSP Residual norm 1.151571443845e-10 
 77 KSP Residual norm 1.002170334358e-10 
 78 KSP Residual norm 8.399470485732e-11 
 79 KSP Residual norm 6.720961716070e-11 
 80 KSP Residual norm 4.385649839608e-11 
 81 KSP Residual norm 2.932111979815e-11 
 82 KSP Residual norm 1.808693464627e-11 
 83 KSP Residual norm 1.155116888517e-11 
 84 KSP Residual norm 7.787886447474e-12 
 85 KSP Residual norm 5.539837062396e-12 
Warning:  This MeshOutput subclass only supports meshes which have been 
serialized!
Warning:  This MeshOutput subclass only supports meshes which have been 
serialized!

 ---------------------------------------------------------------------------- 
| Reference count information                                                |
 ---------------------------------------------------------------------------- 
| N7libMesh10FEAbstractE reference count information:
|  Creations:    4
|  Destructions: 4
| N7libMesh10Parameters5ValueE reference count information:
|  Creations:    2
|  Destructions: 2
| N7libMesh12LinearSolverIdEE reference count information:
|  Creations:    1
|  Destructions: 1
| N7libMesh12SparseMatrixIdEE reference count information:
|  Creations:    1
|  Destructions: 1
| N7libMesh13NumericVectorIdEE reference count information:
|  Creations:    5
|  Destructions: 5
| N7libMesh15EquationSystemsE reference count information:
|  Creations:    1
|  Destructions: 1
| N7libMesh4ElemE reference count information:
|  Creations:    2632
|  Destructions: 2632
| N7libMesh4NodeE reference count information:
|  Creations:    1333
|  Destructions: 1333
| N7libMesh5QBaseE reference count information:
|  Creations:    5
|  Destructions: 5
| N7libMesh6DofMapE reference count information:
|  Creations:    1
|  Destructions: 1
| N7libMesh6SystemE reference count information:
|  Creations:    1
|  Destructions: 1
| N7libMesh9DofObjectE reference count information:
|  Creations:    3965
|  Destructions: 3965
 ---------------------------------------------------------------------------- 
 
---------------------------------------------------------------------------------------------------------------------
| libMesh Performance: Alive time=1.62748, Active time=1.36299                  
                                      |
 
---------------------------------------------------------------------------------------------------------------------
| Event                                  nCalls     Total Time  Avg Time    
Total Time  Avg Time    % of Active Time  |
|                                                   w/o Sub     w/o Sub     
With Sub    With Sub    w/o S    With S   |
|---------------------------------------------------------------------------------------------------------------------|
|                                                                               
                                      |
|                                                                               
                                      |
| DofMap                                                                        
                                      |
|   add_neighbors_to_send_list()         1          0.0050      0.005017    
0.0073      0.007321    0.37     0.54     |
|   build_constraint_matrix_and_vector() 114        0.0018      0.000016    
0.0018      0.000016    0.13     0.13     |
|   build_sparsity()                     1          0.0313      0.031347    
0.0378      0.037773    2.30     2.77     |
|   create_dof_constraints()             1          0.0072      0.007207    
0.0170      0.017019    0.53     1.25     |
|   distribute_dofs()                    1          0.1193      0.119323    
0.6241      0.624148    8.75     45.79    |
|   dof_indices()                        524        0.0257      0.000049    
0.0257      0.000049    1.89     1.89     |
|   hetero_cnstrn_elem_mat_vec()         114        0.0011      0.000010    
0.0011      0.000010    0.08     0.08     |
|   prepare_send_list()                  1          0.0006      0.000619    
0.0006      0.000619    0.05     0.05     |
|   reinit()                             1          0.0148      0.014765    
0.0148      0.014765    1.08     1.08     |
|                                                                               
                                      |
| EquationSystems                                                               
                                      |
|   build_solution_vector()              1          0.0019      0.001909    
0.0077      0.007734    0.14     0.57     |
|                                                                               
                                      |
| ExodusII_IO                                                                   
                                      |
|   write_nodal_data()                   1          0.0057      0.005673    
0.0060      0.005965    0.42     0.44     |
|                                                                               
                                      |
| FE                                                                            
                                      |
|   compute_shape_functions()            144        0.0058      0.000040    
0.0058      0.000040    0.43     0.43     |
|   init_shape_functions()               31         0.0002      0.000007    
0.0002      0.000007    0.02     0.02     |
|   inverse_map()                        90         0.0009      0.000010    
0.0009      0.000010    0.07     0.07     |
|                                                                               
                                      |
| FEMap                                                                         
                                      |
|   compute_affine_map()                 144        0.0025      0.000018    
0.0025      0.000018    0.19     0.19     |
|   compute_face_map()                   30         0.0009      0.000028    
0.0019      0.000063    0.06     0.14     |
|   init_face_shape_functions()          30         0.0002      0.000007    
0.0002      0.000007    0.02     0.02     |
|   init_reference_to_physical_map()     31         0.0009      0.000029    
0.0009      0.000029    0.07     0.07     |
|                                                                               
                                      |
| Mesh                                                                          
                                      |
|   find_neighbors()                     2          0.0289      0.014471    
0.0317      0.015828    2.12     2.32     |
|                                                                               
                                      |
| MeshCommunication                                                             
                                      |
|   (all)gather()                        1          0.0438      0.043773    
0.0580      0.057992    3.21     4.25     |
|   compute_hilbert_indices()            4          0.0049      0.001213    
0.0049      0.001213    0.36     0.36     |
|   delete_remote_elements()             3          0.0104      0.003473    
0.0119      0.003965    0.76     0.87     |
|   find_global_indices()                4          0.0106      0.002642    
0.0217      0.005420    0.78     1.59     |
|   parallel_sort()                      4          0.0042      0.001053    
0.0054      0.001358    0.31     0.40     |
|                                                                               
                                      |
| MeshOutput                                                                    
                                      |
|   write_equation_systems()             1          0.0132      0.013242    
0.1180      0.117971    0.97     8.66     |
|                                                                               
                                      |
| MeshTools::Generation                                                         
                                      |
|   build_cube()                         1          0.0141      0.014134    
0.0141      0.014134    1.04     1.04     |
|                                                                               
                                      |
| Parallel                                                                      
                                      |
|   allgather()                          24         0.0013      0.000052    
0.0018      0.000074    0.09     0.13     |
|   broadcast()                          1          0.0000      0.000009    
0.0000      0.000009    0.00     0.00     |
|   max(bool)                            8310       0.0544      0.000007    
0.0544      0.000007    3.99     3.99     |
|   max(scalar)                          19698      0.1212      0.000006    
0.1212      0.000006    8.90     8.90     |
|   max(vector)                          3763       0.0506      0.000013    
0.1410      0.000037    3.71     10.34    |
|   max(vector<bool>)                    3          0.0007      0.000224    
0.0008      0.000252    0.05     0.06     |
|   min(bool)                            8158       0.0525      0.000006    
0.0525      0.000006    3.85     3.85     |
|   min(scalar)                          61241      0.3972      0.000006    
0.3972      0.000006    29.14    29.14    |
|   min(vector)                          3763       0.0518      0.000014    
0.1450      0.000039    3.80     10.64    |
|   probe()                              67         0.0016      0.000024    
0.0016      0.000024    0.12     0.12     |
|   receive()                            67         0.0006      0.000009    
0.0023      0.000035    0.04     0.17     |
|   send()                               67         0.0003      0.000004    
0.0003      0.000004    0.02     0.02     |
|   send_receive()                       98         0.0011      0.000012    
0.0042      0.000043    0.08     0.31     |
|   sum()                                52         0.0008      0.000016    
0.0016      0.000030    0.06     0.11     |
|                                                                               
                                      |
| Parallel::Request                                                             
                                      |
|   wait()                               70         0.0002      0.000003    
0.0002      0.000003    0.01     0.01     |
|                                                                               
                                      |
| ParallelMesh                                                                  
                                      |
|   renumber_nodes_and_elements()        2          0.0722      0.036124    
0.2182      0.109118    5.30     16.01    |
|                                                                               
                                      |
| ParmetisPartitioner                                                           
                                      |
|   repartition()                        1          0.1673      0.167309    
0.1818      0.181825    12.28    13.34    |
|                                                                               
                                      |
| Partitioner                                                                   
                                      |
|   set_node_processor_ids()             1          0.0097      0.009723    
0.0265      0.026458    0.71     1.94     |
|   set_parent_processor_ids()           1          0.0007      0.000657    
0.0007      0.000657    0.05     0.05     |
|                                                                               
                                      |
| PetscLinearSolver                                                             
                                      |
|   solve()                              1          0.0069      0.006875    
0.0069      0.006875    0.50     0.50     |
|                                                                               
                                      |
| System                                                                        
                                      |
|   assemble()                           1          0.0158      0.015771    
0.0331      0.033131    1.16     2.43     |
 
---------------------------------------------------------------------------------------------------------------------
| Totals:                                106669     1.3630                      
                    100.00            |
 
---------------------------------------------------------------------------------------------------------------------

------------------------------------------------------------------------------
New Year. New Location. New Benefits. New Data Center in Ashburn, VA.
GigeNET is offering a free month of service with a new server in Ashburn.
Choose from 2 high performing configs, both with 100TB of bandwidth.
Higher redundancy.Lower latency.Increased capacity.Completely compliant.
www.gigenet.com
_______________________________________________
Libmesh-devel mailing list
Libmesh-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/libmesh-devel

Reply via email to