I found a slight difference between the trace files: the line number reported
for the libmesh_assert_parallel_consistent_procids frame.

traceout_8_142118.txt contains:

libMesh::MeshTools::libmesh_assert_parallel_consistent_procids<libMesh::Node> (mesh=...) at src/mesh/mesh_tools.C:1608

whereas traceout_57_85461.txt and traceout_11_104555.txt contain:

libMesh::MeshTools::libmesh_assert_parallel_consistent_procids<libMesh::Node> (mesh=...) at src/mesh/mesh_tools.C:1609
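
In case it is useful, a comparison like this can be scripted rather than done by eye. The following is only a rough Python sketch: the helper names read_frames and differing_frames are my own, the sample text is built from frames #7 and #8 quoted below, and the second address is made up to show that addresses are ignored. It assumes each frame ends with "at file:line", as the gdb output here does.

```python
import re

# A frame starts with "#<number>"; gdb frames here end with "at <file>:<line>".
FRAME_START = re.compile(r"#\d+\s")
FRAME_PARTS = re.compile(r"#(\d+)\s+(?:0x[0-9a-fA-F]+ in )?(.+?) \(.*\) at (\S+)$")


def read_frames(text):
    """Return (frame_number, function, location) tuples from backtrace text.

    Frames that were wrapped over several lines (as in mail archives) are
    re-joined first. Text before the first frame is ignored; any text after
    the last frame would be glued onto it, so trim the input if needed.
    """
    joined = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if FRAME_START.match(line):
            joined.append(line)
        elif joined:
            joined[-1] += " " + line
    frames = []
    for frame in joined:
        match = FRAME_PARTS.match(frame)
        if match:
            frames.append((int(match.group(1)), match.group(2), match.group(3)))
    return frames


def differing_frames(a, b):
    """Pair up frames by position and keep only the pairs that differ."""
    return [(fa, fb) for fa, fb in zip(a, b) if fa != fb]


# Sample input taken from the quoted trace (frames #7 and #8 only).
trace_8 = """
#7  0x00002aaaaebe174e in
libMesh::MeshTools::libmesh_assert_parallel_consistent_procids<libMesh::Node>
(mesh=...) at src/mesh/mesh_tools.C:1608
#8  0x00002aaaaeba931e in libMesh::MeshTools::correct_node_proc_ids
(mesh=...) at src/mesh/mesh_tools.C:1844
"""
# Hypothetical second trace: different line number and a made-up address.
trace_57 = trace_8.replace("mesh_tools.C:1608", "mesh_tools.C:1609").replace(
    "0x00002aaaaebe174e", "0x00002aaaaebe1800"
)

for frame_a, frame_b in differing_frames(read_frames(trace_8), read_frames(trace_57)):
    print(frame_a[0], frame_a[1], frame_a[2], "vs", frame_b[2])
```

Because only the function name and source location are compared, differences in addresses, process IDs, and temporary file names between ranks do not show up as noise.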

I am not sure whether this helps.
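
For reference, this is how I read the three assertion messages quoted below. The sketch is a toy model in Python, not the libMesh implementation: it assumes the check takes the minimum and the maximum of a node's processor_id over all ranks that hold the node and requires every local copy to equal both. The node id 42 is hypothetical, and the mapping of ranks 8, 11 and 57 to values is my guess based on the traceout file names.

```python
def check_consistent_procids(views):
    """Model of a parallel consistency check on node processor ids.

    views maps rank -> {node_id: processor_id as seen on that rank}.
    Returns one tuple (rank, node_id, which, expected, local) per failed
    comparison, where which is "min_id" or "max_id".
    """
    node_ids = set()
    for local_nodes in views.values():
        node_ids.update(local_nodes)

    failures = []
    for node_id in sorted(node_ids):
        # Only ranks that actually hold a copy of the node take part.
        seen = {
            rank: local_nodes[node_id]
            for rank, local_nodes in views.items()
            if node_id in local_nodes
        }
        min_id = min(seen.values())
        max_id = max(seen.values())
        for rank, local_id in sorted(seen.items()):
            if local_id != min_id:
                failures.append((rank, node_id, "min_id", min_id, local_id))
            if local_id != max_id:
                failures.append((rank, node_id, "max_id", max_id, local_id))
    return failures


# Assumed scenario: ranks 8 and 11 think the node belongs to processor 11,
# while rank 57 thinks it belongs to processor 57.
views = {8: {42: 11}, 11: {42: 11}, 57: {42: 57}}
failures = check_consistent_procids(views)
for rank, node_id, which, expected, local_id in failures:
    print(f"rank {rank}: {which} = {expected}, node->processor_id() = {local_id}")
```

Under these assumptions the model gives one min_id failure (min_id = 11, local value 57) and two max_id failures (max_id = 57, local value 11), which is the same pattern as the three reported messages. If that reading is right, ranks 11 and 57 disagree about which of them owns a newly created node.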

-- 
 

On 3/18/18, 12:36 PM, "Salazar De Troya, Miguel" <salazardet...@llnl.gov> 
wrote:

    Hello,
    
    I am running a large problem (1,601,777 elements) on 100 processors with a
    DistributedMesh. At some point I call
    MeshRefinement::refine_and_coarsen_elements() to do AMR, and I get the
    following assertion errors (running in debug mode):
    
    Assertion `min_id == node->processor_id()' failed.
    min_id = 11
    node->processor_id() = 57
    
    Assertion `max_id == node->processor_id()' failed.
    max_id = 57
    node->processor_id() = 11
    
    Assertion `max_id == node->processor_id()' failed.
    max_id = 57
    node->processor_id() = 11
    
    I also get three traceout files: traceout_57_85461.txt,
    traceout_11_104555.txt, and traceout_8_142118.txt. Their contents are
    similar and look like this:
    
    
    [New LWP 142203]
    [Thread debugging using libthread_db enabled]
    Using host libthread_db library "/usr/lib64/libthread_db.so.1".
    0x00002aaaba58fe09 in __libc_waitpid (pid=143882, stat_loc=stat_loc@entry=0x7fffffff4a90, options=options@entry=0) at ../sysdeps/unix/sysv/linux/waitpid.c:40
    40      int result = INLINE_SYSCALL (wait4, 4, pid, stat_loc, options, NULL);
    To enable execution of this file add
        add-auto-load-safe-path /usr/tce/packages/gcc/gcc-4.9.3/lib64/libstdc++.so.6.0.20-gdb.py
    line to your configuration file "/g/g92/miguel/.gdbinit".
    To completely disable this security protection add
        set auto-load safe-path /
    line to your configuration file "/g/g92/miguel/.gdbinit".
    For more information about this security protection see the
    "Auto-loading safe path" section in the GDB manual.  E.g., run from the shell:
        info "(gdb)Auto-loading safe path"
    #0  0x00002aaaba58fe09 in __libc_waitpid (pid=143882, stat_loc=stat_loc@entry=0x7fffffff4a90, options=options@entry=0) at ../sysdeps/unix/sysv/linux/waitpid.c:40
    #1  0x00002aaaba512cc2 in do_system (line=line@entry=0x1e3ac078 "gdb -p 142118 -batch -ex bt -ex detach 2>/dev/null 1>temp_print_trace.4jiwUH") at ../sysdeps/posix/system.c:148
    #2  0x00002aaaba513071 in __libc_system (line=0x1e3ac078 "gdb -p 142118 -batch -ex bt -ex detach 2>/dev/null 1>temp_print_trace.4jiwUH") at ../sysdeps/posix/system.c:189
    #3  0x00002aaaad67e17b in (anonymous namespace)::gdb_backtrace (out_stream=...) at src/base/print_trace.C:162
    #4  0x00002aaaad6806ab in libMesh::print_trace (out_stream=...) at src/base/print_trace.C:209
    #5  0x00002aaaad67f7f4 in libMesh::write_traceout () at src/base/print_trace.C:239
    #6  0x00002aaaad6769fb in libMesh::MacroFunctions::report_error (file=0x2320a <Address 0x2320a out of bounds>, line=-46448, date=0x0, time=0xffffffffffffffff <Address 0xffffffffffffffff out of bounds>) at src/base/libmesh_common.C:89
    #7  0x00002aaaaebe174e in libMesh::MeshTools::libmesh_assert_parallel_consistent_procids<libMesh::Node> (mesh=...) at src/mesh/mesh_tools.C:1608
    #8  0x00002aaaaeba931e in libMesh::MeshTools::correct_node_proc_ids (mesh=...) at src/mesh/mesh_tools.C:1844
    #9  0x00002aaaae69a0ce in libMesh::MeshCommunication::make_new_nodes_parallel_consistent (this=0x2320a, mesh=...) at src/mesh/mesh_communication.C:1776
    #10 0x00002aaaaea95919 in libMesh::MeshRefinement::_refine_elements (this=0x2320a) at src/mesh/mesh_refinement.C:1601
    #11 0x00002aaaaea6a4d1 in libMesh::MeshRefinement::refine_and_coarsen_elements (this=0x2320a) at src/mesh/mesh_refinement.C:578
    #12 0x00002aaab9d69dcd in OptiProblem::solve (this=0x7fffffffabd8) at /g/g92/miguel/code/topsm/src/opti_problem.C:370
    #13 0x00000000004371b8 in main (argc=4, argv=0x7fffffffb798) at /g/g92/miguel/code/topsm/test/3D_stress_constraint/linear_stress_opti.C:196
    
    Are there other things I can do to debug this?
    
    
    Thanks
    
    Miguel
    
------------------------------------------------------------------------------
    Check out the vibrant tech community on one of the world's most
    engaging tech sites, Slashdot.org! http://sdm.link/slashdot
    _______________________________________________
    Libmesh-users mailing list
    Libmesh-users@lists.sourceforge.net
    https://lists.sourceforge.net/lists/listinfo/libmesh-users
    
