Dear all,

I want to run step-17 on a cluster. It works well on a single node, but 
fails with an error when I use multiple nodes, and I cannot figure out 
where the problem is. I believe the code itself is correct. The failure 
seems to come from the call "GridTools::partition_triangulation 
(n_mpi_processes, triangulation);". I have installed METIS on the cluster. 
Do I need to enable METIS manually, or add something explicit to the PBS 
file? Can anyone help? Thank you very much in advance!

The error output from a run that uses only two nodes is copied below:

***Memory allocation failed for SetupCtrl: maxvwgt. Requested size: 
18446744073709551608 bytes
   Current memory used:         384 bytes
   Maximum memory used:         384 bytes
***Memory allocation failed for SetupCtrl: maxvwgt. Requested size: 
18446744073709551608 bytes

ERROR: Uncaught exception in MPI_InitFinalize on proc 0. Skipping 
MPI_Finalize() to avoid a deadlock.
----------------------------------------------------
ERROR: Uncaught exception in MPI_InitFinalize on proc 1. Skipping 
MPI_Finalize() to avoid a deadlock.
----------------------------------------------------
Exception on processing: 
-------------------------------------------------------
An error occurred in line <131> of file 
</opt/src/dealii-8.3.0/source/lac/sparsity_tools.cc> in function
    void dealii::SparsityTools::partition(const dealii::SparsityPattern&, 
unsigned int, std::vector<unsigned int, std::allocator<unsigned int> >&)
The violated condition was: 
    ierr == 1
The name and call sequence of the exception was:
    ExcMETISError (ierr)
Additional Information: 
    An error with error number -3 occurred while calling a METIS function
--------------------------------------------------------
Aborting!
----------------------------------------------------
Exception on processing: 
--------------------------------------------------------
An error occurred in line <131> of file 
</opt/src/dealii-8.3.0/source/lac/sparsity_tools.cc> in function
    void dealii::SparsityTools::partition(const dealii::SparsityPattern&, 
unsigned int, std::vector<unsigned int, std::allocator<unsigned int> >&)
The violated condition was: 
    ierr == 1
The name and call sequence of the exception was:
    ExcMETISError (ierr)
Additional Information: 
    An error with error number -3 occurred while calling a METIS function
--------------------------------------------------------
Aborting!
--------------------------------------------------------------------------
mpirun has exited due to process rank 1 with PID 44095 on
node walnut-node42.internal exiting improperly. There are two reasons this 
could occur:

1. this process did not call "init" before exiting, but others in
the job did. This can cause a job to hang indefinitely while it waits
for all processes to call "init". By rule, if one process calls "init",
then ALL processes must call "init" prior to termination.

2. this process called "init", but exited without calling "finalize".
By rule, all processes that call "init" MUST call "finalize" prior to
exiting or it will be considered an "abnormal termination"

This may have caused other processes in the application to be
terminated by signals sent by mpirun (as reported here).
--------------------------------------------------------------------------
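
One detail in the log above may help narrow things down. The requested size, 
18446744073709551608 bytes, is exactly 2^64 - 8, which is what a byte count 
of -8 looks like once it is reinterpreted as an unsigned 64-bit number. As 
far as I can tell, error number -3 corresponds to METIS_ERROR_MEMORY in 
metis.h, which would fit a failed allocation. The sketch below only checks 
the arithmetic. The helper names are hypothetical and are not part of METIS 
or deal.II, and the idea that a negative entry count reached METIS (for 
example through a mismatch between the integer width METIS was built with 
and the one the caller passes) is an assumption, not something the log 
proves.

```cpp
#include <cstdint>

// Hypothetical helper: the value an allocator sees when a signed 64-bit
// byte count is reinterpreted as unsigned (conversion is modulo 2^64).
constexpr std::uint64_t as_unsigned_bytes(std::int64_t signed_bytes)
{
  return static_cast<std::uint64_t>(signed_bytes);
}

// Hypothetical helper: bytes needed for `count` entries of `entry_size`
// bytes each, computed in signed arithmetic.
constexpr std::int64_t requested_bytes(std::int64_t count,
                                       std::int64_t entry_size)
{
  return count * entry_size;
}

// -8 bytes, seen as unsigned, is the number printed in the log.
static_assert(as_unsigned_bytes(-8) == 18446744073709551608ULL,
              "2^64 - 8");

// Two ways to arrive at -8 bytes (both are assumptions about the cause):
// one entry count of -1 with 8-byte entries, or -2 with 4-byte entries.
static_assert(as_unsigned_bytes(requested_bytes(-1, 8)) ==
                18446744073709551608ULL,
              "count -1, 8-byte entries");
static_assert(as_unsigned_bytes(requested_bytes(-2, 4)) ==
                18446744073709551608ULL,
              "count -2, 4-byte entries");
```

Compiling this file with any C++11 compiler (for instance with the -c flag, 
since it defines no main function) evaluates the checks at compile time. It 
does not say which value was actually negative inside METIS, only that the 
reported size is consistent with a negative count rather than a genuinely 
huge mesh.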

Best,
Chenchen
