Souvik Bhattacherjee wrote:
Hi all,
I'm trying to interleave computation with communication, so I have
resorted to using MPI together with POSIX threads. Specifically, I am
trying to communicate a partial vector v3 while computing an inner
product v1*v2 (mod q). To give you an idea of the platform and the
libraries:
1. Intel dual-socket quad-core machines (8 cores per machine)
2. openmpi 1.3.3 (separate installations on ict6 and ict4 machines)
3. lib64gmp3 4.3.1
4. gcc 4.3.2
5. Interconnect: Gigabit Ethernet
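To make the structure clearer, here is a stripped-down sketch of the
pattern (this is not the attached code: the vector length, modulus, and
all names are placeholders, plain long arithmetic stands in for the GMP
arithmetic, and the actual communication is more involved):

#include <mpi.h>
#include <pthread.h>
#include <stdio.h>

#define N        (1 << 20)  /* vector length (placeholder)                   */
#define NWORKERS 7          /* compute threads; one more thread communicates */
#define Q        1000003L   /* modulus (placeholder)                         */

static long v1[N], v2[N], v3[N], v3_in[N];
static long partial[NWORKERS];

/* Communication thread: ship our partial vector v3 to the peer rank and
 * receive its counterpart while the worker threads are busy computing. */
static void *comm_thread(void *arg)
{
    int peer = *(int *)arg;
    MPI_Request req[2];

    MPI_Isend(v3,    N, MPI_LONG, peer, 0, MPI_COMM_WORLD, &req[0]);
    MPI_Irecv(v3_in, N, MPI_LONG, peer, 0, MPI_COMM_WORLD, &req[1]);
    MPI_Waitall(2, req, MPI_STATUSES_IGNORE);
    return NULL;
}

/* Worker thread: compute one slice of the inner product v1*v2 (mod q).
 * Plain 64-bit long arithmetic here stands in for the GMP arithmetic. */
static void *compute_thread(void *arg)
{
    int id = *(int *)arg;
    long i, sum = 0;
    long lo = (long)id * N / NWORKERS;
    long hi = (long)(id + 1) * N / NWORKERS;

    for (i = lo; i < hi; i++)
        sum = (sum + (v1[i] % Q) * (v2[i] % Q)) % Q;
    partial[id] = sum;
    return NULL;
}

int main(int argc, char **argv)
{
    pthread_t comm, workers[NWORKERS];
    int ids[NWORKERS], rank, peer, provided, i;
    long result = 0;

    /* Threads make MPI calls, so request full thread support and check
     * what the library actually grants. */
    MPI_Init_thread(&argc, &argv, MPI_THREAD_MULTIPLE, &provided);
    if (provided < MPI_THREAD_MULTIPLE)
        fprintf(stderr, "warning: thread level granted = %d\n", provided);

    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    peer = 1 - rank;                      /* meant to be run with -np 2 */

    pthread_create(&comm, NULL, comm_thread, &peer);
    for (i = 0; i < NWORKERS; i++) {
        ids[i] = i;
        pthread_create(&workers[i], NULL, compute_thread, &ids[i]);
    }
    for (i = 0; i < NWORKERS; i++) {
        pthread_join(workers[i], NULL);
        result = (result + partial[i]) % Q;
    }
    pthread_join(comm, NULL);

    printf("rank %d: inner product mod q = %ld\n", rank, result);
    MPI_Finalize();
    return 0;
}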
As in the sketch, I have used a single thread for most of the
communication and the remaining seven threads for computation. Perhaps
this portion of the code has gone wrong somewhere, because the program
terminates with the following error message:
$ mpicc test-vecvecmul.c -lgmp -pthread -Wall -o tvmul
$ mpirun --prefix /usr/local/openmpi-1.3.3/ -np 2 --host ict6,ict4 ./tvmul
[err] event_queue_remove: 0xc1d6b0(fd 10) not on queue 8
[err] event_queue_remove: 0xc1d6b0(fd 10) not on queue 8
[ict6][[21545,1],0][../../../../../ompi/mca/btl/tcp/btl_tcp_frag.c:216:mca_btl_tcp_frag_recv]
mca_btl_tcp_frag_recv: readv failed: Connection reset by peer (104)
--------------------------------------------------------------------------
mpirun has exited due to process rank 1 with PID 17154 on
node ict4 exiting without calling "finalize". This may
have caused other processes in the application to be
terminated by signals sent by mpirun (as reported here).
--------------------------------------------------------------------------
The code is attached. Please suggest where in the code I have gone
wrong. I would also be interested in a more efficient way of
interleaving, if one exists.
**** Can anyone suggest a good tutorial that covers programming with
MPI and POSIX threads/OpenMP?
Regards,
--
Souvik
I got a similar error when using non-blocking communication on large
datasets. I eventually had to switch to blocking communication... Try to
make the code work with blocking communication first and see if that
removes your error, then re-implement the non-blocking version from
there. Doesn't MPI have decent performance when the processes are
located on the same node? Could you perhaps use MPI only, without the
extra threads?
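As a concrete starting point, a blocking variant of the comm_thread in
the sketch above would look something like this (just a sketch; v3,
v3_in and N are the placeholders from that sketch, not your actual
code):

/* Blocking variant of comm_thread from the sketch above: a single
 * MPI_Sendrecv instead of MPI_Isend/MPI_Irecv + MPI_Waitall.  If this
 * runs cleanly across ict6 and ict4, reintroduce the non-blocking
 * calls afterwards. */
static void *comm_thread(void *arg)
{
    int peer = *(int *)arg;

    MPI_Sendrecv(v3,    N, MPI_LONG, peer, 0,
                 v3_in, N, MPI_LONG, peer, 0,
                 MPI_COMM_WORLD, MPI_STATUS_IGNORE);
    return NULL;
}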
- Atle