[OMPI users] About debugging and asynchronous communication

2014-09-18 Thread XingFENG
Dear all, I am new to MPI. Please forgive me if I ask a redundant question. I am now programming about graph processing using MPI. I get two problems as described below. a. How to get more information about errors? I got errors like below. This says that program exited abnormally in function

Re: [OMPI users] About debugging and asynchronous communication

2014-09-18 Thread XingFENG
By the way, I am using Open MPI 1.6.5 and programming with C++. On Thu, Sep 18, 2014 at 4:43 PM, XingFENG <xingf...@cse.unsw.edu.au> wrote: > Dear all, > > I am new to MPI. Please forgive me if I ask a redundant question. > > I am now programming about graph processing

Re: [OMPI users] About debugging and asynchronous communication

2014-09-18 Thread XingFENG
ated upon completion through a wait > statement. > > > 2014-09-18 12:36 GMT+02:00 Jeff Squyres (jsquyres) <jsquy...@cisco.com>: > > On Sep 18, 2014, at 2:43 AM, XingFENG <xingf...@cse.unsw.edu.au> wrote: >> >> > a. How to get more information about err

Re: [OMPI users] About debugging and asynchronous communication

2014-09-18 Thread XingFENG
t;> >> >> I here attach my example script which shows two working cases, and one >> not workning (you can check the memory usage simultaneously and see that >> the first two works, the last one goes ballistic in memory). >> >> Just check it with test to see if it

Re: [OMPI users] About debugging and asynchronous communication

2014-09-18 Thread XingFENG
at 10:07 PM, XingFENG <xingf...@cse.unsw.edu.au> wrote: > Thank you for your reply! I am still working on my codes. I would update > the post when I fix bugs. > > On Thu, Sep 18, 2014 at 9:48 PM, Nick Papior Andersen < > nickpap...@gmail.com> wrote: > >> I just ch

Re: [OMPI users] About debugging and asynchronous communication

2014-09-18 Thread XingFENG
edu> wrote: > There is no guarantee that the messages will be received in the same > order that they were sent. > Use tags or another mechanism to match the messages on send and recv ends. > > On 09/18/2014 10:42 AM, XingFENG wrote: > >> I have found some thing strange

[OMPI users] Running program on a cluster

2014-09-24 Thread XingFENG
Hi all, I got problem with running program on a cluster. I used the following command. *my_hosts* is a file containing 3 hosts while *testMPI* is a very simple MPI program. == *mpirun -np 2 --hostfile ./my_hosts ./testMPImpirun -np 2 --machinefile

Re: [OMPI users] Running program on a cluster

2014-09-24 Thread XingFENG
I have found the solution. The command *mpirun -machinefile ./my_hosts -n 3 ./testMPI* works. I think the order of arguments matters here. On Thu, Sep 25, 2014 at 11:02 AM, XingFENG <xingf...@cse.unsw.edu.au> wrote: > Hi all, > > I got problem with running program on a cl

Re: [OMPI users] Running program on a cluster

2014-09-25 Thread XingFENG
any order is fine. The issue I see > is that your mpiexec isn't the OMPI one, but is from someone else. I have > no idea whose mpiexec you are using > > > On Sep 24, 2014, at 6:38 PM, XingFENG <xingf...@cse.unsw.edu.au> wrote: > > I have found the solution. The command *m

Re: [OMPI users] Running program on a cluster

2014-09-25 Thread XingFENG
It returns /usr/bin/mpiexec. On Thu, Sep 25, 2014 at 8:57 PM, Ralph Castain <r...@open-mpi.org> wrote: > Do "which mpiexec" and look at the path. The options you show are from > MPICH, not OMPI. > > On Sep 25, 2014, at 12:15 AM, XingFENG <xingf...@cse.unsw.edu.au&g

Re: [OMPI users] About debugging and asynchronous communication

2014-10-01 Thread XingFENG
the next message then I guess you have a bug, your counter > is not consistent. > I am pretty sure the error is on your side, I do something similar but > have never experienced anything like that. :) > > 2014-09-19 3:21 GMT+02:00 XingFENG <xingf...@cse.unsw.edu.au>: > &

[OMPI users] About valgrind and OpenMPI

2014-10-02 Thread XingFENG
Hi there, I am using valgrind to help analyse my MPI program. I used hdfs file system to read/write data. And if I run the code without valgrind, it works correctly. However, if I run with valgrind, for example, *mpirun -np 3 /usr/bin/valgrind --tool=callgrind ./myprogram /input_file

Re: [OMPI users] About valgrind and OpenMPI

2014-10-02 Thread XingFENG
e a conflict between valgrind and HDFS. Does valgrind even > support Java programs? I honestly have never tried to do that before. > > > On Oct 2, 2014, at 4:40 AM, XingFENG <xingf...@cse.unsw.edu.au> wrote: > > Hi there, > > I am using valgrind to help analyse m

[OMPI users] Running mpi with different account

2015-04-13 Thread XingFENG
Hi all, I am wondering if it is possible that MPI programs can be run on machines with different account? I am doing experiments with some MPI programs on a cluster. My problem is that my account is limited to use 4 machines (I need more machines to process data). I can borrow my friend's

Re: [OMPI users] Running mpi with different account

2015-04-13 Thread XingFENG
Thanks very much for your reply. I would report later if I succeed. On Tue, Apr 14, 2015 at 11:19 AM, Ralph Castain <r...@open-mpi.org> wrote: > > On Apr 13, 2015, at 5:47 PM, XingFENG <xingf...@cse.unsw.edu.au> wrote: > > Thanks for all who joined the discussion. >

Re: [OMPI users] Running mpi with different account

2015-05-03 Thread XingFENG
wrote: > You can also specify per-machine usernames in $HOME/.ssh/config. > > See ssh_config(5). > > > > > On Apr 13, 2015, at 9:19 PM, Ralph Castain <r...@open-mpi.org> wrote: > > > > > >> On Apr 13, 2015, at 5:47 PM, XingFENG <xingf...

Re: [OMPI users] Running mpi with different account

2015-05-03 Thread XingFENG
:03 PM, XingFENG <xingf...@cse.unsw.edu.au> wrote: > I got some problems while running MPI programs with different account. > I have done the following things. > > 1. I have set password-less connection between these machines. > 2. I have specified user names in host file. >

Re: [OMPI users] Running mpi with different account

2015-05-03 Thread XingFENG
, it seems that daemon process is disabled because I could not find processes orted and sshd any more. Is rsh used instead here? On Mon, May 4, 2015 at 12:54 AM, Ralph Castain <r...@open-mpi.org> wrote: > > On May 3, 2015, at 5:54 AM, XingFENG <xingf...@cse.unsw.edu.au> wrot

Re: [OMPI users] Running mpi with different account

2015-05-04 Thread XingFENG
ttp://blogs.cisco.com/performance/tree-based-launch-in-open-mpi-part-2 > > > > On May 3, 2015, at 8:52 PM, XingFENG <xingf...@cse.unsw.edu.au> wrote: > > > > Thanks very much Ralph. It works fine now. > > > > Just some more questions to make it clearer. > > >

[OMPI users] OPENMPI only recognize 4 cores of AWS EC2 machine

2015-05-24 Thread XingFENG
Hi, OPENMPI users, I have met some problem while doing experiments on Amazon EC2. I have chosen instance type r3.2xlarge. As amazon claimed, it has 8 vcpu. Each vCPU is a hyperthread of an Intel Xeon core for M3, C4, C3, R3, HS1, G2, I2, and D2. (from *https://aws.amazon.com/ec2/instance-types/

Re: [OMPI users] OPENMPI only recognize 4 cores of AWS EC2 machine

2015-05-24 Thread XingFENG
reads), > hence the error message. > > I cannot remember the option to bind to threads, but you can mpirun > --oversubscribe if you are currently stuck > > Cheers, > > Gilles > > > On Sunday, May 24, 2015, XingFENG <xingf...@cse.unsw.edu.au> wrote: > >&g

Re: [OMPI users] OPENMPI only recognize 4 cores of AWS EC2 machine

2015-05-24 Thread XingFENG
es > > On Sunday, May 24, 2015, XingFENG <xingf...@cse.unsw.edu.au> wrote: > >> Hi, Gilles, >> >> Thanks very much for your respond. But if I oversubscribe, would the >> performance be influenced? >> >> On Sun, May 24, 2015 at 7:24 PM, Gilles Gouaillar

[OMPI users] about MPI communication complexity

2015-09-30 Thread XingFENG
Hi, every one, I am working with open-mpi. When I tried to analyse performance of my programs, I find it is hard to understand the communication complexity of MPI routines. I have found some page on Internet such as http://stackoverflow.com/questions/10625643/mpi-communication-complexity This