Re: [OMPI devel] [PATCH] fix mx btl_bandwidth

2010-09-03 Thread Bogdan Costescu
On Fri, Sep 3, 2010 at 3:47 PM, Jeff Squyres wrote: > It might be worth having even a Linux-specific way to auto-detect, just for > this use case (which is becoming more common -- 1GB LOM and a 10GB non-iWARP > NIC). The file: /sys/class/net/ethX/speed should contain the current speed and is
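The Linux-specific auto-detection mentioned in this message could be sketched roughly as follows. This is only an illustration, not code from the thread or from Open MPI: the function names are hypothetical, and it assumes the sysfs file holds a decimal value in Mbit/s, with a failed read or a non-positive value meaning the speed is unknown (for example when the link is down).

```c
#include <assert.h>
#include <stdio.h>

/* Parse the text of a sysfs "speed" file (a decimal number of Mbit/s).
 * Returns the speed, or -1 if the text is not a positive number. */
long parse_link_speed_mbps(const char *text)
{
    long speed;

    assert(text != NULL);
    if (sscanf(text, "%ld", &speed) != 1 || speed <= 0)
        return -1;
    return speed;
}

/* Read /sys/class/net/<ifname>/speed.
 * Returns the speed in Mbit/s, or -1 if it cannot be determined. */
long read_link_speed_mbps(const char *ifname)
{
    char path[256];
    char buf[32];
    FILE *fp;
    long speed = -1;
    int n;

    assert(ifname != NULL);
    n = snprintf(path, sizeof(path), "/sys/class/net/%s/speed", ifname);
    if (n < 0 || (size_t)n >= sizeof(path))
        return -1;              /* interface name too long for the buffer */

    fp = fopen(path, "r");
    if (fp == NULL)
        return -1;              /* no such interface, or no sysfs */

    /* The read itself can fail when the driver cannot report a speed. */
    if (fgets(buf, sizeof(buf), fp) != NULL)
        speed = parse_link_speed_mbps(buf);
    fclose(fp);
    return speed;
}
```

A caller would fall back to a configured default whenever -1 is returned, since virtual interfaces and some drivers do not report a usable speed.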

Re: [OMPI devel] System V Shared Memory for Open MPI: Request for Community Input and Testing

2010-04-28 Thread Bogdan Costescu
On Tue, Apr 27, 2010 at 7:55 PM, Samuel K. Gutierrez wrote: > With Jeff and Ralph's help, I have completed a System V shared memory > component for Open MPI. What is the motivation for this work ? Are there situations where the mmap based SM component doesn't work or is slow(er) ? Kind regards,

Re: [OMPI devel] kernel 2.6.23 vs 2.6.24 - communication/wait times

2010-04-10 Thread Bogdan Costescu
On Sat, Apr 10, 2010 at 5:51 AM, Eugene Loh wrote: > Why is shared-memory performance about four orders of magnitude slower than > it should be? Have there been any process scheduler changes in the newer kernels ? I'm not sure that they could explain four orders of magnitude differences though...

Re: [OMPI devel] Rankfile related problems

2010-03-02 Thread Bogdan Costescu
On Mon, Mar 1, 2010 at 9:15 PM, Ralph Castain wrote: > Tracking this down has reminded me of all the reasons why I despise the > rankfile mapper... :-/ Thanks for all your efforts ! I'm using the rankfile mapper as this is the documented (in the FAQ) affinity-related one at least for the stable

Re: [OMPI devel] Rankfile related problems

2010-03-01 Thread Bogdan Costescu
On Sat, Feb 27, 2010 at 7:35 PM, Ralph Castain wrote: > I can't seem to replicate this first problem - it runs fine for me even if > the rankfile contains only one entry. First of all, thanks for taking a look at this ! For me it's repeatable. Please note that I do specify '-np 4' even when in

[OMPI devel] Rankfile related problems

2010-02-15 Thread Bogdan Costescu
Hi! With version 1.4.1 I get a rather strange crash in mpirun whenever I try to run a job using (I think) a rankfile which doesn't contain the specified number of ranks. F.e. I ask for 4 ranks ('-np 4'), but the rankfile contains only one entry: rank 0=mbm-01-24 slot=1:* and the following comes

Re: [OMPI devel] Improvement of openmpi.spec

2009-08-06 Thread Bogdan Costescu
ge the name of the package, f.e. to allow installing several packages at the same time by simply changing: Name: fftw2 to allow for the package called 'fftw' to track the 3.x versions. This was done previously by Red Hat f.e. for their python packages.

Re: [OMPI devel] Hang in collectives involving shared memory

2009-06-10 Thread Bogdan Costescu
build should also solve the problem, or do I interpret things the wrong way ? -- Bogdan Costescu IWR, University of Heidelberg, INF 368, D-69120 Heidelberg, Germany Phone: +49 6221 54 8240, Fax: +49 6221 54 8850 E-mail: bogdan.coste...@iwr.uni-heidelberg.de

Re: [OMPI devel] Hang in collectives involving shared memory

2009-06-10 Thread Bogdan Costescu
several times. This, together with the earlier post also describing a negative result, points to a problem related to your particular setup...

Re: [OMPI devel] Still troubles with 1.3 and MX

2009-01-23 Thread Bogdan Costescu
e conditions and will write back in case this problem appears again.

[OMPI devel] Still troubles with 1.3 and MX

2009-01-22 Thread Bogdan Costescu
8 in main () Can anyone suggest some ways forward ? I'd be happy to help in debugging if given some instructions. Thanks in advance!

Re: [OMPI devel] 1.3 PML default choice

2009-01-13 Thread Bogdan Costescu
ction "Network Support", there is a paragraph saying: Open MPI will, by default, choose to use "cm" if it finds a cm-supported network at run-time. With the MX MTL being available at run-time, I would expect CM to be chosen based on the quoted paragraph.

Re: [OMPI devel] 1.3 PML default choice

2009-01-13 Thread Bogdan Costescu
be chosen; for v1.3rc3 I can't distinguish anymore from timings as they behave very similarly.

Re: [OMPI devel] Open MPI v1.3rc3 has been posted

2009-01-13 Thread Bogdan Costescu
3rc3. That message has also raised the question of selection of default components, but as there was no reply to it, I still don't have any idea whether my testing or the docs were wrong; can someone clear this up ? Thanks for all the work put in v1.3 !

Re: [OMPI devel] Preparations for moving the btl's

2008-12-03 Thread Bogdan Costescu
I really unnecessary ? :-)

[OMPI devel] 1.2.7 hung in IMB at 16K collectives with MX

2008-11-26 Thread Bogdan Costescu
gs ? Can someone suggest what to do to avoid them or at least a way to debug this ? Thanks in advance !

[OMPI devel] 1.3 PML default choice

2008-11-17 Thread Bogdan Costescu
to behave so poorly with MX ? Thanks for any insight into these issues.

Re: [OMPI devel] memcpy MCA framework

2008-08-18 Thread Bogdan Costescu
y available ? I'd be very much interested in using it. I've talked to George Bosilca at ISC08 about the issue of choosing the right settings for (as close as possible to) maximum OpenMPI performance on a given cluster and his answer was 'we regularly organize workshops'

Re: [OMPI devel] memcpy MCA framework

2008-08-18 Thread Bogdan Costescu
lf. This sounds very much like the collectives tuning, with MCA params to give the admin or user view of how the best performance can be achieved.

Re: [OMPI devel] Proposed changes to maximum size of sm btl backing file

2008-08-12 Thread Bogdan Costescu
On Tue, 12 Aug 2008, Rolf Vandevaart wrote: I propose bumping the max for 32-bit programs to 2G and for 64-bit programs to 8G. Can't this be dynamically adjusted depending on the amount of RAM and CPUs/cores ?
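To make the question concrete, here is a minimal sketch of what sizing from RAM and core count might look like. The policy is purely hypothetical (a fixed amount per core, capped at a quarter of physical RAM and at a hard maximum such as the 2G/8G figures quoted above); neither the function names nor the policy come from the thread or from Open MPI. It assumes a Linux/glibc system where `sysconf(_SC_PHYS_PAGES)` is available.

```c
#include <assert.h>
#include <unistd.h>

/* Physical RAM in bytes, or 0 if it cannot be determined. */
unsigned long long physical_ram_bytes(void)
{
    long pages = sysconf(_SC_PHYS_PAGES);
    long page_size = sysconf(_SC_PAGESIZE);

    if (pages <= 0 || page_size <= 0)
        return 0;
    return (unsigned long long)pages * (unsigned long long)page_size;
}

/* Hypothetical policy: bytes_per_core for each core, capped at one quarter
 * of RAM (when RAM is known) and at hard_max. */
unsigned long long suggest_max_backing_size(unsigned long long ram_bytes,
                                            long ncores,
                                            unsigned long long bytes_per_core,
                                            unsigned long long hard_max)
{
    unsigned long long size;

    assert(bytes_per_core > 0);
    if (ncores < 1)
        ncores = 1;             /* core count unknown: assume one */

    size = (unsigned long long)ncores * bytes_per_core;
    if (ram_bytes > 0 && size > ram_bytes / 4)
        size = ram_bytes / 4;
    if (size > hard_max)
        size = hard_max;
    return size;
}
```

The inputs would come from `physical_ram_bytes()` and `sysconf(_SC_NPROCESSORS_ONLN)`; the per-core amount and the one-quarter cap are placeholders that would need tuning.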

Re: [OMPI devel] open ib dependency question

2008-07-11 Thread Bogdan Costescu
On Thu, 10 Jul 2008, Pavel Shamis (Pasha) wrote: FYI the issue was resolved - https://svn.open-mpi.org/trac/ompi/ticket/1376 Indeed, no more IBCM error message displayed with r18878. Thank you !

Re: [OMPI devel] IOF repair

2008-07-11 Thread Bogdan Costescu
viour mentioned in a previous e-mail (which happened with 1.3a1r18769) has disappeared, CHARMM can again read its instructions properly from stdin. Thanks for the quick resolution!

Re: [OMPI devel] IOF repair

2008-07-10 Thread Bogdan Costescu
setup your clusters have, but most that I have seen, including all those that I admin, do run mpirun/mpiexec and rank=0 on the same node. I really think that this will bite a lot of people.

Re: [OMPI devel] open ib dependency question

2008-07-07 Thread Bogdan Costescu
t to it; my 1.3 list is already pretty long... -- Bogdan Costescu IWR, University of Heidelberg, INF 368, D-69120 Heidelberg, Germany Phone: +49 6221 54 8869/8240, Fax: +49 6221 54 8868/8850 E-mail: bogdan.coste...@iwr.uni-heidelberg.de

Re: [OMPI devel] open ib dependency question

2008-07-04 Thread Bogdan Costescu
l openib,self" so I think that the IB stack is still being used (there is also a TCP/GigE network which could be chosen otherwise). I don't know whether this is caused by a somehow inconsistent setup of the system, but I would welcome an option to make 1.3a behave like 1.2.

Re: [OMPI devel] [OMPI svn] svn:open-mpi r17307

2008-01-30 Thread Bogdan Costescu
tested this recently with the RHEL5 kernels with one gigabit and one Myri-10G connection, seeing a TCP stream switching randomly between the gigabit and the Myri-10G connection.

Re: [OMPI devel] PathScale 3.0 problems with Open MPI 1.2.[34]

2007-10-31 Thread Bogdan Costescu
On Tue, 30 Oct 2007, Bogdan Costescu wrote: Bad timing... I don't have access to the files at the moment, I'll write back shortly (which probably means tomorrow ;-)). Here they are: http://spider.iwr.uni-heidelberg.de/~bogdan/openmpi/ Due to their size, I decided to put them u

Re: [OMPI devel] PathScale 3.0 problems with Open MPI 1.2.[34]

2007-10-30 Thread Bogdan Costescu
t the generated compile line for orterun and its library functions ended up being. This can include the "libtool" lines. Bad timing... I don't have access to the files at the moment, I'll write back shortly (which probably means tomorrow ;-)).

Re: [OMPI devel] PathScale 3.0 problems with Open MPI 1.2.[34]

2007-10-30 Thread Bogdan Costescu
ages already included the configure line; the environment is only modified to set CC=pathcc, etc. and there are no options on the make line; just like the documentation says that I should do...

Re: [OMPI devel] PathScale 3.0 problems with Open MPI 1.2.[34]

2007-10-24 Thread Bogdan Costescu
and Brian recommend using "--without-memory-manager", I won't feel bad about doing it :-) Thanks a lot !

Re: [OMPI devel] PathScale 3.0 problems with Open MPI 1.2.[34]

2007-10-23 Thread Bogdan Costescu
e compiler). As I wrote in my previous e-mail, I tried configuring with and without the MX libs, but this made no difference. It's only when I disabled the memory manager, while still enabling MX, that I was able to get a working build.

Re: [OMPI devel] PathScale 3.0 problems with Open MPI 1.2.[34]

2007-10-23 Thread Bogdan Costescu
.2.3) MCA sds: seed (MCA v1.0, API v1.0, Component v1.2.3) MCA sds: singleton (MCA v1.0, API v1.0, Component v1.2.3) MCA sds: pipe (MCA v1.0, API v1.0, Component v1.2.3) MCA sds: slurm (MCA v1.0, API v1.0, Component v1

[OMPI devel] PathScale 3.0 problems with Open MPI 1.2.[34]

2007-10-22 Thread Bogdan Costescu
assing errors that I might have made, before filing a bug report. The existing bugs related to PathScale compilers don't seem to describe the symptoms that I'm seeing, unless it's some kind of threading issue which seems to have no resolution yet... Thanks in ad

Re: [OMPI devel] Multi-environment builds

2007-07-10 Thread Bogdan Costescu
ection and selection is supposed to be for. Yes, I understand that, it's the same type of mechanism as in LAM/MPI which is not that foreign to me ;-)

Re: [OMPI devel] Multi-environment builds

2007-07-10 Thread Bogdan Costescu
case where there are several versions of the same batch system installed, all using the same configuration files and therefore being ready to run ? And how about the case where there is a machine reserved for compilations, where libraries are made available but there is no batch system active ?

Re: [OMPI devel] (loose) SGE Integration fails, why?

2007-06-25 Thread Bogdan Costescu
y some effort on integrating ssh as well, the problem being that the ssh daemon needs some modifications to allow SGE to obtain accounting information. There was also some talk about a TM-like API; unfortunately the progress in this area seems to be very slow, if there is any at all...

Re: [OMPI devel] [IPv6] new component oob/tcp6

2006-09-06 Thread Bogdan Costescu
ion is only about a proof of concept version, then I'd say that anything to show IPv6 functionality would be acceptable.

Re: [OMPI devel] IPv6 support in OpenMPI?

2006-04-04 Thread Bogdan Costescu
munication via IPv4). -- Bogdan Costescu IWR - Interdisziplinaeres Zentrum fuer Wissenschaftliches Rechnen Universitaet Heidelberg, INF 368, D-69120 Heidelberg, GERMANY Telephone: +49 6221 54 8869, Telefax: +49 6221 54 8868 E-mail: bogdan.coste...@iwr.uni-heidelberg.de

Re: [OMPI devel] IPv6 support in OpenMPI?

2006-04-03 Thread Bogdan Costescu
as IPs (either v4 or v6), OpenMPI should probably assume that the address as given can be passed further to the underlying mechanism for starting the job (for example, for SGE this would be its own rsh client, not the system rsh client); but how about machines given as names ?

Re: [OMPI devel] IPv6 support in OpenMPI?

2006-04-03 Thread Bogdan Costescu
ut ;-)) For example, we ran several weeks without an IPv6-enabled rsh, which is used to handle MPI job startup on the cluster, without any problems. What do you mean by "IPv6-enabled rsh" ? Was it the daemon, client or both ?

Re: [OMPI devel] IPv6 support in OpenMPI?

2006-03-31 Thread Bogdan Costescu
some user-provided mapping. That's all that I remember now from my IPv6 endeavour with LAM/MPI. IMHO, some discussion of them should occur before the actual coding...

Re: [O-MPI devel] Linux processor affinity

2005-12-13 Thread Bogdan Costescu
se changes until a final, stable API was established, but when you want to be the first to claim "I support this and that"... Thanks for your diligence in pestering us about this! :-) Eh, don't mention it! I want Open MPI to work :-)

Re: [O-MPI devel] Linux processor affinity

2005-12-12 Thread Bogdan Costescu
ssociated to case 1. of _get. Cases 2. and 3. of _set are both associated to case 2. of _get. So IMHO the test should be made with the _get function (as explained in a previous message), by setting len=sizeof(long) which would allow the case 1. to work fine, while case 2. would return -EINVAL, exac

Re: [O-MPI devel] Linux processor affinity

2005-12-09 Thread Bogdan Costescu
ample by a (smart, don't know if such exists now) batch system. I haven't checked if it's possible, but I think that a similar solution based on sched_getaffinity would be much better, as this would not disturb the current settings.
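As an illustration of the read-only approach suggested here, the sketch below queries the current affinity mask with the glibc `sched_getaffinity` wrapper and never calls `sched_setaffinity`, so any binding set up beforehand (for example by a batch system) is left untouched. The helper names are hypothetical, and it assumes a glibc that provides `cpu_set_t` and the `CPU_*` macros, which is not the case for the older glibc versions discussed in this thread.

```c
#ifndef _GNU_SOURCE
#define _GNU_SOURCE             /* needed for cpu_set_t and CPU_* macros */
#endif
#include <assert.h>
#include <sched.h>

/* Fill *mask with the CPUs the calling process is allowed to run on.
 * Returns 0 on success, -1 on error (errno set by sched_getaffinity).
 * This only reads the affinity; it never modifies it. */
int get_allowed_cpus(cpu_set_t *mask)
{
    assert(mask != NULL);
    CPU_ZERO(mask);
    return sched_getaffinity(0, sizeof(*mask), mask);
}

/* Number of CPUs in the current affinity mask, or -1 on error.
 * A fixed-size cpu_set_t can be too small on very large machines,
 * in which case the call fails with EINVAL. */
int count_allowed_cpus(void)
{
    cpu_set_t mask;

    if (get_allowed_cpus(&mask) == -1)
        return -1;
    return CPU_COUNT(&mask);
}
```

A return value of -1 tells the caller that affinity information is unavailable, at which point it can skip any binding logic rather than probe by setting a mask.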

Re: [O-MPI devel] Linux processor affinity

2005-12-09 Thread Bogdan Costescu
the _glibc_ function that changes prototype.

Re: [O-MPI devel] Linux processor affinity

2005-11-29 Thread Bogdan Costescu
d, unsigned int, len, unsigned long *, user_mask_ptr) int main(int argc, char **argv){ unsigned long cpus = 1; int r; r = sched_setaffinity(0, sizeof(cpus), &cpus); if (r == -1) { perror("sched_setaffinity:"); } return