Re: [OMPI devel] Need to know your Github ID

2014-09-18 Thread Alex Margolin
alex -> alex-ma alinas -> alinask amikheev -> alex-mikheev vasily -> vasilyMellanox On Wed, Sep 10, 2014 at 1:46 PM, Jeff Squyres (jsquyres) wrote: > As the next step of the planned migration to Github, I need to know: > > - Your Github ID (so that you can be added to the new OMPI git repo)

Re: [OMPI devel] mosix patches

2014-04-28 Thread Alex Margolin
Thanks jeff, I'll make it work. I'm moving my apartment tomorrow, so I hope I'll get to it on the weekend. On 28 Apr 2014 18:09, "Jeff Squyres (jsquyres)" wrote: > If it helps, I just updated that to the most recent OMPI trunk SVN > revision. There were at least 3 files that had conflicts, thou

Re: [OMPI devel] mosix patches

2014-04-25 Thread Alex Margolin
I'm the original developer. The patch never got merged, but I have patches to some branches. Which version are you using? On 24 Apr 2014 19:07, "Pavel V. Kaygorodov" wrote: > Hi! > > What is current status of mosix support in OpenMPI? > I have tried patches from > http://www.cs.huji.ac.il/wikis/M

[OMPI devel] How to read OPAL_OUTPUT-ed strings

2014-02-17 Thread Alex Margolin
Hi, I'm having trouble getting the OPAL_OUTPUT to print. I'm trying the following command line (with no success): `pwd`/osh_install/bin/oshrun --map-by node -np 2 -mca orte_debug true -mca orte_debug_verbose 100 -mca orte_report_silent_errors true -mca orte_map_stddiag_to_stderr true ./examples/

Re: [OMPI devel] (no subject)

2013-09-06 Thread Alex Margolin
directory `/a/store-04/h/lab/mosix/alexam02/ompi-jeff/opal' make: *** [all-recursive] Error 1 Should this be a trac ticket? Alex On Fri, Sep 6, 2013 at 1:22 PM, Alex Margolin wrote: > Hi, > > I'm building ompi r29104 with the following command: > > make distclean &

[OMPI devel] (no subject)

2013-09-06 Thread Alex Margolin
Hi, I'm building ompi r29104 with the following command: make distclean && ./autogen.sh && ./configure --prefix=/cs/mosna/alexam02/ompi CFLAGS=-m64 CXXFLAGS=-m64 --without-hwloc --disable-mpi-threads --disable-progress-threads --enable-mca-no-build=maffinity,paffinity --enable-contrib-no-build=li

Re: [OMPI devel] Mosix support?

2013-04-26 Thread Alex Margolin
Alex On 25 Apr 2013 06:01, "Ralph Castain" wrote: > Sending this to devel because we lack a current address for Alex Margolin > - can someone please review the Mosix support requested to transfer to v1.7 > in CMR #3411? We need (a) to know the status of that code, (b) is it >

[OMPI devel] [patch] MOSIX support complete

2012-07-11 Thread Alex Margolin
Hi, I'm not sure if anyone remembers, but I was working on Open MPI support for MOSIX in the form of several MCA modules (turned out to be BTL, ODLS, and RAS). It's pretty much finished now, thanks to your help (I got many useful tips and clarifications from this mailing list). As I said befo

Re: [OMPI devel] How to debug segv

2012-04-25 Thread Alex Margolin
u got the banner of the FT benchmark, so I guess at least the rank 0 successfully completed the MPI_Init call. This is a hint that you should investigate more into the point-to-point logic of your mosix BTL. george. On Apr 25, 2012, at 09:30 , Alex Margolin wrote: NAS Parallel Bench

Re: [OMPI devel] How to debug segv

2012-04-25 Thread Alex Margolin
On 04/25/2012 02:57 PM, Ralph Castain wrote: Strange that your code didn't generate any symbols - is that a mosix thing? Have you tried just adding opal_output (so it goes to a special diagnostic output channel) statements in your code to see where the segfault is occurring? It looks like you

[OMPI devel] How to debug segv

2012-04-25 Thread Alex Margolin
Hi, I'm getting a segv error off my build of the trunk. I know that my BTL module is responsible ("-mca btl self,tcp" works, "-mca btl self,mosix" fails). Smaller/simpler test applications pass, NPB doesn't. Can anyone suggest how to proceed with debugging this? my attempts include some debug

[OMPI devel] OPAL polling optimization

2012-04-19 Thread Alex Margolin
Hi, I'm writing a new polling module and I stumbled upon some strange code: The following function is implemented in openmpi-trunk/opal/mca/event/libevent2013/libevent/signal.c : evsig_add(struct event_base *base, evutil_socket_t evsignal, short old, short events, void *p) - It appears the

Re: [OMPI devel] RTE node allocation component

2012-04-14 Thread Alex Margolin
On 04/15/2012 12:36 AM, Ralph Castain wrote: The 1.6 branch is a stable series - no new features will be added to it, so your patch won't be going there. I'd focus solely on the trunk. - Can I add my module(s) to 1.5 branch? (as well as the trunk?) What you're doing with he RAS is fine for no

Re: [OMPI devel] RTE node allocation component

2012-04-14 Thread Alex Margolin
On 04/15/2012 12:36 AM, Ralph Castain wrote: The 1.6 branch is a stable series - no new features will be added to it, so your patch won't be going there. I'd focus solely on the trunk. OK, but what would you recommend for benchmarking? a local 1.6 checkout? What you're doing with he RAS is fin

Re: [OMPI devel] RTE node allocation component

2012-04-14 Thread Alex Margolin
d idea - remember, multiple jobs can be launching at the same time! On Apr 13, 2012, at 10:07 AM, Alex Margolin wrote: Hi, The next component I'm writing is a component for allocating nodes to run the processes of an MPI job. Suppose I have a "getbestnode" executable which not onl

[OMPI devel] RTE node allocation component

2012-04-13 Thread Alex Margolin
Hi, The next component I'm writing is a component for allocating nodes to run the processes of an MPI job. Suppose I have a "getbestnode" executable which not only tells me the best location for spawning a new process, but it also reserves the space (for some time), so that every time I run it I

[OMPI devel] Using opal_convertor_t for In-place send buffers in a BTL component

2012-04-05 Thread Alex Margolin
Hi, First, I'm glad to say my MOSIX component is working and giving good initial result. Thanks for all your help! I'm not sure how (I know I should fill in some license agreement docs), but I would like to contribute the code to the Open-MPI project. Is there an official code-review process? a

[OMPI devel] mca_btl_tcp_alloc

2012-04-04 Thread Alex Margolin
Hi, As I'm working out the bugs in my component I used TCP as reference and came across the following: In mca_btl_tcp_alloc (openmpi-trunk/ompi/mca/btl/tcp/btl_tcp.c:188) the first segment is initialized to point to "frag + 1". I don't get it... how/when is this location allocated? Isn't it jus

Re: [OMPI devel] New MOSIX components draft

2012-04-02 Thread Alex Margolin
_module_t mosix_module; /**< local module */ }; typedef struct mca_btl_mosix_component_t mca_btl_mosix_component_t; You can then overload that component with your additional info, leaving the base component to contain the required minimal elements. On Apr 1, 2012, at 1:59 AM, Ale

Re: [OMPI devel] New MOSIX components draft

2012-04-01 Thread Alex Margolin
do with the way I initialize my component - I'll resume debugging after lunch. Alex On 03/31/2012 07:04 PM, Alex Margolin wrote: P.S. I get the following Error - I'm pretty sure my BTL is to blame here: alex@singularity:~/huji/benchmarks/simple$ mpirun -mca btl_base_verbose 100 -mc

Re: [OMPI devel] New MOSIX components draft

2012-03-31 Thread Alex Margolin
e kernel optimizations on the TCP side. Still, UDP may perform better with fire-and-forget scenarios. Thanks a lot (and sorry for the hassle), Alex On 03/31/2012 07:04 PM, Alex Margolin wrote: Hi, I think i'm close to finishing an initial version of the MOSIX support for open-mpi. A perl

Re: [OMPI devel] New MOSIX components draft

2012-03-31 Thread Alex Margolin
o real way to support the way MOSIX wants to manage resources without the constraint that MOSIX only operate at a job level - i.e., it start all specified procs at the beginning of time, and it not migrate them. Kinda defeated the intent of MOSIX. On Mar 31, 2012, at 10:04 AM, Alex Margolin wrote:

[OMPI devel] New MOSIX components draft

2012-03-31 Thread Alex Margolin
Hi, I think i'm close to finishing an initial version of the MOSIX support for open-mpi. A perliminary draft is attached. The support consists of two modules: ODLS module for launching processes under MOSIX, and BTL module for efficient communication between processes. I'm not quite there yet -

Re: [OMPI devel] Replacing poll()

2012-03-19 Thread Alex Margolin
d this is the minimum you have to implement. The RMA protocols (GET or PUT) are optional, and are specified by setting specific bits in your BTL flag. Regarding the TCP BTL, the two RMA operations are "fake", they are simply implemented on top of mca_btl_tcp_endpoint_send. george

Re: [OMPI devel] Replacing poll()

2012-03-17 Thread Alex Margolin
ch I assume notifies the upper layer of a message, but this is only for MCA_BTL_TCP_HDR_TYPE_SEND. What about MCA_BTL_TCP_HDR_TYPE_PUT? Thanks, Alex On 03/04/2012 02:54 AM, George Bosilca wrote: On Mar 3, 2012, at 18:18 , Alex Margolin wrote: I've figured that what I really need i

Re: [OMPI devel] New odls component fails

2012-03-17 Thread Alex Margolin
posix, sysv) and orte wouldn't start without any (so I had to turn it back on). Could you tell me if there is a way to run the application without making any mmap() calls with MAP_SHARED? Currently, mosrun is run with -w asking it to fail (return -1) on any such system-call. Thanks for

[OMPI devel] New odls component fails

2012-03-17 Thread Alex Margolin
Hi, I want to launch Open-MPI processes using another process: instead of using "hello" x 4 I want to run "mosrun -w hello" x 4 when I start it with "mpirun -n 4 hello". I've cloned the "default" component in orte/mca/odls (from trunk) - see patch attached. I'm getting an error which is rela

[OMPI devel] MCA BTL Fragment lists

2012-03-09 Thread Alex Margolin
Hi, I'm implementing a new BTL component, and 1. I read the TCP code and ran into the three fragment lists: /* free list of fragment descriptors */ ompi_free_list_t tcp_frag_eager; ompi_free_list_t tcp_frag_max; ompi_free_list_t tcp_frag_user; I've looked it up, and found that

Re: [OMPI devel] Replacing poll()

2012-03-03 Thread Alex Margolin
he btl_progress function will get called by the PML quite frequently. It's how BTL's like openib progress their outstanding message passing. On Mar 2, 2012, at 2:22 PM, Alex Margolin wrote: On 03/02/2012 04:33 PM, Jeffrey Squyres wrote: Note that the OMPI 1.4.x series is about to be

Re: [OMPI devel] Replacing poll()

2012-03-02 Thread Alex Margolin
On 03/02/2012 04:33 PM, Jeffrey Squyres wrote: Note that the OMPI 1.4.x series is about to be retired. If you're doing new stuff, I'd advise you to be working with the Open MPI SVN trunk. In the trunk, we've changed how we build libevent, so if you're adding to it, you probably want to be w

Re: [OMPI devel] Replacing poll()

2012-03-02 Thread Alex Margolin
/x86_64-linux-gnu/libpthread.so.0(+0x10060) [0x7f7d6ac26060] [singularity:32552] *** End of error message *** Segmentation fault alex@singularity:~/huji/benchmarks/simple$ Any ideas? On 03/02/2012 01:26 PM, Alex Margolin wrote: Hi, I'm trying to replace the poll() function with mine (say

[OMPI devel] Replacing poll()

2012-03-02 Thread Alex Margolin
Hi, I'm trying to replace the poll() function with mine (say poll2() in poll2.c), and I got some building errors. This is after I copied poll2.c into opal/events/ and added it in the sources list in Makefile.am in that folder. ... Making all in tools/wrappers make[2]: Entering directory `/hom

[OMPI devel] Adding a BTL module implementing poll()

2010-10-31 Thread Alex Margolin
Hi, I'm developing a new module under for BTL component to utilize an existing distributed computing software in our lab. I decided to write a TCP-like interface (implementing socket(), connect(), accept(), send(), recv(), etc.) and then copy and modify the existing BTL TCP module to create my own.