Thanks, the problem is solved. I had previously installed an older version of Open MPI as root for all users, and then installed Open MPI 1.0 for myself. Although the older version was not in my path, it seems to have been causing the problems. Removing the previous libraries and runtimes solved the problem.

thanks,
-Manjunath

On Apr 22, 2006, at 11:12 AM, Ralph Castain wrote:

Another thing to try - go to your installation location's lib subdirectory (at $prefix/lib) and delete everything that is there. Then go back to the directory where you put the software and do a "make install" again.

Sometimes, especially if you are upgrading to a new version, you can be burned by stale shared libraries. This sounds like it could be the problem here. We don't remove any old libraries when you do an installation, so if you change versions, you really should do this procedure to avoid picking up "old stuff".

Alternatively, you could build and run without shared libraries to avoid this problem altogether - just reconfigure with "--enable-static --disable-shared" and then do "make clean all install".

Ralph
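
Before deleting everything under $prefix/lib as suggested above, it can help to see which files look like leftovers. The following is only a rough sketch, not part of Open MPI: the function name is hypothetical, and it rests on the assumption that "make install" gives every file it writes a fresh modification time, so anything much older than the newest file is probably from an earlier install.

```python
import os


def find_stale_files(lib_dir, window_seconds=3600):
    """List files under lib_dir that are much older than the newest file.

    Heuristic only (an assumption, not Open MPI behaviour): files written
    by the most recent "make install" share roughly the same timestamp.
    """
    mtimes = {}
    # Walk subdirectories too, since plugins may live below lib/.
    for root, _dirs, files in os.walk(lib_dir):
        for name in files:
            path = os.path.join(root, name)
            # lstat judges a symlink by its own timestamp, not its target's.
            mtimes[os.path.relpath(path, lib_dir)] = os.lstat(path).st_mtime
    if not mtimes:
        return []
    newest = max(mtimes.values())
    return sorted(rel for rel, t in mtimes.items()
                  if newest - t > window_seconds)
```

Calling find_stale_files() on the lib directory of the installation prefix returns the relative paths worth a second look. An empty list does not prove the directory is clean, so the delete-and-reinstall procedure remains the safer fix.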


Brian Barrett wrote:
Well, so much for the easy one :(.

Is it possible that you have two versions of Open MPI in your path
somewhere and that you might be getting different versions on
different nodes?  The errors below generally indicate that data was
received in a totally different format than expected, so I'm just
kind of guessing as to how one could get to that situation...

Brian
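
The two-installations question above can be checked with a small script run on each node. This is a minimal sketch under stated assumptions: find_on_path is a hypothetical helper, and "mpirun" is used only as a convenient program to probe for.

```python
import os


def find_on_path(program, path=None):
    """Return every PATH directory that holds an executable named program."""
    if path is None:
        path = os.environ.get("PATH", "")
    hits = []
    for directory in path.split(os.pathsep):
        if not directory or directory in hits:
            continue  # skip empty entries and repeated directories
        candidate = os.path.join(directory, program)
        if os.path.isfile(candidate) and os.access(candidate, os.X_OK):
            hits.append(directory)
    return hits
```

If find_on_path("mpirun") returns more than one directory on a node, or different directories on different nodes, the nodes may be mixing versions. Note that a non-interactive remote shell can see a different PATH than a login shell, so it is worth running the check the same way the daemons are launched.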

On Apr 21, 2006, at 5:01 PM, Manjunath G Venkata wrote:


On Thu, 20 Apr 2006, Brian Barrett wrote:


Are these both identical architecture?  Those look suspiciously
like what happens when you're trying to mix 32/64 bit or little
endian / big endian.


- Both of my nodes are Intel Xeons and run Linux 2.4.26.

-Manjunath


Brian
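
The 32/64-bit and endianness question above can be answered per node rather than inferred from the CPU model. A minimal sketch, with arch_fingerprint as a hypothetical name; it reports the properties of the Python interpreter it runs under, which is only a proxy for how Open MPI itself was built.

```python
import platform
import struct
import sys


def arch_fingerprint():
    """Summarise word size and byte order for the running interpreter."""
    return {
        # Hardware name as reported by the OS, e.g. "i686" or "x86_64".
        "machine": platform.machine(),
        # Pointer width of this Python build, not of any other binary.
        "pointer_bits": struct.calcsize("P") * 8,
        # "little" or "big".
        "byte_order": sys.byteorder,
    }
```

Running this on both nodes and comparing the dictionaries shows whether they differ in word size or byte order. Matching output does not rule out a version mismatch.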

On Apr 20, 2006, at 8:53 PM, Galen M. Shipman wrote:


Hey Guys,
Not sure what is going on here. Has anyone seen this before?
- Galen

Hi Galen,
Sorry to bother you.
I have installed the latest stable version of Open MPI (1.0) on two of the spider nodes (s7, s4) for some experiments, but there seems to be a configuration error or something else that I don't understand. After installing, as a test I ran a simple MPI program, but it complains with the following errors:
[s4:10685] [0,0,0] ORTE_ERROR_LOG: Pack data mismatch in file dps_unpack.c at line 121
[s4:10685] [0,0,0] ORTE_ERROR_LOG: Pack data mismatch in file dps_unpack.c at line 95
Further digging with gdb prints the following errors:
[s7:07005] ERROR: A daemon on node s4 failed to start as expected.
[s7:07005] ERROR: There may be more information available from
[s7:07005] ERROR: the remote shell (see above).
[s7:07005] The daemon received a signal 5.
[s7:07005] [0,0,0] ORTE_ERROR_LOG: Pack data mismatch in file dps_unpack.c at line 121
[s7:07005] [0,0,0] ORTE_ERROR_LOG: Pack data mismatch in file dps_unpack.c at line 95
[s7:07005] [0,0,0] ORTE_ERROR_LOG: Pack data mismatch in file dps_unpack.c at line 121
[s7:07005] [0,0,0] ORTE_ERROR_LOG: Pack data mismatch in file dps_unpack.c at line 95
Any clue as to what I am doing wrong?
thanks,
-Manjunath

_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
