Hi all,
We are trying to get mpb-mpi running on Debian 6.0 64bit.
We installed libctl/h5utils/meep/mpb/meep-openmpi/mpb-mpi from official
Squeeze repo.
When we try to use mpb-mpi on the "sq-rods.ctl" example it ends up with
a segfault.
The position of the segfault seems to depend on the number of processors
we use in mpirun.
The very same example works like a charm with mpb.
This occurs on a stock install where just the packages listed above
where installed.
We have tried on a 32bit Debian and got mixed results: both "mpb" and
"mpirun -np 1 mpb-mpi" works ok, but "mpirun -np 2 mpb-mpi" crashes with
a segfault.
Any one have an idea of what we could do to solve this problem?
Cheers,
Antoine
STEPS TO REPRODUCE:
me@debmeep:~$ cp /usr/share/doc/mpb-doc/examples/*.ctl /tmp
me@debmeep:~$ cd /tmp
me@debmeep:/tmp$ mpb-mpi sq-rods.ctl
or
me@debmeep:/tmp$ mpirun -np2 mpb-mpi sq-rods.ctl
EXAMPLE OF OUTPUT
me@debmeep:/tmp$ mpirun -np 2 mpb-mpi sq-rods.ctl
init-params: initializing eigensolver data
Computing 8 bands with 1.000000e-07 tolerance.
Working in 2 dimensions.
Grid size is 32 x 32 x 1.
Solving for 8 bands at a time.
Creating Maxwell data...
Allocating fields...
Mesh size is 3.
Lattice vectors:
(1, 0, 0)
(0, 1, 0)
(0, 0, 1)
Cell volume = 1
Reciprocal lattice vectors (/ 2 pi):
(1, -0, 0)
(-0, 1, -0)
(0, -0, 1)
Geometric objects:
cylinder, center = (0,0,0)
radius 0.2, height 1e+20, axis (0, 0, 1)
dielectric constant epsilon = 11.56
Geometric object tree has depth 1 and 1 object nodes (vs. 1 actual objects)
Initializing dielectric function...
16 k-points:
(0,0,0)
(0.1,0,0)
(0.2,0,0)
(0.3,0,0)
(0.4,0,0)
(0.5,0,0)
(0.5,0.1,0)
(0.5,0.2,0)
(0.5,0.3,0)
(0.5,0.4,0)
(0.5,0.5,0)
(0.4,0.4,0)
(0.3,0.3,0)
(0.2,0.2,0)
(0.1,0.1,0)
(0,0,0)
Solving for band polarization: te.
Initializing fields to random numbers...
elapsed time for initialization: 0 seconds.
epsilon: 1-11.56, mean 2.30281, harm. mean 1.14695, 14.5508% > 1,
12.3372% "fill"
Outputting epsilon...
solve_kpoint (0,0,0):
tefreqs:, k index, k1, k2, k3, kmag/2pi, te band 1, te band 2, te band
3, te band 4, te band 5, te band 6, te band 7, te band 8
Solving for bands 2 to 8...
[debmeep:23486] *** Process received signal ***
[debmeep:23485] *** Process received signal ***
[debmeep:23485] Signal: Segmentation fault (11)
[debmeep:23485] Signal code: Address not mapped (1)
[debmeep:23485] Failing at address: 0x1b
[debmeep:23485] [ 0] /lib/libpthread.so.0(+0xef60) [0x7f0d148cdf60]
[debmeep:23486] Signal: Segmentation fault (11)
[debmeep:23486] Signal code: Address not mapped (1)
[debmeep:23486] Failing at address: 0x1b
[debmeep:23486] [ 0] /lib/libpthread.so.0(+0xef60) [0x7f285c9e0f60]
[debmeep:23485] [ 1] /usr/lib/libmpi.so.0(MPI_Sendrecv+0xa8)
[0x7f0d155fbb08]
[debmeep:23485] [ 2]
/usr/lib/libfftw_mpi.so.2(transpose_start_exchange_step+0x107)
[0x7f0d170c14d7]
[debmeep:23485] [ 3] /usr/lib/libfftw_mpi.so.2(transpose_mpi+0x354)
[0x7f0d170c1a24]
[debmeep:23485] [ 4] /usr/lib/libfftw_mpi.so.2(fftwnd_mpi+0xcf)
[0x7f0d170c249f]
[debmeep:23485] [ 5] mpb-mpi(maxwell_operator+0xef) [0x42463f]
[debmeep:23485] [ 6] mpb-mpi(eigensolver+0x966) [0x42bb36]
[debmeep:23485] [ 7] mpb-mpi(solve_kpoint+0xbf0) [0x410e20]
[debmeep:23485] [ 8] mpb-mpi(solve_kpoint_aux+0x38) [0x40c1b8]
[debmeep:23486] [ 1] /usr/lib/libmpi.so.0(MPI_Sendrecv+0xa8)
[0x7f285d70eb08]
[debmeep:23485] [ 9] /usr/lib/libguile.so.17(+0x51668) [0x7f0d18416668]
[debmeep:23485] [10] /usr/lib/libguile.so.17(+0x50274) [0x7f0d18415274]
[debmeep:23486] [ 2]
/usr/lib/libfftw_mpi.so.2(transpose_start_exchange_step+0x107)
[0x7f285f1d44d7]
[debmeep:23485] [11] /usr/lib/libguile.so.17(+0x4f7dc) [0x7f0d184147dc]
[debmeep:23486] [ 3] /usr/lib/libfftw_mpi.so.2(transpose_mpi+0x354)
[0x7f285f1d4a24]
[debmeep:23485] [12] /usr/lib/libguile.so.17(scm_eval_body+0xa8)
[0x7f0d1841c768]
[debmeep:23486] [ 4] /usr/lib/libfftw_mpi.so.2(fftwnd_mpi+0xcf)
[0x7f285f1d549f]
[debmeep:23485] [13] /usr/lib/libguile.so.17(scm_map+0x289) [0x7f0d1841d999]
[debmeep:23486] [ 5] mpb-mpi(maxwell_operator+0xef) [0x42463f]
[debmeep:23485] [14] /usr/lib/libguile.so.17(+0x4fe3b) [0x7f0d18414e3b]
[debmeep:23486] [ 6] mpb-mpi(eigensolver+0x966) [0x42bb36]
[debmeep:23485] [15] /usr/lib/libguile.so.17(+0x4f7dc) [0x7f0d184147dc]
[debmeep:23486] [ 7] mpb-mpi(solve_kpoint+0xbf0) [0x410e20]
[debmeep:23485] *** End of error message ***
[debmeep:23486] [ 8] mpb-mpi(solve_kpoint_aux+0x38) [0x40c1b8]
[debmeep:23486] [ 9] /usr/lib/libguile.so.17(+0x51668) [0x7f2860529668]
[debmeep:23486] [10] /usr/lib/libguile.so.17(+0x50274) [0x7f2860528274]
[debmeep:23486] [11] /usr/lib/libguile.so.17(+0x4f7dc) [0x7f28605277dc]
[debmeep:23486] [12] /usr/lib/libguile.so.17(scm_eval_body+0xa8)
[0x7f286052f768]
[debmeep:23486] [13] /usr/lib/libguile.so.17(scm_map+0x289) [0x7f2860530999]
[debmeep:23486] [14] /usr/lib/libguile.so.17(+0x4fe3b) [0x7f2860527e3b]
[debmeep:23486] [15] /usr/lib/libguile.so.17(+0x4f7dc) [0x7f28605277dc]
[debmeep:23486] *** End of error message ***
--------------------------------------------------------------------------
mpirun noticed that process rank 0 with PID 23485 on node debmeep exited
on signal 11 (Segmentation fault).
--------------------------------------------------------------------------
boz@debmeep:/tmp$
_______________________________________________
mpb-discuss mailing list
[email protected]
http://ab-initio.mit.edu/cgi-bin/mailman/listinfo/mpb-discuss