We have a new cluster in our lab, and it seems to work with some parallel code. But fail's with other codes:
For example, when I try to run this example:
http://www.neuron.yale.edu/neuron/stati ... NetManager
Code: Select all
load_file("stdrun.hoc")
tstop = 1000
load_file("netparmpi.hoc")
objref pnm
ncell = 128
pnm = new ParallelNetManager(ncell)
pnm.round_robin()
for i=0, ncell-1 if (pnm.gid_exists(i)) {
pnm.register_cell(i, new IntFire1())
}
for i=0, ncell-1 {
pnm.nc_append(i, (i+1)%ncell, -1, 1.1, 2)
}
// stimulate
objref stim, ncstim
if (pnm.gid_exists(4)) {
stim = new NetStim(.5)
ncstim = new NetCon(stim, pnm.pc.gid2obj(4))
ncstim.weight = 1.1
ncstim.delay = 0
stim.number=1
stim.start=1
}
pnm.set_maxstep(100)
pnm.want_all_spikes()
stdinit()
runtime = startsw()
print "Till herhe"
pnm.psolve(tstop)
print "Till here2"
runtime = startsw() - runtime
for i=0, pnm.spikevec.size-1 {
print pnm.spikevec.x[i], pnm.idvec.x[i]
}
pnm.pc.runworker
pnm.pc.done
Code: Select all
numprocs=123
NEURON -- Release 7.3 (849:5be3d097b917) 2013-04-11
Duke, Yale, and the BlueBrain Project -- Copyright 1984-2013
See http://www.neuron.yale.edu/neuron/credits
1
1
1
1
1
.
.
.
.
.
Till herhe
Till herhe
Till herhe
nrn_timeout t=2
--------------------------------------------------------------------------
MPI_ABORT was invoked on rank 0 in communicator MPI_COMM_WORLD
with errorcode 0.
NOTE: invoking MPI_ABORT causes Open MPI to kill all MPI processes.
You may or may not see output from other processes, depending on
exactly when Open MPI kills them.
--------------------------------------------------------------------------
--------------------------------------------------------------------------
mpiexec has exited due to process rank 0 with PID 47117 on
node illll-48 exiting without calling "finalize". This may
have caused other processes in the application to be
terminated by signals sent by mpiexec (as reported here).
--------------------------------------------------------------------------
How can I try to diagnose the problem further?
Thank You.