Could you run it is gdb and get the backtrace? gdb slurmctld (gdb) run -Dvv ...crash... (gdb) bt full
That might give us something. Danny > -----Original Message----- > From: [email protected] [mailto:[email protected]] > On Behalf Of Tyler > Strickland > Sent: Friday, May 13, 2011 11:33 AM > To: [email protected] > Subject: Re: [slurm-dev] slurmctld not starting on OSX 10.5 > > At the risk (OK, guarantee) of showing my ignorance, how might I go > about doing that? One of the past list posts said to run 'ulimit -c > unlimited' followed by slurmctld -D, after which the core dump would be > placed in the current directory (/tmp). Unfortunately, nothing is to be > found in the folder after the crash. > > Thanks, > Tyler > > > > On 05/13/2011 02:14 PM, Jette, Moe wrote: > > If you can get a core file on SIGBUS and generate a backtrace, that may > > help. > > ________________________________________ > > From: [email protected] [[email protected]] On > > Behalf Of Tyler Strickland > [[email protected]] > > Sent: Friday, May 13, 2011 10:42 AM > > To: [email protected] > > Subject: [slurm-dev] slurmctld not starting on OSX 10.5 > > > > All, > > > > After the fun with getting SLURM compiled light night, I've finally > > succeeded at getting it installed. slurmd starts up fine but slurmctld > > doesn't - and there are no errors indicating why. When I try to run it > > with -D the words "Bus Error" are printed and the log appearing much > > line the one below. > > > > The logfile for "slurmd -cvvvvvvvvv" > > > > Thanks, > > Tyler > > > > [2011-05-13T13:39:29] pidfile not locked, assuming no running daemon > > [2011-05-13T13:39:29] debug: sched: slurmctld starting > > [2011-05-13T13:39:29] debug3: Trying to load plugin > > /usr/local/lib/slurm/accounting_storage_none.so > > [2011-05-13T13:39:29] Accounting storage NOT INVOKED plugin loaded > > [2011-05-13T13:39:29] debug3: Success. > > [2011-05-13T13:39:29] debug3: not enforcing associations and no list was > > given so we are giving a blank list > > [2011-05-13T13:39:29] debug2: No Assoc usage file > > (/var/lib/slurm/slurmctld/assoc_usage) to recover > > [2011-05-13T13:39:29] slurmctld version 2.2.5 started on cluster cluster > > [2011-05-13T13:39:29] debug3: Trying to load plugin > > /usr/local/lib/slurm/crypto_munge.so > > [2011-05-13T13:39:29] Munge cryptographic signature plugin loaded > > [2011-05-13T13:39:29] debug3: Success. > > [2011-05-13T13:39:29] debug3: Trying to load plugin > > /usr/local/lib/slurm/select_cons_res.so > > [2011-05-13T13:39:29] debug3: Success. > > [2011-05-13T13:39:29] debug3: Trying to load plugin > > /usr/local/lib/slurm/preempt_none.so > > [2011-05-13T13:39:29] preempt/none loaded > > [2011-05-13T13:39:29] debug3: Success. > > [2011-05-13T13:39:29] debug3: Trying to load plugin > > /usr/local/lib/slurm/checkpoint_none.so > > [2011-05-13T13:39:29] debug3: Success. > > [2011-05-13T13:39:29] Checkpoint plugin loaded: checkpoint/none > > [2011-05-13T13:39:29] debug3: Trying to load plugin > > /usr/local/lib/slurm/jobacct_gather_none.so > > [2011-05-13T13:39:29] Job accounting gather NOT_INVOKED plugin loaded > > [2011-05-13T13:39:29] debug3: Success. > > [2011-05-13T13:39:29] debug: No backup controller to shutdown > > [2011-05-13T13:39:29] debug3: Trying to load plugin > > /usr/local/lib/slurm/switch_none.so > > [2011-05-13T13:39:29] switch NONE plugin loaded > > [2011-05-13T13:39:29] debug3: Success. > > [2011-05-13T13:39:29] debug3: Trying to load plugin > > /usr/local/lib/slurm/topology_none.so > > [2011-05-13T13:39:29] topology NONE plugin loaded > > [2011-05-13T13:39:29] debug3: Success. > > [2011-05-13T13:39:29] debug: No DownNodes > > [2011-05-13T13:39:29] debug3: Trying to load plugin > > /usr/local/lib/slurm/jobcomp_none.so > > [2011-05-13T13:39:29] debug3: Success. > > [2011-05-13T13:39:29] debug3: Trying to load plugin > > /usr/local/lib/slurm/sched_backfill.so > > [2011-05-13T13:39:29] sched: Backfill scheduler plugin loaded > > [2011-05-13T13:39:29] debug3: Success. > > [2011-05-13T13:39:29] debug: No job state file > > (/var/lib/slurm/slurmctld/job_state) to recover > > [2011-05-13T13:39:29] cons_res: select_p_node_init > >
