Hi Akhil, That's great!
So the reason behind the simulator was to test how the slurm scheduler reacts to scheduling parameters tuning. You can reproduce a workload using different parameter values like fair sharing percentage or queue priorities. Or maybe you need to know which would be the impact doing a weekly reservation... However, I think the simulator is a really useful tool for researchers as well. What you are doing is exactly what the simulator is designed for: it avoids to reproduce a scheduling algorithm based on "simple" parameters. The more complexity you put in a synthetic scheduler the more sense it makes to use a real scheduler. So the simulator allows you to test the current slurm algorithms or to test the one you are working on. There are an important number of details that make an impact on scheduling which is hard to implement from scratch. So why not to use a real scheduler with years of development inside? The simulator does not change how slurm works so it is probably a good tool for testing workloads covering a long period in just some hours. For example, I can take the last 3 month workload from our big machine and reproduce it in the simulator in just a couple of hours. I hope you find it useful. Best regards On 07/27/2013 11:23 PM, Akhil langer wrote: > Hi Alejandro, > > I did the installation from scratch again and it is now working. > Thanks for your help! > Can you please give the answers to my other questions in my last > reply. Thanks again! > > Akhil > > > On Fri, Jul 26, 2013 at 1:15 PM, Akhil langer <[email protected] > <mailto:[email protected]>> wrote: > > Hi Alajendro, > > I want to measure slurm's scheduling throughput. and/or given a > set of jobs, how does slurm schedule them. We are writing a simple > scheduler (that does not use/require slurm) for our specific > problem and want to see how it compares with slurm's scheduling > policy. So the use case of the simulator is very simple - given a > set of jobs we want to know how (in which order) slurm would > execute them. Do you think doing these simulations will be > difficult with the simulator? Can you please tell any information > if there is anything that will ease these experiments of mine, as > you might have done this before. > > I did reset.sh and I am getting some other problem now. slurmctld > is not starting when exec_sim.pl <http://exec_sim.pl> is called. > It is again having some issues in changing owner/permissions of a > file. log files are attached > > > On Fri, Jul 26, 2013 at 12:32 PM, Alejandro Lucero Palau > <[email protected] <mailto:[email protected]>> wrote: > > Hi Akhil, > > It is working fine. It seems slurmctld has a job with that > jobid from previous executions. > > Just execute reset.pl <http://reset.pl> script before > exec_sim.pl <http://exec_sim.pl> > > All of this just gives you an easy way to test the simulator > but you will need to work a bit harder for getting something > useful from it. > > By the way, what do you have in mind about using the simulator? > > I have not had time lately to work on it but I will as soon as > I get a chance. > > Regards > > > On 07/26/2013 11:41 AM, Akhil langer wrote: >> Alejandro, >> Please find attached the log files. >> >> Thanks, >> Akhil >> >> >> On Fri, Jul 26, 2013 at 1:28 AM, Alejandro Lucero Palau >> <[email protected] <mailto:[email protected]>> wrote: >> >> Hi Akhil, >> >> This should not happen if you have followed instructions >> about user installation. >> >> Please, send me the log files including sim_mgr.log >> >> >> >> On 07/25/2013 07:33 PM, Akhil langer wrote: >>> Thanks Alejandro, >>> >>> That solved the problem. Now all the daemons start. >>> However, exec_pl gives this error for every job: >>> sbatch: error: Batch job submission failed: I/O error >>> writing script/environment to file. >>> >>> Can you please tell which file it is trying to write, I >>> can change its permissions. >>> >>> Thanks, >>> Akhil >>> >>> >>> On Thu, Jul 25, 2013 at 5:37 AM, Alejandro Lucero Palau >>> <[email protected] >>> <mailto:[email protected]>> wrote: >>> >>> Hi Akhil, >>> >>> It seems the slurmctld can not contact with the >>> slurmdbd. >>> >>> exec_sim.pl <http://exec_sim.pl> starts the >>> controller and slurmd but it trust in having >>> slurmdbd working. >>> >>> If you have more problems once you start the >>> slurmdbd daemon, I will need the sim_mgr.log file as >>> well. >>> >>> Regards >>> >>> On 07/24/2013 07:42 PM, Akhil langer wrote: >>>> Alejandro, >>>> >>>> I have attached all the logs. >>>> I am using Ubuntu 12.04.02 instead of 12.04.01. >>>> Also, I am using Virtualbox and not VmWare. >>>> Thanks for the help! >>>> >>>> >>>> On Wed, Jul 24, 2013 at 10:31 AM, Alejandro Lucero >>>> Palau <[email protected] >>>> <mailto:[email protected]>> wrote: >>>> >>>> Hi, >>>> >>>> That error should not be the problem. >>>> >>>> Can you send me the full log files? >>>> >>>> Are you using same distribution and VM as >>>> commented in installation instruction file? >>>> >>>> >>>> >>>> On 07/23/2013 09:25 PM, Akhil langer wrote: >>>>> I followed all the instructions of getting >>>>> started with the slurm simulator on a new >>>>> Ubuntu VM. Everytihng seems fine. But when I >>>>> run ./exec_sim.pl >>>>> <http://exec_sim.pl/> SIM_DIR 100, I get the >>>>> following errors in the slurmctld.log file: >>>>> slurmctld: error: unable to open pidfile >>>>> /var/run/slurmctld.pid: Permission denied >>>>> >>>>> /var/run/* files have read permissions to >>>>> users, I am not sure why is slurm trying to >>>>> open it in write mode. >>>>> This error goes away if I do sudo >>>>> ./exec_sim.pl <http://exec_sim.pl/> but then >>>>> other errors come up such as SlurmUser is not >>>>> set to root, etc.. >>>>> >>>>> How to fix this? >>>>> >>>>> >>>>> On Tue, Jul 23, 2013 at 2:10 PM, Akhil langer >>>>> <[email protected] >>>>> <mailto:[email protected]>> wrote: >>>>> >>>>> I followed all the instructions of getting >>>>> started with the slurm simulator on a new >>>>> Ubuntu VM. Everytihng seems fine. But when >>>>> I run ./exec_sim.pl <http://exec_sim.pl> >>>>> SIM_DIR 100, I get the following errors in >>>>> the slurmctld.log file: >>>>> slurmctld: error: unable to open pidfile >>>>> /var/run/slurmctld.pid: Permission denied >>>>> >>>>> /var/run/* files have read permissions to >>>>> users, I am not sure why is slurm trying >>>>> to open it in write mode. >>>>> This error goes away if I do sudo >>>>> ./exec_sim.pl <http://exec_sim.pl> but >>>>> then other errors come up such as >>>>> SlurmUser is not set to root, etc.. >>>>> >>>>> How to fix this? >>>>> >>>>> >>>> >>>> >>>> >>>> WARNING / LEGAL TEXT: This message is intended >>>> only for the use of the individual or entity to >>>> which it is addressed and may contain >>>> information which is privileged, confidential, >>>> proprietary, or exempt from disclosure under >>>> applicable law. If you are not the intended >>>> recipient or the person responsible for >>>> delivering the message to the intended >>>> recipient, you are strictly prohibited from >>>> disclosing, distributing, copying, or in any >>>> way using this message. If you have received >>>> this communication in error, please notify the >>>> sender and destroy and delete any copies you >>>> may have received. >>>> >>>> http://www.bsc.es/disclaimer >>>> <http://www.bsc.es/disclaimer.htm> >>>> >>>> >>> >>> >>> >>> WARNING / LEGAL TEXT: This message is intended only >>> for the use of the individual or entity to which it >>> is addressed and may contain information which is >>> privileged, confidential, proprietary, or exempt >>> from disclosure under applicable law. If you are not >>> the intended recipient or the person responsible for >>> delivering the message to the intended recipient, >>> you are strictly prohibited from disclosing, >>> distributing, copying, or in any way using this >>> message. If you have received this communication in >>> error, please notify the sender and destroy and >>> delete any copies you may have received. >>> >>> http://www.bsc.es/disclaimer >>> <http://www.bsc.es/disclaimer.htm> >>> >>> >> >> >> >> WARNING / LEGAL TEXT: This message is intended only for >> the use of the individual or entity to which it is >> addressed and may contain information which is >> privileged, confidential, proprietary, or exempt from >> disclosure under applicable law. If you are not the >> intended recipient or the person responsible for >> delivering the message to the intended recipient, you are >> strictly prohibited from disclosing, distributing, >> copying, or in any way using this message. If you have >> received this communication in error, please notify the >> sender and destroy and delete any copies you may have >> received. >> >> http://www.bsc.es/disclaimer >> <http://www.bsc.es/disclaimer.htm> >> >> > > > > WARNING / LEGAL TEXT: This message is intended only for the > use of the individual or entity to which it is addressed and > may contain information which is privileged, confidential, > proprietary, or exempt from disclosure under applicable law. > If you are not the intended recipient or the person > responsible for delivering the message to the intended > recipient, you are strictly prohibited from disclosing, > distributing, copying, or in any way using this message. If > you have received this communication in error, please notify > the sender and destroy and delete any copies you may have > received. > > http://www.bsc.es/disclaimer <http://www.bsc.es/disclaimer.htm> > > > WARNING / LEGAL TEXT: This message is intended only for the use of the individual or entity to which it is addressed and may contain information which is privileged, confidential, proprietary, or exempt from disclosure under applicable law. If you are not the intended recipient or the person responsible for delivering the message to the intended recipient, you are strictly prohibited from disclosing, distributing, copying, or in any way using this message. If you have received this communication in error, please notify the sender and destroy and delete any copies you may have received. http://www.bsc.es/disclaimer
