Can you also please tell if I can simulate moldable jobs i.e. they do not have a fixed number of requested nodes rather they specify the minimum and maximum number of nodes that the job should run on. If yes, where can I specify the min and max # of nodes for the job. Is there a simulator documentation that can answer some of these basic questions of mine. Please let me know. Thanks!
On Mon, Jul 29, 2013 at 9:01 AM, Akhil langer <[email protected]> wrote: > Thanks Alejandro for your reply. > Yes, this is what I am looking for. > Can you please tell where to specify the list of jobs that I want the > simulator to schedule? > > Thanks, > Akhil > > > On Mon, Jul 29, 2013 at 3:15 AM, Alejandro Lucero Palau < > [email protected]> wrote: > >> ** >> Hi Akhil, >> >> That's great! >> >> So the reason behind the simulator was to test how the slurm scheduler >> reacts to scheduling parameters tuning. You can reproduce a workload using >> different parameter values like fair sharing percentage or queue >> priorities. Or maybe you need to know which would be the impact doing a >> weekly reservation... >> >> However, I think the simulator is a really useful tool for researchers as >> well. What you are doing is exactly what the simulator is designed for: it >> avoids to reproduce a scheduling algorithm based on "simple" parameters. >> The more complexity you put in a synthetic scheduler the more sense it >> makes to use a real scheduler. So the simulator allows you to test the >> current slurm algorithms or to test the one you are working on. There are >> an important number of details that make an impact on scheduling which is >> hard to implement from scratch. So why not to use a real scheduler with >> years of development inside? The simulator does not change how slurm works >> so it is probably a good tool for testing workloads covering a long period >> in just some hours. For example, I can take the last 3 month workload from >> our big machine and reproduce it in the simulator in just a couple of >> hours. >> >> I hope you find it useful. >> >> Best regards >> >> >> On 07/27/2013 11:23 PM, Akhil langer wrote: >> >> Hi Alejandro, >> >> I did the installation from scratch again and it is now working. Thanks >> for your help! >> Can you please give the answers to my other questions in my last reply. >> Thanks again! >> >> Akhil >> >> >> On Fri, Jul 26, 2013 at 1:15 PM, Akhil langer <[email protected]>wrote: >> >>> Hi Alajendro, >>> >>> I want to measure slurm's scheduling throughput. and/or given a set of >>> jobs, how does slurm schedule them. We are writing a simple scheduler (that >>> does not use/require slurm) for our specific problem and want to see how it >>> compares with slurm's scheduling policy. So the use case of the simulator >>> is very simple - given a set of jobs we want to know how (in which order) >>> slurm would execute them. Do you think doing these simulations will be >>> difficult with the simulator? Can you please tell any information if there >>> is anything that will ease these experiments of mine, as you might have >>> done this before. >>> >>> I did reset.sh and I am getting some other problem now. slurmctld is >>> not starting when exec_sim.pl is called. It is again having some issues >>> in changing owner/permissions of a file. log files are attached >>> >>> >>> On Fri, Jul 26, 2013 at 12:32 PM, Alejandro Lucero Palau < >>> [email protected]> wrote: >>> >>>> Hi Akhil, >>>> >>>> It is working fine. It seems slurmctld has a job with that jobid from >>>> previous executions. >>>> >>>> Just execute reset.pl script before exec_sim.pl >>>> >>>> All of this just gives you an easy way to test the simulator but you >>>> will need to work a bit harder for getting something useful from it. >>>> >>>> By the way, what do you have in mind about using the simulator? >>>> >>>> I have not had time lately to work on it but I will as soon as I get a >>>> chance. >>>> >>>> Regards >>>> >>>> >>>> On 07/26/2013 11:41 AM, Akhil langer wrote: >>>> >>>> Alejandro, >>>> Please find attached the log files. >>>> >>>> Thanks, >>>> Akhil >>>> >>>> >>>> On Fri, Jul 26, 2013 at 1:28 AM, Alejandro Lucero Palau < >>>> [email protected]> wrote: >>>> >>>>> Hi Akhil, >>>>> >>>>> This should not happen if you have followed instructions about user >>>>> installation. >>>>> >>>>> Please, send me the log files including sim_mgr.log >>>>> >>>>> >>>>> >>>>> On 07/25/2013 07:33 PM, Akhil langer wrote: >>>>> >>>>> Thanks Alejandro, >>>>> >>>>> That solved the problem. Now all the daemons start. However, exec_pl >>>>> gives this error for every job: >>>>> sbatch: error: Batch job submission failed: I/O error writing >>>>> script/environment to file. >>>>> >>>>> Can you please tell which file it is trying to write, I can change >>>>> its permissions. >>>>> >>>>> Thanks, >>>>> Akhil >>>>> >>>>> >>>>> On Thu, Jul 25, 2013 at 5:37 AM, Alejandro Lucero Palau < >>>>> [email protected]> wrote: >>>>> >>>>>> Hi Akhil, >>>>>> >>>>>> It seems the slurmctld can not contact with the slurmdbd. >>>>>> >>>>>> exec_sim.pl starts the controller and slurmd but it trust in having >>>>>> slurmdbd working. >>>>>> >>>>>> If you have more problems once you start the slurmdbd daemon, I will >>>>>> need the sim_mgr.log file as well. >>>>>> >>>>>> Regards >>>>>> >>>>>> On 07/24/2013 07:42 PM, Akhil langer wrote: >>>>>> >>>>>> Alejandro, >>>>>> >>>>>> I have attached all the logs. >>>>>> I am using Ubuntu 12.04.02 instead of 12.04.01. Also, I am using >>>>>> Virtualbox and not VmWare. >>>>>> Thanks for the help! >>>>>> >>>>>> >>>>>> On Wed, Jul 24, 2013 at 10:31 AM, Alejandro Lucero Palau < >>>>>> [email protected]> wrote: >>>>>> >>>>>>> Hi, >>>>>>> >>>>>>> That error should not be the problem. >>>>>>> >>>>>>> Can you send me the full log files? >>>>>>> >>>>>>> Are you using same distribution and VM as commented in installation >>>>>>> instruction file? >>>>>>> >>>>>>> >>>>>>> >>>>>>> On 07/23/2013 09:25 PM, Akhil langer wrote: >>>>>>> >>>>>>> I followed all the instructions of getting started with the slurm >>>>>>> simulator on a new Ubuntu VM. Everytihng seems fine. But when I run ./ >>>>>>> exec_sim.pl SIM_DIR 100, I get the following errors in the >>>>>>> slurmctld.log file: >>>>>>> slurmctld: error: unable to open pidfile /var/run/slurmctld.pid: >>>>>>> Permission denied >>>>>>> >>>>>>> /var/run/* files have read permissions to users, I am not sure why >>>>>>> is slurm trying to open it in write mode. >>>>>>> This error goes away if I do sudo ./exec_sim.pl but then other >>>>>>> errors come up such as SlurmUser is not set to root, etc.. >>>>>>> >>>>>>> How to fix this? >>>>>>> >>>>>>> >>>>>>> On Tue, Jul 23, 2013 at 2:10 PM, Akhil langer >>>>>>> <[email protected]>wrote: >>>>>>> >>>>>>>> I followed all the instructions of getting started with the slurm >>>>>>>> simulator on a new Ubuntu VM. Everytihng seems fine. But when I run ./ >>>>>>>> exec_sim.pl SIM_DIR 100, I get the following errors in the >>>>>>>> slurmctld.log file: >>>>>>>> slurmctld: error: unable to open pidfile /var/run/slurmctld.pid: >>>>>>>> Permission denied >>>>>>>> >>>>>>>> /var/run/* files have read permissions to users, I am not sure >>>>>>>> why is slurm trying to open it in write mode. >>>>>>>> This error goes away if I do sudo ./exec_sim.pl but then other >>>>>>>> errors come up such as SlurmUser is not set to root, etc.. >>>>>>>> >>>>>>>> How to fix this? >>>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> WARNING / LEGAL TEXT: This message is intended only for the use of >>>>>>> the individual or entity to which it is addressed and may contain >>>>>>> information which is privileged, confidential, proprietary, or exempt >>>>>>> from >>>>>>> disclosure under applicable law. If you are not the intended recipient >>>>>>> or >>>>>>> the person responsible for delivering the message to the intended >>>>>>> recipient, you are strictly prohibited from disclosing, distributing, >>>>>>> copying, or in any way using this message. If you have received this >>>>>>> communication in error, please notify the sender and destroy and delete >>>>>>> any >>>>>>> copies you may have received. >>>>>>> >>>>>>> http://www.bsc.es/disclaimer <http://www.bsc.es/disclaimer.htm> >>>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> WARNING / LEGAL TEXT: This message is intended only for the use of >>>>>> the individual or entity to which it is addressed and may contain >>>>>> information which is privileged, confidential, proprietary, or exempt >>>>>> from >>>>>> disclosure under applicable law. If you are not the intended recipient or >>>>>> the person responsible for delivering the message to the intended >>>>>> recipient, you are strictly prohibited from disclosing, distributing, >>>>>> copying, or in any way using this message. If you have received this >>>>>> communication in error, please notify the sender and destroy and delete >>>>>> any >>>>>> copies you may have received. >>>>>> >>>>>> http://www.bsc.es/disclaimer <http://www.bsc.es/disclaimer.htm> >>>>>> >>>>> >>>>> >>>>> >>>>> >>>>> WARNING / LEGAL TEXT: This message is intended only for the use of the >>>>> individual or entity to which it is addressed and may contain information >>>>> which is privileged, confidential, proprietary, or exempt from disclosure >>>>> under applicable law. If you are not the intended recipient or the person >>>>> responsible for delivering the message to the intended recipient, you are >>>>> strictly prohibited from disclosing, distributing, copying, or in any way >>>>> using this message. If you have received this communication in error, >>>>> please notify the sender and destroy and delete any copies you may have >>>>> received. >>>>> >>>>> http://www.bsc.es/disclaimer <http://www.bsc.es/disclaimer.htm> >>>>> >>>> >>>> >>>> >>>> >>>> WARNING / LEGAL TEXT: This message is intended only for the use of the >>>> individual or entity to which it is addressed and may contain information >>>> which is privileged, confidential, proprietary, or exempt from disclosure >>>> under applicable law. If you are not the intended recipient or the person >>>> responsible for delivering the message to the intended recipient, you are >>>> strictly prohibited from disclosing, distributing, copying, or in any way >>>> using this message. If you have received this communication in error, >>>> please notify the sender and destroy and delete any copies you may have >>>> received. >>>> >>>> http://www.bsc.es/disclaimer <http://www.bsc.es/disclaimer.htm> >>>> >>> >>> >> >> >> >> WARNING / LEGAL TEXT: This message is intended only for the use of the >> individual or entity to which it is addressed and may contain information >> which is privileged, confidential, proprietary, or exempt from disclosure >> under applicable law. If you are not the intended recipient or the person >> responsible for delivering the message to the intended recipient, you are >> strictly prohibited from disclosing, distributing, copying, or in any way >> using this message. If you have received this communication in error, >> please notify the sender and destroy and delete any copies you may have >> received. >> >> http://www.bsc.es/disclaimer <http://www.bsc.es/disclaimer.htm> >> > >
