Hi Alejandro, Can you please tell a sample trace_builder command line arguments (including the accounts, partition, etc), to generate the test.trace file. With the arguments I tried (with only 1 job in q log file), the exec_sim.pldoes not schedule that job and keeps it in the queue. A sample trace_builder command will help (and please tell the corresponding rsv.trace file also). Thanks a lot!
Thanks, Akhil On Tue, Jul 30, 2013 at 1:48 AM, Alejandro Lucero Palau < [email protected]> wrote: > ** > Hi Akhil, > > The simulator gets the jobs from a trace file "test.trace", reservations > from another file, "rsv.trace" and users from "users.sim". If you want your > own jobs, you need to create another test.trace file. The slurm used by the > simulator does not have any limitation regarding jobs requirements, so you > can submit jobs using all the options sbatch supports. However, sim_mgr > and the job trace file format have some limitations so if you need specific > job options like min and max for nodes or cpus, you have to modify the > sim_mgr.c program. I would like to overcome this limitation but I have not > time for working on this by now. You can take a look to the file attached > where another guy did some changes to a basic program for jobs traces > creation. > > Documentation is not available except the simulator paper where design is > explained and the docs you have already read about installation. You better > start getting used to slurm commands like sbatch and slurmctld > configuration parameters. > > > > > On 07/29/2013 11:02 PM, Akhil langer wrote: > > Can you also please tell if I can simulate moldable jobs i.e. they do not > have a fixed number of requested nodes rather they specify the minimum and > maximum number of nodes that the job should run on. If yes, where can I > specify the min and max # of nodes for the job. Is there a simulator > documentation that can answer some of these basic questions of mine. Please > let me know. Thanks! > > > On Mon, Jul 29, 2013 at 9:01 AM, Akhil langer <[email protected]>wrote: > >> Thanks Alejandro for your reply. >> Yes, this is what I am looking for. >> Can you please tell where to specify the list of jobs that I want the >> simulator to schedule? >> >> Thanks, >> Akhil >> >> >> On Mon, Jul 29, 2013 at 3:15 AM, Alejandro Lucero Palau < >> [email protected]> wrote: >> >>> Hi Akhil, >>> >>> That's great! >>> >>> So the reason behind the simulator was to test how the slurm scheduler >>> reacts to scheduling parameters tuning. You can reproduce a workload using >>> different parameter values like fair sharing percentage or queue >>> priorities. Or maybe you need to know which would be the impact doing a >>> weekly reservation... >>> >>> However, I think the simulator is a really useful tool for researchers >>> as well. What you are doing is exactly what the simulator is designed for: >>> it avoids to reproduce a scheduling algorithm based on "simple" parameters. >>> The more complexity you put in a synthetic scheduler the more sense it >>> makes to use a real scheduler. So the simulator allows you to test the >>> current slurm algorithms or to test the one you are working on. There are >>> an important number of details that make an impact on scheduling which is >>> hard to implement from scratch. So why not to use a real scheduler with >>> years of development inside? The simulator does not change how slurm works >>> so it is probably a good tool for testing workloads covering a long period >>> in just some hours. For example, I can take the last 3 month workload from >>> our big machine and reproduce it in the simulator in just a couple of >>> hours. >>> >>> I hope you find it useful. >>> >>> Best regards >>> >>> >>> On 07/27/2013 11:23 PM, Akhil langer wrote: >>> >>> Hi Alejandro, >>> >>> I did the installation from scratch again and it is now working. >>> Thanks for your help! >>> Can you please give the answers to my other questions in my last reply. >>> Thanks again! >>> >>> Akhil >>> >>> >>> On Fri, Jul 26, 2013 at 1:15 PM, Akhil langer <[email protected]>wrote: >>> >>>> Hi Alajendro, >>>> >>>> I want to measure slurm's scheduling throughput. and/or given a set >>>> of jobs, how does slurm schedule them. We are writing a simple scheduler >>>> (that does not use/require slurm) for our specific problem and want to see >>>> how it compares with slurm's scheduling policy. So the use case of the >>>> simulator is very simple - given a set of jobs we want to know how (in >>>> which order) slurm would execute them. Do you think doing these simulations >>>> will be difficult with the simulator? Can you please tell any information >>>> if there is anything that will ease these experiments of mine, as you might >>>> have done this before. >>>> >>>> I did reset.sh and I am getting some other problem now. slurmctld is >>>> not starting when exec_sim.pl is called. It is again having some >>>> issues in changing owner/permissions of a file. log files are attached >>>> >>>> >>>> On Fri, Jul 26, 2013 at 12:32 PM, Alejandro Lucero Palau < >>>> [email protected]> wrote: >>>> >>>>> Hi Akhil, >>>>> >>>>> It is working fine. It seems slurmctld has a job with that jobid from >>>>> previous executions. >>>>> >>>>> Just execute reset.pl script before exec_sim.pl >>>>> >>>>> All of this just gives you an easy way to test the simulator but you >>>>> will need to work a bit harder for getting something useful from it. >>>>> >>>>> By the way, what do you have in mind about using the simulator? >>>>> >>>>> I have not had time lately to work on it but I will as soon as I get a >>>>> chance. >>>>> >>>>> Regards >>>>> >>>>> >>>>> On 07/26/2013 11:41 AM, Akhil langer wrote: >>>>> >>>>> Alejandro, >>>>> Please find attached the log files. >>>>> >>>>> Thanks, >>>>> Akhil >>>>> >>>>> >>>>> On Fri, Jul 26, 2013 at 1:28 AM, Alejandro Lucero Palau < >>>>> [email protected]> wrote: >>>>> >>>>>> Hi Akhil, >>>>>> >>>>>> This should not happen if you have followed instructions about user >>>>>> installation. >>>>>> >>>>>> Please, send me the log files including sim_mgr.log >>>>>> >>>>>> >>>>>> >>>>>> On 07/25/2013 07:33 PM, Akhil langer wrote: >>>>>> >>>>>> Thanks Alejandro, >>>>>> >>>>>> That solved the problem. Now all the daemons start. However, exec_pl >>>>>> gives this error for every job: >>>>>> sbatch: error: Batch job submission failed: I/O error writing >>>>>> script/environment to file. >>>>>> >>>>>> Can you please tell which file it is trying to write, I can change >>>>>> its permissions. >>>>>> >>>>>> Thanks, >>>>>> Akhil >>>>>> >>>>>> >>>>>> On Thu, Jul 25, 2013 at 5:37 AM, Alejandro Lucero Palau < >>>>>> [email protected]> wrote: >>>>>> >>>>>>> Hi Akhil, >>>>>>> >>>>>>> It seems the slurmctld can not contact with the slurmdbd. >>>>>>> >>>>>>> exec_sim.pl starts the controller and slurmd but it trust in having >>>>>>> slurmdbd working. >>>>>>> >>>>>>> If you have more problems once you start the slurmdbd daemon, I will >>>>>>> need the sim_mgr.log file as well. >>>>>>> >>>>>>> Regards >>>>>>> >>>>>>> On 07/24/2013 07:42 PM, Akhil langer wrote: >>>>>>> >>>>>>> Alejandro, >>>>>>> >>>>>>> I have attached all the logs. >>>>>>> I am using Ubuntu 12.04.02 instead of 12.04.01. Also, I am using >>>>>>> Virtualbox and not VmWare. >>>>>>> Thanks for the help! >>>>>>> >>>>>>> >>>>>>> On Wed, Jul 24, 2013 at 10:31 AM, Alejandro Lucero Palau < >>>>>>> [email protected]> wrote: >>>>>>> >>>>>>>> Hi, >>>>>>>> >>>>>>>> That error should not be the problem. >>>>>>>> >>>>>>>> Can you send me the full log files? >>>>>>>> >>>>>>>> Are you using same distribution and VM as commented in installation >>>>>>>> instruction file? >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> On 07/23/2013 09:25 PM, Akhil langer wrote: >>>>>>>> >>>>>>>> I followed all the instructions of getting started with the slurm >>>>>>>> simulator on a new Ubuntu VM. Everytihng seems fine. But when I run ./ >>>>>>>> exec_sim.pl SIM_DIR 100, I get the following errors in the >>>>>>>> slurmctld.log file: >>>>>>>> slurmctld: error: unable to open pidfile /var/run/slurmctld.pid: >>>>>>>> Permission denied >>>>>>>> >>>>>>>> /var/run/* files have read permissions to users, I am not sure >>>>>>>> why is slurm trying to open it in write mode. >>>>>>>> This error goes away if I do sudo ./exec_sim.pl but then other >>>>>>>> errors come up such as SlurmUser is not set to root, etc.. >>>>>>>> >>>>>>>> How to fix this? >>>>>>>> >>>>>>>> >>>>>>>> On Tue, Jul 23, 2013 at 2:10 PM, Akhil langer <[email protected] >>>>>>>> > wrote: >>>>>>>> >>>>>>>>> I followed all the instructions of getting started with the slurm >>>>>>>>> simulator on a new Ubuntu VM. Everytihng seems fine. But when I run ./ >>>>>>>>> exec_sim.pl SIM_DIR 100, I get the following errors in the >>>>>>>>> slurmctld.log file: >>>>>>>>> slurmctld: error: unable to open pidfile /var/run/slurmctld.pid: >>>>>>>>> Permission denied >>>>>>>>> >>>>>>>>> /var/run/* files have read permissions to users, I am not sure >>>>>>>>> why is slurm trying to open it in write mode. >>>>>>>>> This error goes away if I do sudo ./exec_sim.pl but then other >>>>>>>>> errors come up such as SlurmUser is not set to root, etc.. >>>>>>>>> >>>>>>>>> How to fix this? >>>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> WARNING / LEGAL TEXT: This message is intended only for the use of >>>>>>>> the individual or entity to which it is addressed and may contain >>>>>>>> information which is privileged, confidential, proprietary, or exempt >>>>>>>> from >>>>>>>> disclosure under applicable law. If you are not the intended recipient >>>>>>>> or >>>>>>>> the person responsible for delivering the message to the intended >>>>>>>> recipient, you are strictly prohibited from disclosing, distributing, >>>>>>>> copying, or in any way using this message. If you have received this >>>>>>>> communication in error, please notify the sender and destroy and >>>>>>>> delete any >>>>>>>> copies you may have received. >>>>>>>> >>>>>>>> http://www.bsc.es/disclaimer <http://www.bsc.es/disclaimer.htm> >>>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> WARNING / LEGAL TEXT: This message is intended only for the use of >>>>>>> the individual or entity to which it is addressed and may contain >>>>>>> information which is privileged, confidential, proprietary, or exempt >>>>>>> from >>>>>>> disclosure under applicable law. If you are not the intended recipient >>>>>>> or >>>>>>> the person responsible for delivering the message to the intended >>>>>>> recipient, you are strictly prohibited from disclosing, distributing, >>>>>>> copying, or in any way using this message. If you have received this >>>>>>> communication in error, please notify the sender and destroy and delete >>>>>>> any >>>>>>> copies you may have received. >>>>>>> >>>>>>> http://www.bsc.es/disclaimer <http://www.bsc.es/disclaimer.htm> >>>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> WARNING / LEGAL TEXT: This message is intended only for the use of >>>>>> the individual or entity to which it is addressed and may contain >>>>>> information which is privileged, confidential, proprietary, or exempt >>>>>> from >>>>>> disclosure under applicable law. If you are not the intended recipient or >>>>>> the person responsible for delivering the message to the intended >>>>>> recipient, you are strictly prohibited from disclosing, distributing, >>>>>> copying, or in any way using this message. If you have received this >>>>>> communication in error, please notify the sender and destroy and delete >>>>>> any >>>>>> copies you may have received. >>>>>> >>>>>> http://www.bsc.es/disclaimer <http://www.bsc.es/disclaimer.htm> >>>>>> >>>>> >>>>> >>>>> >>>>> >>>>> WARNING / LEGAL TEXT: This message is intended only for the use of the >>>>> individual or entity to which it is addressed and may contain information >>>>> which is privileged, confidential, proprietary, or exempt from disclosure >>>>> under applicable law. If you are not the intended recipient or the person >>>>> responsible for delivering the message to the intended recipient, you are >>>>> strictly prohibited from disclosing, distributing, copying, or in any way >>>>> using this message. If you have received this communication in error, >>>>> please notify the sender and destroy and delete any copies you may have >>>>> received. >>>>> >>>>> http://www.bsc.es/disclaimer <http://www.bsc.es/disclaimer.htm> >>>>> >>>> >>>> >>> >>> >>> >>> WARNING / LEGAL TEXT: This message is intended only for the use of the >>> individual or entity to which it is addressed and may contain information >>> which is privileged, confidential, proprietary, or exempt from disclosure >>> under applicable law. If you are not the intended recipient or the person >>> responsible for delivering the message to the intended recipient, you are >>> strictly prohibited from disclosing, distributing, copying, or in any way >>> using this message. If you have received this communication in error, >>> please notify the sender and destroy and delete any copies you may have >>> received. >>> >>> http://www.bsc.es/disclaimer <http://www.bsc.es/disclaimer.htm> >>> >> >> > > > > WARNING / LEGAL TEXT: This message is intended only for the use of the > individual or entity to which it is addressed and may contain information > which is privileged, confidential, proprietary, or exempt from disclosure > under applicable law. If you are not the intended recipient or the person > responsible for delivering the message to the intended recipient, you are > strictly prohibited from disclosing, distributing, copying, or in any way > using this message. If you have received this communication in error, > please notify the sender and destroy and delete any copies you may have > received. > > http://www.bsc.es/disclaimer <http://www.bsc.es/disclaimer.htm> >
