Thanks for your reply, Alejandro. Yes, this is what I am looking for. Can you please tell me where to specify the list of jobs that I want the simulator to schedule?
Thanks,
Akhil

On Mon, Jul 29, 2013 at 3:15 AM, Alejandro Lucero Palau <[email protected]> wrote:

> Hi Akhil,
>
> That's great!
>
> So the reason behind the simulator was to test how the slurm scheduler
> reacts to tuning of scheduling parameters. You can reproduce a workload
> using different parameter values, like fair-share percentage or queue
> priorities. Or maybe you need to know what the impact of a weekly
> reservation would be...
>
> However, I think the simulator is a really useful tool for researchers as
> well. What you are doing is exactly what the simulator is designed for: it
> avoids having to reproduce a scheduling algorithm based on "simple"
> parameters. The more complexity you put into a synthetic scheduler, the
> more sense it makes to use a real scheduler. So the simulator allows you to
> test the current slurm algorithms or to test the one you are working on.
> There are a significant number of details that affect scheduling and are
> hard to implement from scratch. So why not use a real scheduler with years
> of development inside? The simulator does not change how slurm works, so it
> is probably a good tool for testing workloads covering a long period in
> just a few hours. For example, I can take the last 3 months of workload
> from our big machine and reproduce it in the simulator in just a couple of
> hours.
>
> I hope you find it useful.
>
> Best regards
>
>
> On 07/27/2013 11:23 PM, Akhil langer wrote:
>
> Hi Alejandro,
>
> I did the installation from scratch again and it is now working. Thanks
> for your help!
> Can you please answer the other questions in my last reply?
> Thanks again!
>
> Akhil
>
>
> On Fri, Jul 26, 2013 at 1:15 PM, Akhil langer <[email protected]>wrote:
>
>> Hi Alejandro,
>>
>> I want to measure slurm's scheduling throughput and/or, given a set of
>> jobs, see how slurm schedules them.
>> We are writing a simple scheduler (that does not use/require slurm) for
>> our specific problem and want to see how it compares with slurm's
>> scheduling policy. So the use case of the simulator is very simple - given
>> a set of jobs we want to know how (in which order) slurm would execute
>> them. Do you think doing these simulations will be difficult with the
>> simulator? Please share anything that would make these experiments easier,
>> as you might have done this before.
>>
>> I ran reset.sh and I am getting a different problem now. slurmctld is
>> not starting when exec_sim.pl is called. It is again having some issues
>> in changing owner/permissions of a file. The log files are attached.
>>
>>
>> On Fri, Jul 26, 2013 at 12:32 PM, Alejandro Lucero Palau <[email protected]> wrote:
>>
>>> Hi Akhil,
>>>
>>> It is working fine. It seems slurmctld has a job with that jobid from
>>> previous executions.
>>>
>>> Just execute the reset.pl script before exec_sim.pl
>>>
>>> All of this just gives you an easy way to test the simulator, but you
>>> will need to work a bit harder to get something useful from it.
>>>
>>> By the way, what do you have in mind for using the simulator?
>>>
>>> I have not had time lately to work on it but I will as soon as I get a
>>> chance.
>>>
>>> Regards
>>>
>>>
>>> On 07/26/2013 11:41 AM, Akhil langer wrote:
>>>
>>> Alejandro,
>>> Please find attached the log files.
>>>
>>> Thanks,
>>> Akhil
>>>
>>>
>>> On Fri, Jul 26, 2013 at 1:28 AM, Alejandro Lucero Palau <[email protected]> wrote:
>>>
>>>> Hi Akhil,
>>>>
>>>> This should not happen if you have followed the user installation
>>>> instructions.
>>>>
>>>> Please send me the log files, including sim_mgr.log
>>>>
>>>>
>>>>
>>>> On 07/25/2013 07:33 PM, Akhil langer wrote:
>>>>
>>>> Thanks Alejandro,
>>>>
>>>> That solved the problem. Now all the daemons start.
>>>> However, exec_sim.pl gives this error for every job:
>>>> sbatch: error: Batch job submission failed: I/O error writing script/environment to file.
>>>>
>>>> Can you please tell me which file it is trying to write, so I can change
>>>> its permissions?
>>>>
>>>> Thanks,
>>>> Akhil
>>>>
>>>>
>>>> On Thu, Jul 25, 2013 at 5:37 AM, Alejandro Lucero Palau <[email protected]> wrote:
>>>>
>>>>> Hi Akhil,
>>>>>
>>>>> It seems slurmctld cannot contact slurmdbd.
>>>>>
>>>>> exec_sim.pl starts the controller and slurmd, but it expects slurmdbd
>>>>> to be running already.
>>>>>
>>>>> If you have more problems once you start the slurmdbd daemon, I will
>>>>> need the sim_mgr.log file as well.
>>>>>
>>>>> Regards
>>>>>
>>>>> On 07/24/2013 07:42 PM, Akhil langer wrote:
>>>>>
>>>>> Alejandro,
>>>>>
>>>>> I have attached all the logs.
>>>>> I am using Ubuntu 12.04.02 instead of 12.04.01. Also, I am using
>>>>> VirtualBox and not VMware.
>>>>> Thanks for the help!
>>>>>
>>>>>
>>>>> On Wed, Jul 24, 2013 at 10:31 AM, Alejandro Lucero Palau <[email protected]> wrote:
>>>>>
>>>>>> Hi,
>>>>>>
>>>>>> That error should not be the problem.
>>>>>>
>>>>>> Can you send me the full log files?
>>>>>>
>>>>>> Are you using the same distribution and VM as described in the
>>>>>> installation instructions file?
>>>>>>
>>>>>>
>>>>>>
>>>>>> On 07/23/2013 09:25 PM, Akhil langer wrote:
>>>>>>
>>>>>> I followed all the instructions for getting started with the slurm
>>>>>> simulator on a new Ubuntu VM. Everything seems fine. But when I run
>>>>>> ./exec_sim.pl SIM_DIR 100, I get the following errors in the
>>>>>> slurmctld.log file:
>>>>>> slurmctld: error: unable to open pidfile /var/run/slurmctld.pid: Permission denied
>>>>>>
>>>>>> /var/run/* files are readable by users; I am not sure why slurm is
>>>>>> trying to open the file in write mode.
>>>>>> This error goes away if I do sudo ./exec_sim.pl but then other
>>>>>> errors come up such as SlurmUser is not set to root, etc.
>>>>>>
>>>>>> How can I fix this?
>>>>>>
>>>>>>
>>>>>> On Tue, Jul 23, 2013 at 2:10 PM, Akhil langer <[email protected]>wrote:
>>>>>>
>>>>>>> I followed all the instructions for getting started with the slurm
>>>>>>> simulator on a new Ubuntu VM. Everything seems fine. But when I run
>>>>>>> ./exec_sim.pl SIM_DIR 100, I get the following errors in the
>>>>>>> slurmctld.log file:
>>>>>>> slurmctld: error: unable to open pidfile /var/run/slurmctld.pid: Permission denied
>>>>>>>
>>>>>>> /var/run/* files are readable by users; I am not sure why slurm is
>>>>>>> trying to open the file in write mode.
>>>>>>> This error goes away if I do sudo ./exec_sim.pl but then other
>>>>>>> errors come up such as SlurmUser is not set to root, etc.
>>>>>>>
>>>>>>> How can I fix this?
>>>>>>>
>>>>>>
>>>>>>
>>>>>> WARNING / LEGAL TEXT: This message is intended only for the use of
>>>>>> the individual or entity to which it is addressed and may contain
>>>>>> information which is privileged, confidential, proprietary, or exempt
>>>>>> from disclosure under applicable law. If you are not the intended
>>>>>> recipient or the person responsible for delivering the message to the
>>>>>> intended recipient, you are strictly prohibited from disclosing,
>>>>>> distributing, copying, or in any way using this message. If you have
>>>>>> received this communication in error, please notify the sender and
>>>>>> destroy and delete any copies you may have received.
>>>>>>
>>>>>> http://www.bsc.es/disclaimer <http://www.bsc.es/disclaimer.htm>
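
On the pidfile error quoted in the oldest message of this thread: a daemon records its own PID in the pidfile at startup, so it needs write access to that file, or to the containing directory when the file does not exist yet. Read permission on /var/run/* is therefore not enough. The check can be sketched in Python as below. The helper name `can_write_pidfile` is made up for illustration and is not part of slurm or the simulator; the sketch only reports what the current user is allowed to do.

```python
import os
import tempfile


def can_write_pidfile(path):
    """Return True if the current user could create or overwrite `path`.

    Hypothetical helper for illustration only; not part of slurm.
    """
    if os.path.exists(path):
        # Existing file: write permission on the file itself is required.
        return os.access(path, os.W_OK)
    # Missing file: creating it requires write and search permission
    # on the containing directory.
    directory = os.path.dirname(path) or "."
    return os.access(directory, os.W_OK | os.X_OK)


if __name__ == "__main__":
    # Path taken from the error message in the thread.
    print(can_write_pidfile("/var/run/slurmctld.pid"))
    # A pidfile under a directory owned by the current user.
    with tempfile.TemporaryDirectory() as tmp:
        print(can_write_pidfile(os.path.join(tmp, "slurmctld.pid")))
```

If the first check reports that the path is not writable for the user running the simulator, one possible direction is to point `SlurmctldPidFile` in slurm.conf at a directory that user can write to. Whether the simulator's installation instructions already do this is not stated in the thread, so treat that as an assumption to verify.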
