Hi Michael.

That is great you were able to find and fix the bugs.

I had some Maui "experts" who were nice enough to trouble shoot the 
issues with me on our Cluster - we made several changes to no avail.    
I created accounts for the experts on our HPC cluster and over several 
weeks period they eventually gave up as they could not figure it out.   
I too posted the issues but got very little traction/help at that time.

For us, using Moab with no other changes, limit changes or anything else 
other modications other than changing the port and "maui.cfg" to 
moab.cfg" and using Moab fixed all of our issues.    For us it was not a 
limit issue or any other obvious change.    We came to the conclusion 
that it was internal bugs with Maui that were not easily fixed.

I will say that GE user's group is tremendously helpful in comparison to 
Maui user's list.   There is MUCH more activity with GE in the order of 
100x more in comparison.

Not trying to persuade or change anyone's opinion, but stating my own 
observations that may help others with similar issues.

Best,
Joseph
https://hpc.oit.uci.edu


On 12/09/2015 01:44 PM, Michel Béland wrote:
> Hi Joseph,
>
>> For whatever it is worth, Maui has some serious bugs when it is in 
>> full use.
>
> We had problems initially when we first used Maui on our big cluster. 
> Most of them were fixed by increasing some limits in the include 
> files. We also had a problem with some idle jobs not running 
> (showstart would show they should run immediately, but they would 
> not). This was fixed by commenting out some code. This was in a patch 
> published on this list many years ago, but it never made it to a release.
>
> The bugs caused by the Torque 5 change in attribute format are really 
> the show stoppers for us, hence my desire to fix them.
>
> The others bugs I can live with, for now.
>
>> I had Maui running for a VERY long time and it would behave 
>> differently when it was mostly idle as when it was under heavy use - 
>> we have thousands of cores.
>>
>> In my frustration I downloaded and enabled "moab" eval and as if by 
>> magic all of the weirdness we were seeing in Maui went away over a 
>> 2-month period.   After two months of use, when I reverted back to 
>> Maui, all of the same weirdness came back.
>>
>> We eventually dropped Maui and went with Son of Grid Engine as Moab 
>> was price prohibited for us.   Grid Engine has been working very well 
>> albeit via several home grown custom modifications.
>
> Good for you, but Torque still needs a free alternative to Moab. 
> pbs_sched is out of the question, unless it is heavily modified to add 
> missing features like backfilling. Maui is the closest approximation 
> to a usable free scheduler for Torque. It would be nice if users 
> helped to fix the bugs instead of giving up, but I understand that 
> users do not necessarly have time, skill or will to do so.
>
>

_______________________________________________
mauiusers mailing list
[email protected]
http://www.supercluster.org/mailman/listinfo/mauiusers

Reply via email to