On Sat, Mar 19, 2011 at 4:23 PM, Jesse Becker <[email protected]> wrote:
>> We will need a better fix for OSX, but as usual I would recommend using
>> Linux as it is better supported in the HPC world.
>
> Actally, this problem can affect Linux systems as well, especially with
> regards to NFS mounts.  It's an ugly problem, and one that isn't easy to
> fix.

Hi Jesse,

Modern Linux kernels should have enough supplementary group IDs
available, can you describe the problem in detail??

Thanks,
Rayson



>
>>
>> -Ron
>>
>>
>>
>>>
>>> Thanks
>>> Stephen
>>>
>>> ############################################################################
>>> # Stephen Dennis : Senior Sales Engineer
>>> # Univa Corporation:  http://univa.com
>>> # [email protected]
>>> : 310 310 0738 : skype stephendennis.com
>>>
>>> ############################################################################
>>> ________________________________________
>>> From: [email protected]
>>> [[email protected]]
>>> On Behalf Of Barry McInnes [[email protected]]
>>> Sent: Friday, March 18, 2011 11:31 AM
>>> To: [email protected]
>>> Subject: [gridengine users] ge62u5 mac 10.6 too many group
>>> ids
>>>
>>> Hi,
>>> When running gmaster on 10.5 we get user submit errors when
>>> they are in
>>> too many groups, so the job fails. SOme users in less
>>> groups (6-8) can
>>> run jobs eg the first user cannot submit the second user
>>> can
>>> [mac27:~/SGE] bmcinnes% id bmcinnes
>>> uid=2101(bmcinnes) gid=200(climate)
>>> groups=200(climate),1953027852(PSD\sysadmins),829578209(PSD\domain
>>>
>>> admins),801476512(PSD\log1),204(_developer),100(_lpoperator),98(_lpadmin),81(_appserveradm),80(admin),79(_appserverusr),62(netaccounts),12(everyone),1207(rain),1100(systems),998(lmadmin),900(sawrtrs),400(cuac),2109053379(PSD\domain
>>> users),1858905114(PSD\denied rodc password replication
>>>
>>> group),1358185131(PSD\it_wikis),404(com.apple.sharepoint.group.3),928177777(PSD\coopcall),401(com.apple.access_screensharing),403(com.apple.sharepoint.group.2),402(com.apple.sharepoint.group.1)
>>> [mac27:~/SGE] bmcinnes%
>>> [mac27:~/SGE] bmcinnes%
>>> [mac27:~/SGE] bmcinnes% id ppegion
>>> uid=3009(ppegion) gid=200(climate)
>>>
>>> groups=200(climate),62(netaccounts),12(everyone),594189391(PSD\climate),247203070(PSD\psd1group),2109053379(PSD\domain
>>>
>>> users),404(com.apple.sharepoint.group.3),928177777(PSD\coopcall),403(com.apple.sharepoint.group.2),402(com.apple.sharepoint.group.1)
>>> [mac27:~/SGE] bmcinnes%
>>>
>>> The Mac OS is adding groups membership to users, as well as
>>> our group
>>> settings.
>>>
>>> When we go to Mac 10.6 Intel, the qmaster server fails to
>>> put any nodes
>>> in service, due to the same error, so users have no chance
>>> to even
>>> submit jobs
>>>
>>> 03/16/2011 13:41:49|worker|g5s2|W|rescheduling job 15015.1
>>> 03/16/2011 13:41:49|worker|g5s2|E|queue quad marked QERROR
>>> as result of
>>> ob 15015's failure at host mac40.psd.esrl.noaa.gov
>>> 03/16/2011 14:02:49|worker|g5s2|W|job 15015.1 failed on
>>> host
>>> mac65.psd.esrl.noaa.gov general before job because:
>>> 03/16/2011 14:02:49
>>> [0:22624]: can't set additional group id (uid=0, euid=0):
>>> the user
>>> already has too many group ids
>>> 03/16/2011 14:02:49|worker|g5s2|W|rescheduling job 15015.1
>>> 03/16/2011 14:02:49|worker|g5s2|E|queue quad marked QERROR
>>> as result of
>>> job 15015's failure at host mac65.psd.esrl.noaa.gov
>>> 03/16/2011 14:08:19|worker|g5s2|W|job 15015.1 failed on
>>> host
>>> mac18.psd.esrl.noaa.gov general before job because:
>>> 03/16/2011 14:08:19
>>> [0:42391]: can't set additional group id (uid=0, euid=0):
>>> the user
>>> already has too many group ids
>>>
>>> We are using Active Directory authentication, and the Mac
>>> clients are
>>> all 10.6.6.
>>> We tried OGE 62u7 with the same group id error.
>>>
>>> We are currently back at 10.5 PPC qmaster server to get
>>> jobs submitted
>>> and run.
>>>
>>> Any help appreciated.
>>> _______________________________________________
>>> users mailing list
>>> [email protected]
>>> https://gridengine.org/mailman/listinfo/users
>>>
>>>
>>> ---------------------------------------------------------------------
>>>
>>>
>>> Notice from Univa Postmaster:
>>>
>>>
>>> This email message is for the sole use of the intended
>>> recipient(s) and may contain confidential and privileged
>>> information. Any unauthorized review, use, disclosure or
>>> distribution is prohibited. If you are not the intended
>>> recipient, please contact the sender by reply email and
>>> destroy all copies of the original message. This message has
>>> been content scanned by the Univa Mail system.
>>>
>>>
>>>
>>> ---------------------------------------------------------------------
>>>
>>>
>>> _______________________________________________
>>> users mailing list
>>> [email protected]
>>> https://gridengine.org/mailman/listinfo/users
>>>
>>
>>
>>
>>
>> _______________________________________________
>> users mailing list
>> [email protected]
>> https://gridengine.org/mailman/listinfo/users
>
> --
> Jesse Becker
> NHGRI Linux support (Digicon Contractor)
> _______________________________________________
> users mailing list
> [email protected]
> https://gridengine.org/mailman/listinfo/users
>

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to