I rebooted the machine and now i have a new error:

[sudo] password for carlos:
error resolving local host: can't resolve host name (h_errno =
HOST_NOT_FOUND)
error resolving local host: can't resolve host name (h_errno =
HOST_NOT_FOUND)

sge_qmaster didn't start!
This is not a qmaster host!
Check your /usr/global/sge-6.2u5-bin//default/common/act_qmaster file!


2011/5/27 Carlos Scaloni <[email protected]>

> I did changes:
>
> If I type hostname the output is: proyecto-192.local
>
> cat /usr/global/sge-6.2u5-bin/default/common/host_aliases
>
> proyecto-192.local   proyecto-10.local
>
> cat /usr/global/sge-6.2u5-bin/default/common/act_qmaster
> proyecto-192.local
>
> cat /etc/hosts
> 127.0.0.1   localhost localhost.localdomain localhost4
> localhost4.localdomain4
> ::1         localhost
> 192.168.56.101 proyecto-192.local
> 10.0.2.15 proyecto-10.local
>
> I have the same problem:
>
>
> starting sge_qmaster
> sge_qmaster start problem
> sge_qmaster didn't start!
>
> But now I don't have the msg of "05/27/2011 01:10:39| main|proyecto|C|abort
> qmaster startup due to communication errors" in /tmp/sge_messages
>
>
>
> 2011/5/27 "Hung-Sheng Tsao (Lao Tsao 老曹) Ph.D." <[email protected]>
>
>>  IMHO, you need to have two different name for two interfaces and pick one
>> as your hostname
>> 192.168.56.101 proyecto-192.local -->hostname
>>  10.0.2.15 proyecto-10.local
>> then create  /usr/global/sge-6.2u5-bin/default/common/host_aliases
>> proyecto-192.local   proyecto-10.local
>> assume act_qmaster is proyecto-192.local
>> restart qmaster and execd
>>
>>
>>
>>
>> On 5/26/2011 6:20 PM, Carlos Scaloni wrote:
>>
>> It's a machine with 2 interfaces!
>>
>> I want to install the qmaster and the execd in the same machine!
>>
>> what do i do? do i change the /etc/hosts file??
>>
>> thanks
>>
>> 2011/5/27 "Hung-Sheng Tsao (Lao Tsao 老曹) Ph.D." <[email protected]>
>>
>>>  you have two IP address
>>>
>>> 192.168.56.101 proyecto.local
>>>  10.0.2.15 proyecto.local
>>>  so which one is proyecto.local
>>>
>>> what is your execd_host's IP, 10. or 192?
>>>
>>>
>>>
>>> On 5/26/2011 5:39 PM, Carlos Scaloni wrote:
>>>
>>> I have a problem with this, because if i write: echo
>>> $SGE_ROOT/$SGE_CELL/act_qmaster the answer is:  /usr/global/sge//act_qmaster
>>>
>>> but the file is there:
>>> /usr/global/sge-6.2u5-bin/default/common/act_qmaster
>>>
>>> It is its content:
>>> cat /usr/global/sge-6.2u5-bin/default/common/act_qmaster
>>> proyecto.local
>>>
>>>  hostname return this: proyecto.local
>>>
>>>
>>>
>>> 2011/5/26 LaoTsao <[email protected]>
>>>
>>>> Do not  think you can assign the same host name for two ip in /etc/hosts
>>>> What is act_qmaster. Content
>>>> What is. hostname return
>>>>
>>>>
>>>>
>>>> Sent from my iPad
>>>> Hung-Sheng Tsao ( LaoTsao) Ph.D
>>>>
>>>> On May 26, 2011, at 5:14 PM, Carlos Scaloni <[email protected]>
>>>> wrote:
>>>>
>>>> >
>>>> > Hi friends
>>>> >
>>>> > I installed the sge6_2u5 but when i try to start the qmaster i see
>>>> this:
>>>> >
>>>> > /etc/init.d/sgemaster.p6444
>>>> >    starting sge_qmaster
>>>> >
>>>> > sge_qmaster start problem
>>>> >
>>>> > sge_qmaster didn't start!
>>>> >
>>>> >
>>>> > and in /tmp/sge_messages.txt :
>>>> >
>>>> > 05/26/2011 20:44:53|  main|proyecto|C|abort qmaster startup due to
>>>> communication errors
>>>> >
>>>> > I don't know what the problem is!
>>>> >
>>>> >
>>>> > I installed it with: sudo ./install_qmaster The installation finished
>>>> without any error!
>>>> > Options that i used: admin user is sgeadmin, sge_qmaster port 6444,
>>>> sge_execd port 6445, classic pooling, gid range 20000-21000
>>>> > the rest options by default!
>>>> >
>>>> > I try to start it with: /etc/init.d/sgemaster.p6444
>>>> >
>>>> > The file /tmp/sge_message contains this:
>>>> > 05/26/2011 20:42:39| main|proyecto|C|abort qmaster startup due to
>>>> communication errors
>>>> > 05/26/2011 20:44:53| main|proyecto|C|abort qmaster startup due to
>>>> communication errors
>>>> >
>>>> > My hostname is:
>>>> > hostname
>>>> > proyecto.local
>>>> >
>>>> > And i have /etc/host so:
>>>> >
>>>> > cat /etc/hosts
>>>> > 127.0.0.1 localhost
>>>> > ::1 localhost
>>>> > 192.168.56.101 proyecto.local
>>>> > 10.0.2.15 proyecto.local
>>>> >
>>>> >
>>>> > Can anyone help me, please??
>>>> >
>>>> > thanks in advance
>>>> >
>>>> >
>>>>   > _______________________________________________
>>>> > users mailing list
>>>> > [email protected]
>>>> > https://gridengine.org/mailman/listinfo/users
>>>>
>>>
>>>
>>>
>>> _______________________________________________
>>> users mailing 
>>> [email protected]https://gridengine.org/mailman/listinfo/users
>>>
>>>
>>> _______________________________________________
>>> users mailing list
>>> [email protected]
>>> https://gridengine.org/mailman/listinfo/users
>>>
>>>
>>
>
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to