Hi Ethan

We will put the 60 sec delay in our service script is there any  
specifics to how this should be done or just a sleep 60. What type of  
logs, etc should we collect to send back for analysis?

Thanks

Bruce

On Oct 29, 2009, at 2:10 PM, Ethan Quach wrote:

>
>
> Bruce Rothermal wrote:
>> Can someone please address this bug it is holding up very important  
>> mile stones within our project.
>> Thanks
>> Begin forwarded message:
>>> *From: *Bruce Rothermal <Bruce.Rothermal at Sun.COM <mailto:Bruce.Rothermal 
>>> at Sun.COM 
>>> >>
>>> *Date: *October 27, 2009 1:38:41 PM MDT
>>> *To: *caiman-discuss <caiman-discuss at opensolaris.org 
>>> <mailto:caiman-discuss at opensolaris.org 
>>> >>
>>> *Subject: **[caiman-discuss] OpenSolaris Network services startup  
>>> broken*
>>>
>>> Hi All
>>>
>>> We are using OpenSolaris 200906 on V20Z hardware.
>>>
>>> We are developing an HPC grid provisioning system for OpenSolaris  
>>> and are running into problems with system services failing due to  
>>> race conditions.
>>>
>>> The system is using installadm to provide automated install of  
>>> OpenSolaris to systems that run a post install configuration using  
>>> services. On repeated attempts these post install services are  
>>> failing because network services are not yet up. We have attempted  
>>> to set dependencies to make sure that these network services are  
>>> up first before running our service. During this process of  
>>> analyzing what is going wrong we are also seeing that OpenSolaris  
>>> services are failing because there dependencies are not yet  
>>> available. We believe that problems in the installer system  
>>> installadm and dhcp are the cause of the initial delay which then  
>>> snowballs from there.
>
> Which network service do you set your tortuga service to depend on?
>
>
> After an initial install, AI currently only supports setting up the
> system to use NWAM.  There can be a latency from when the nwam
> service comes online before it actually establishes a network.
> This could potentially be what you're running into.
>
> Just to see if this is the case, can you try putting a ~60 second
> sleep at the beginning of the tortuga service's method script?
>
>
> [If this ends up being the case, there are other more workable
> workarounds as options, and further questions/discussions ...]
>
>
> thanks,
> -ethan
>
>
>>>
>>> I've attached the system messages log.
>>>
>>> Our service is the tortuga application mentioned on lines 176 and  
>>> 178.
>>>
>>> You should also notice that on line 175 mDNSResponder fails.
>>>
>>> On line 173 the ethernet interface is just coming up.
>>>
>>> Before the ethernet interface is up, on line 172 routed is  
>>> failing. This should not be running before the interface is  
>>> available in my opinion.
>>>
>>> On line 170 the dhcpagent is failing to bind.
>>>
>>> Anyway the system startup will fail and drop in to maintenance  
>>> mode. Once we have logged into the system via the console. We run  
>>> svcs -xv and see that the dns service and our tortuga service is  
>>> in maintenance mode. I disable the dns and reenable the service  
>>> and it runs fine. The same for our tortuga service and everything  
>>> works from there. The problem is that this is supposed to be an  
>>> automated provisioning system and this all has to work  
>>> automatically without manual intervention.
>>>
>> ------------------------------------------------------------------------
>>>
>>>
>> ------------------------------------------------------------------------
>> ------------------------------------------------------------------------
>>>
>>>
>>> Bruce Rothermal
>>> Email: bruce.rothermal at sun.com <mailto:bruce.rothermal at sun.com>
>>> Skype: bruce.rothermal
>>> Google Talk: bruce.rothermal at gmail.com <mailto:bruce.rothermal at 
>>> gmail.com 
>>> >
>>>
>>>
>>>
>>>
>>> _______________________________________________
>>> caiman-discuss mailing list
>>> caiman-discuss at opensolaris.org <mailto:caiman-discuss at opensolaris.org 
>>> >
>>> http://mail.opensolaris.org/mailman/listinfo/caiman-discuss
>> ------------------------------------------------------------------------
>> ------------------------------------------------------------------------
>> Bruce Rothermal
>> Email: bruce.rothermal at sun.com <mailto:bruce.rothermal at sun.com>
>> Skype: bruce.rothermal Google Talk: bruce.rothermal at gmail.com 
>> <mailto:bruce.rothermal at gmail.com 
>> >
>> ------------------------------------------------------------------------
>> _______________________________________________
>> caiman-discuss mailing list
>> caiman-discuss at opensolaris.org
>> http://mail.opensolaris.org/mailman/listinfo/caiman-discuss

-------------- next part --------------
A non-text attachment was scrubbed...
Name: 1.jpg
Type: image/jpeg
Size: 3120 bytes
Desc: not available
URL: 
<http://mail.opensolaris.org/pipermail/caiman-discuss/attachments/20091102/b033db89/attachment.jpg>
-------------- next part --------------


Bruce Rothermal
Email: bruce.rothermal at sun.com
Skype: bruce.rothermal
Google Talk: bruce.rothermal at gmail.com




Reply via email to