Hi Ethan We will put the 60 sec delay in our service script is there any specifics to how this should be done or just a sleep 60. What type of logs, etc should we collect to send back for analysis?
Thanks Bruce On Oct 29, 2009, at 2:10 PM, Ethan Quach wrote: > > > Bruce Rothermal wrote: >> Can someone please address this bug it is holding up very important >> mile stones within our project. >> Thanks >> Begin forwarded message: >>> *From: *Bruce Rothermal <Bruce.Rothermal at Sun.COM <mailto:Bruce.Rothermal >>> at Sun.COM >>> >> >>> *Date: *October 27, 2009 1:38:41 PM MDT >>> *To: *caiman-discuss <caiman-discuss at opensolaris.org >>> <mailto:caiman-discuss at opensolaris.org >>> >> >>> *Subject: **[caiman-discuss] OpenSolaris Network services startup >>> broken* >>> >>> Hi All >>> >>> We are using OpenSolaris 200906 on V20Z hardware. >>> >>> We are developing an HPC grid provisioning system for OpenSolaris >>> and are running into problems with system services failing due to >>> race conditions. >>> >>> The system is using installadm to provide automated install of >>> OpenSolaris to systems that run a post install configuration using >>> services. On repeated attempts these post install services are >>> failing because network services are not yet up. We have attempted >>> to set dependencies to make sure that these network services are >>> up first before running our service. During this process of >>> analyzing what is going wrong we are also seeing that OpenSolaris >>> services are failing because there dependencies are not yet >>> available. We believe that problems in the installer system >>> installadm and dhcp are the cause of the initial delay which then >>> snowballs from there. > > Which network service do you set your tortuga service to depend on? > > > After an initial install, AI currently only supports setting up the > system to use NWAM. There can be a latency from when the nwam > service comes online before it actually establishes a network. > This could potentially be what you're running into. > > Just to see if this is the case, can you try putting a ~60 second > sleep at the beginning of the tortuga service's method script? > > > [If this ends up being the case, there are other more workable > workarounds as options, and further questions/discussions ...] > > > thanks, > -ethan > > >>> >>> I've attached the system messages log. >>> >>> Our service is the tortuga application mentioned on lines 176 and >>> 178. >>> >>> You should also notice that on line 175 mDNSResponder fails. >>> >>> On line 173 the ethernet interface is just coming up. >>> >>> Before the ethernet interface is up, on line 172 routed is >>> failing. This should not be running before the interface is >>> available in my opinion. >>> >>> On line 170 the dhcpagent is failing to bind. >>> >>> Anyway the system startup will fail and drop in to maintenance >>> mode. Once we have logged into the system via the console. We run >>> svcs -xv and see that the dns service and our tortuga service is >>> in maintenance mode. I disable the dns and reenable the service >>> and it runs fine. The same for our tortuga service and everything >>> works from there. The problem is that this is supposed to be an >>> automated provisioning system and this all has to work >>> automatically without manual intervention. >>> >> ------------------------------------------------------------------------ >>> >>> >> ------------------------------------------------------------------------ >> ------------------------------------------------------------------------ >>> >>> >>> Bruce Rothermal >>> Email: bruce.rothermal at sun.com <mailto:bruce.rothermal at sun.com> >>> Skype: bruce.rothermal >>> Google Talk: bruce.rothermal at gmail.com <mailto:bruce.rothermal at >>> gmail.com >>> > >>> >>> >>> >>> >>> _______________________________________________ >>> caiman-discuss mailing list >>> caiman-discuss at opensolaris.org <mailto:caiman-discuss at opensolaris.org >>> > >>> http://mail.opensolaris.org/mailman/listinfo/caiman-discuss >> ------------------------------------------------------------------------ >> ------------------------------------------------------------------------ >> Bruce Rothermal >> Email: bruce.rothermal at sun.com <mailto:bruce.rothermal at sun.com> >> Skype: bruce.rothermal Google Talk: bruce.rothermal at gmail.com >> <mailto:bruce.rothermal at gmail.com >> > >> ------------------------------------------------------------------------ >> _______________________________________________ >> caiman-discuss mailing list >> caiman-discuss at opensolaris.org >> http://mail.opensolaris.org/mailman/listinfo/caiman-discuss -------------- next part -------------- A non-text attachment was scrubbed... Name: 1.jpg Type: image/jpeg Size: 3120 bytes Desc: not available URL: <http://mail.opensolaris.org/pipermail/caiman-discuss/attachments/20091102/b033db89/attachment.jpg> -------------- next part -------------- Bruce Rothermal Email: bruce.rothermal at sun.com Skype: bruce.rothermal Google Talk: bruce.rothermal at gmail.com