I'm not sure that cluster fscks count as "special experience." ;)
- WJR On Fri, Feb 18, 2011 at 13:58, Jonathan Link <[email protected]>wrote: > Agreed, and I knew what you said. :-) > I'm only speculating based on the time frame of the problem, as I'll fully > admit I'm out of my depth of experience on clustered products. I know you > have special experience with clusters, so I bow out... > > On Fri, Feb 18, 2011 at 2:52 PM, William Robbins <[email protected]>wrote: > >> Crap...would *not* affect both nodes at the same time. >> >> - WJR >> >> >> >> On Fri, Feb 18, 2011 at 13:50, William Robbins <[email protected]>wrote: >> >>> Not underestimating the power of the luser variable...but I would expect >>> that would affect both nodes at the same time. >>> >>> - WJR >>> >>> >>> On Fri, Feb 18, 2011 at 13:43, Jonathan Link >>> <[email protected]>wrote: >>> >>>> Pure speculation, but the time frame to me screams: >>>> User runs a manual query that in their experience takes a long time to >>>> process (they don't know why) so they set it to start as they leave for the >>>> day, and then take action on the results the next day... >>>> >>>> >>>> On Fri, Feb 18, 2011 at 8:48 AM, Ziots, Edward <[email protected]>wrote: >>>> >>>>> I have a two node X64bit Windows 2003 SP2 enterprise edition cluster >>>>> running SQL 2005 Standard Edition 64bit. >>>>> >>>>> >>>>> >>>>> What I am seeing is event ID’s 1123, 1124 in the event logs on each >>>>> Cluster Node, and we are getting complaints of disconnects from the >>>>> database. >>>>> >>>>> >>>>> >>>>> We are seeing it happen around 5:50-6:00pm each night. ( shows in the >>>>> cluster log and we seen it via pings) >>>>> >>>>> >>>>> >>>>> 1) We have eliminated the backup of the server, which happens at >>>>> 3:30am in the morning ( via Legato) >>>>> >>>>> 2) I have gone through with Microsoft Support the entire KB >>>>> 892422. Which covers these errors. >>>>> >>>>> 3) I have switched out the cables to the public and the private >>>>> NIC’s with no change in issues. >>>>> >>>>> 4) RSS/TCP Chimney are disabled in the registry and on the NIC’s >>>>> on each node. >>>>> >>>>> 5) NIC Drivers are the latest from HP Site ( NC373i) and EMC >>>>> Powerpath software 5.3 SP1 for the SAN disk on each node. >>>>> >>>>> >>>>> >>>>> Basically we are pinging the Owning Node server from our workstations >>>>> and we loose about 5-10 pings during this time, on both the primary and >>>>> the >>>>> secondary nodes of the cluster. ( both are into the same Cisco Switch >>>>> 45xx) >>>>> >>>>> >>>>> >>>>> We also was pinging each of the servers from each other ( both on the >>>>> same switch/VLAN) and we also saw the ping loss at the same time. >>>>> >>>>> >>>>> >>>>> Only idea I had is to move the public NIC’s to another switch to >>>>> eliminate the switch as the point of contention, or get new hardware and >>>>> migrate the databases off this cluster and decommission it. >>>>> >>>>> >>>>> >>>>> I checked other cluster nodes connected to these switches ( 32bit) and >>>>> we don’t see this problem. >>>>> >>>>> >>>>> >>>>> Anything I might be missing or overlooked? Questions, or bouncing some >>>>> ideas off the wall is appreciated… >>>>> >>>>> >>>>> >>>>> Z >>>>> >>>>> >>>>> >>>>> Edward E. Ziots >>>>> >>>>> CISSP, Network +, Security + >>>>> >>>>> Network Engineer >>>>> >>>>> Lifespan Organization >>>>> >>>>> Email:[email protected] >>>>> >>>>> Cell:401-639-3505 >>>>> >>>>> >>>>> >>>>> ~ Finally, powerful endpoint security that ISN'T a resource hog! ~ >>>>> ~ <http://www.sunbeltsoftware.com/Business/VIPRE-Enterprise/> ~ >>>>> >>>>> --- >>>>> To manage subscriptions click here: >>>>> http://lyris.sunbelt-software.com/read/my_forums/ >>>>> or send an email to [email protected] >>>>> with the body: unsubscribe ntsysadmin >>>>> >>>> >>>> ~ Finally, powerful endpoint security that ISN'T a resource hog! ~ >>>> ~ <http://www.sunbeltsoftware.com/Business/VIPRE-Enterprise/> ~ >>>> >>>> --- >>>> To manage subscriptions click here: >>>> http://lyris.sunbelt-software.com/read/my_forums/ >>>> or send an email to [email protected] >>>> with the body: unsubscribe ntsysadmin >>>> >>> >>> ~ Finally, powerful endpoint security that ISN'T a resource hog! ~ >>> ~ <http://www.sunbeltsoftware.com/Business/VIPRE-Enterprise/> ~ >>> >>> --- >>> To manage subscriptions click here: >>> http://lyris.sunbelt-software.com/read/my_forums/ >>> or send an email to [email protected] >>> with the body: unsubscribe ntsysadmin >>> >> >> ~ Finally, powerful endpoint security that ISN'T a resource hog! ~ >> ~ <http://www.sunbeltsoftware.com/Business/VIPRE-Enterprise/> ~ >> >> --- >> To manage subscriptions click here: >> http://lyris.sunbelt-software.com/read/my_forums/ >> or send an email to [email protected] >> with the body: unsubscribe ntsysadmin >> > > ~ Finally, powerful endpoint security that ISN'T a resource hog! ~ > ~ <http://www.sunbeltsoftware.com/Business/VIPRE-Enterprise/> ~ > > --- > To manage subscriptions click here: > http://lyris.sunbelt-software.com/read/my_forums/ > or send an email to [email protected] > with the body: unsubscribe ntsysadmin > ~ Finally, powerful endpoint security that ISN'T a resource hog! ~ ~ <http://www.sunbeltsoftware.com/Business/VIPRE-Enterprise/> ~ --- To manage subscriptions click here: http://lyris.sunbelt-software.com/read/my_forums/ or send an email to [email protected] with the body: unsubscribe ntsysadmin
