I need few hours to check how things are going.
Maybe im wrong with disable offloading... but it loooks stable now, no hangs

-- 
banuchka

On 14 April 2017 at 13:05:59, Jarrod Johnson (jjohns...@lenovo.com) wrote:

> Bad udp checksum is a side effect of wireshark when offloading is
> enabled.  What is happening is that wireshark captures the data before the
> TX offloaded checksumming occurs.  So the TX looks incorrect, but it’s
> intentional because the hardware will do it instead.  RX should look fine,
> and every TX should **look** like bad checksum if offload enabled, but
> it’s actually going to be fine on the wire.
>
>
>
>
>
> *From:* banuchka [mailto:tyrche...@gmail.com]
> *Sent:* Friday, April 14, 2017 7:53 AM
> *To:* xCAT Users Mailing list; Jarrod Johnson
> *Subject:* Re: [xcat-user] Confluent as console server. Consoles hangs
> ~after 24h.
>
>
>
> I can see keepalive messages to consoles… and I've seen “bad udp cksum”
> hope that is a problem.
>
>
>
> I’ve turned off TX/RX offloading(and generic offloading as well) on my eth
> card. Now tcpdump on 623 looks much better.
>
>
>
> On 14 April 2017 at 12:42:03, Jarrod Johnson (jjohns...@lenovo.com) wrote:
>
> If you ctrl-e, c, o, does it restore the console after the time?
>
>
>
> Can you tell that it goes after exactly 24hours on the dot?
>
>
>
> When console hung, does ‘ipmitool sol activate’ say ‘session already
> active’?
>
>
>
> Does /var/log/confluent/consoles/<nodename> have any interesting events
> crop up?
>
>
>
> Pyghmi will do keepalive as well, and if that’s the problem, it should be
> much shorter than 24 hours.  In fact, it should be checking if the SOL
> payload is active and owned by confluent specifically every couple of
> minutes.
>
>
>
> *From:* banuchka [mailto:tyrche...@gmail.com <tyrche...@gmail.com>]
> *Sent:* Friday, April 14, 2017 5:55 AM
> *To:* xcat-user@lists.sourceforge.net
> *Subject:* Re: [xcat-user] Confluent as console server. Consoles hangs
> ~after 24h.
>
>
>
> My last reply was incorrect. Problems still here. Im trying to find
> something usefull inbetween confluent/pyghmi...
>
> Confluent restart solves hangs/reopen all connections.
>
> I think it isnt the best option to restart confluent 1 or 2 times in 24h.
>
> --
> banuchka
>
> On 13 April 2017 at 17:03:19, banuchka (tyrche...@gmail.com) wrote:
>
> It is Dell’s related problem, not 100% but…
>
> Confluent from current master is doing things well :)
>
> Thanks for pretty nice tool “confluentdbutil".
>
>
>
> On 13 April 2017 at 11:30:14, banuchka (tyrche...@gmail.com) wrote:
>
> Looks like that problem was before… The fix was to use ipmitool with
> keepalive(one from xcat repos).
>
> Here pyghmi is used maybe that the reason?
>
>
>
> On 13 April 2017 at 08:22:28, banuchka (tyrche...@gmail.com) wrote:
>
> Hi,
>
>
>
> Im trying to completely migrate from conserver to confluent, but catch
> strange behaviour.
>
> Some of my consoles hangs ~after 24, so no any new messages in their logs
> or in rcons.
>
> I send messages with timestamp from OS >/dev/console every 30-60min and
> take a look on them for monitoring purposes(consoles availability
> monitoring).
>
> I can open rcons and hit enter, after few secs console is waking
> up(strange). I didnt see it happen with conserver or maybe im wrong...
>
> Some details:
>
> - as i can see the bigest part of consoles with hangs behaviour are Dell
> idrac. Doesnt matter which type of RacSerial or IPMISerial is in use.
>
> - racreset hard/ipmitool bmc reset didnt do the things
>
> - hit enter to console wake it up(for example with expect i can send
> \r\n\f, but it looks bad)
>
> - i didnt try to clean confluent's conf and restart it. Not sure it may
> help.
>
> - HP consoles works well, same ipmi
>
> - few consoles with custom pluging works good as well
>
>
>
> So maybe my question is not about confluent, but if some of you have some
> knowledge about same problems please share it! ;)
>
>
>
> --
> banuchka
>
> --
> banuchka
>
> --
> banuchka
>
> ------------------------------------------------------------------------------
>
> Check out the vibrant tech community on one of the world's most
> engaging tech sites, Slashdot.org!
> http://sdm.link/slashdot_______________________________________________
> xCAT-user mailing list
> xCAT-user@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/xcat-user
>
> --
> banuchka
>
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user

Reply via email to