It runs fine on the other 50 boxes. Why would it be different?

Regards
Michael Knill

On 17/9/20, 6:07 pm, "Michael Keuter" <li...@mksolutions.info> wrote:

    I've never used Zabbix. 
    Maybe Zabbix is the issue … (it has some similarities to Monit in a way)
    Try to disable it for a while on the problematic boxes …

    > Am 16.09.2020 um 23:09 schrieb Michael Knill 
<michael.kn...@ipcsolutions.com.au>:
    > 
    > I do have Zabbix running on these boxes which checks memory and will 
notify me if its getting too low and this has not happened. This is what 
alerted me to the Zabbix client memory leak.
    > Also the lockups are very sporadic. Can happen after a couple of weeks or 
the next day.
    > 
    > Regards
    > Michael Knill
    > 
    > On 16/9/20, 11:21 pm, "Michael Keuter" <li...@mksolutions.info> wrote:
    > 
    >   Hi all,
    > 
    >   it might be completely unrelated, but it reminds me at an issue that 
only occured many years ago only on 32bit Geode boxes (Alix/net5501) after we 
integrated Monit into AstLinux.
    >   On these boxes all RAM was eaten up slowly and then these boxes locked 
up after a few days. That's why we didn't installed Monit for these build types.
    > 
    >   To figure it out I then created a cronjob that logged uptime, RAM and 
used Asterisk channels into a logfile every 10 minutes.
    > 
    >> Am 16.09.2020 um 15:07 schrieb Lonnie Abelbeck 
<li...@lonnie.abelbeck.com>:
    >> 
    >> Hi Daryl,
    >> 
    >> Has the AstLinux version changed in the last couple weeks ?
    >> 
    >> What version are you running ?
    >> 
    >> Lonnie
    >> 
    >> 
    >> 
    >>> On Sep 16, 2020, at 8:01 AM, Daryl Richards via Astlinux-users 
<astlinux-users@lists.sourceforge.net> wrote:
    >>> 
    >>> This is good timing (in a bad way..) Time for a "Me too!"
    >>> 
    >>> Recently, over the last couple weeks my APU2 has started locking up 
exactly the same way, just hard lock. I have a serial console cable hooked up 
and there are no messages printed out before, there's just nothing. When the 
problem first started I didn't have a console cable hooked up but I put it in 
to see if any messages were appearing..
    >>> 
    >>> My system is on a UPS. Nothing else in the rack glitches.
    >>> 
    >>> On 2020-09-15 8:36 p.m., Michael Knill wrote:
    >>>> Thanks Chris
    >>>> Yep I have plenty out there too with no issues. It also happened on a 
Qotom box so not hardware related.
    >>>> Regards
    >>>> Michael Knill
    >>>> *From: *AstLinux List <astlinux-users@lists.sourceforge.net>
    >>>> *Reply to: *AstLinux List <astlinux-users@lists.sourceforge.net>
    >>>> *Date: *Wednesday, 16 September 2020 at 8:53 am
    >>>> *To: *AstLinux List <astlinux-users@lists.sourceforge.net>
    >>>> *Cc: *The Cadillac Kid <eldorado...@yahoo.com>
    >>>> *Subject: *Re: [Astlinux-users] APU2 keeps locking up
    >>>> for whatever its worth..  from a hardware standpoint I have probably 
200 APU2s in the field and dont have then just freeze like that..  granted they 
arent running astlinux, but just mentioning it from a power / hardware point of 
view.
    >>>> mine are all running Centos 6.X and asterisk 11 or 13
    >>>> I have had a few chan_sip freezes.. (we wrote a watchdog to catch 
those and restart asterisk).. we get maybe 1 every 3 or 4 months .. (not each 
site but collectively)
    >>>> I have kenrel panicked an APU before by upping and downing the 
ethernet port too much (or so it seems)..  it doesnt happen very often and is 
hard to repeat in the lab.  but have done it esp on an install..  where one is 
prone to plug and unplug cables multiple times in succession for dressing in..
    >>>> I have had a few power bricks go bad.. in all cases there was just no 
output..  no lights on the board at all. its not been that many 3 or 4..  
considering we have sites in some pretty lightning-prone areas i dont feel too 
bad about a few power bricks.
    >>>> we run UPSs on all of our sites
    >>>> we run the US power supply that is sold on the PC-Engines store.
    >>>> On Tuesday, September 15, 2020, 6:04:44 PM EDT, Michael Knill 
<michael.kn...@ipcsolutions.com.au> wrote:
    >>>> Yep I would say environmental for 1, maybe 2 sites but for 3 sites and 
all only recently I cant see how it could be.
    >>>> No serial or long ethernet cables are connected.
    >>>> Yes completely different power adaptors when I changed from APU2 -> 
Qotom.
    >>>> Yes they are all DSL however ALL different modems and service types.
    >>>> I'm thinking I will upgrade all Runnix versions at these sites and if 
still happening upgrade to 1.3.10 and if still happening then I have no idea 
what to do.
    >>>> Regards
    >>>> Michael Knill
    >>>> On 16/9/20, 7:36 am, "Lonnie Abelbeck" <li...@lonnie.abelbeck.com 
<mailto:li...@lonnie.abelbeck.com>> wrote:
    >>>>  So a UPS may solve the issue as with Site 1 ?
    >>>>  Do you have any serial cables connected ?
    >>>>  Any long ethernet cables connected ?
    >>>>  Is this with a variety of power adapters ?  ie. the APU2 -> Qotom 
switch did the power adapter change as well ?
    >>>>  Sure sounds environmental to me.
    >>>>  I've seen DC-to-DC UPSs for about $40 USD, but never tried one.
    >>>> https://protectli.com/product/uninterruptible-power-supply/
    >>>>  Lonnie
    >>>>> On Sep 15, 2020, at 3:58 PM, Michael Knill 
<michael.kn...@ipcsolutions.com.au<mailto:michael.kn...@ipcsolutions.com.au>> 
wrote:
    >>>>> 
    >>>>> Ok I'm reviving this thread as I have now had my third site with this 
issue.
    >>>>> 
    >>>>> Symptom:
    >>>>> Astlinux completely locks up and requires a power reset. The log 
shows NOTHING in all cases.
    >>>>> 
    >>>>> Troubleshooting conducted:
    >>>>> Site 1 - The problem has not occurred since both a UPS AND Power 
Filter have been added
    >>>>> Site 2 - A power filter has been added and the problem has 
reoccurred. I have completely changed the system from an APU2 to Qotom and it 
did the same thing again
    >>>>> Site 3 - A known working APU2 was installed and it locked up 
yesterday.
    >>>>> 
    >>>>> I have checked mSATA cards and they were different across systems at 
these sites.
    >>>>> All systems are running 1.3.7.1 but I am running this at many other 
sites on the same hardware with no issues.
    >>>>> 
    >>>>> Power quality testing is extremely expensive and surely I cant be 
having a power issue at 3 sites!
    >>>>> The whole thing just doesn't make any sense and I don't know where to 
go from here.
    >>>>> Any ideas?
    >>>>> 
    >>>>> Regards
    >>>>> Michael Knill
    >>>>> 
    >>>>> On 22/4/20, 9:24 pm, "Michael Knill" 
<michael.kn...@ipcsolutions.com.au<mailto:michael.kn...@ipcsolutions.com.au>> 
wrote:
    >>>>> 
    >>>>> Looks like this problem was bad power. I was told by the local IT Guy 
that there are regular brownouts so I installed a UPS. No more problems since 
doing so.
    >>>>> Seems like APU's don't like low voltage scenarios.
    >>>>> 
    >>>>> Thanks for your help.
    >>>>> 
    >>>>> Regards
    >>>>> Michael Knill
    >>>>> 
    >>>>> On 13/4/20, 1:05 pm, "Michael Knill" 
<michael.kn...@ipcsolutions.com.au<mailto:michael.kn...@ipcsolutions.com.au>> 
wrote:
    >>>>> 
    >>>>>     Yes could be but very unusual. I have never had this problem with 
any other APU.
    >>>>>     Maybe I will look for a good surge suppressor.
    >>>>> 
    >>>>>     Regards
    >>>>>     Michael Knill
    >>>>> 
    >>>>>     On 13/4/20, 12:47 pm, "Lonnie Abelbeck" 
<li...@lonnie.abelbeck.com <mailto:li...@lonnie.abelbeck.com>> wrote:
    >>>>> 
    >>>>>         Interesting, maybe bad power (spikes, noise, etc.)
    >>>>> 
    >>>>>         Test with a UPS attached or good surge suppresser.
    >>>>> 
    >>>>>         Lonnie
    >>>>> 
    >>>>> 
    >>>>> 
    >>>>>> On Apr 12, 2020, at 9:17 PM, Michael Knill 
<michael.kn...@ipcsolutions.com.au<mailto:michael.kn...@ipcsolutions.com.au>> 
wrote:
    >>>>>> 
    >>>>>> Hi Lonnie
    >>>>>> 
    >>>>>> I have replaced the hardware already and it did EXACTLY the same 
thing ☹
    >>>>>> 
    >>>>>> Regards
    >>>>>> Michael Knill
    >>>>>> 
    >>>>>> On 13/4/20, 12:15 pm, "Lonnie Abelbeck" <li...@lonnie.abelbeck.com 
<mailto:li...@lonnie.abelbeck.com>> wrote:
    >>>>>> 
    >>>>>> 
    >>>>>> 
    >>>>>>> On Apr 12, 2020, at 7:03 PM, Michael Knill 
<michael.kn...@ipcsolutions.com.au<mailto:michael.kn...@ipcsolutions.com.au>> 
wrote:
    >>>>>>> 
    >>>>>>> May not have anything to do with Astlinux but I have a site which 
completely locks up e.g. cannot even communicate using serial port.
    >>>>>>> On reboot its fine but there is nothing in the logs but the bootup 
messages.
    >>>>>>> So far I have replaced the hardware with new storage as well and 
power supply and it is still doing it.
    >>>>>>> 
    >>>>>>> It is currently running Astlinux 1.3.7.1 which has been running 
fine on another APU2 but its only been 8 days.
    >>>>>>> 
    >>>>>>> Any ideas what I can do next?
    >>>>>>> 
    >>>>>>> Regards
    >>>>>>> Michael Knill
    >>>>>> 
    >>>>>> Sounds like an APU2 hardware issue.
    >>>>>> 
    >>>>>> Maybe Pascal will give you a replacement.
    >>>>>> 
    >>>>>> Lonnie
    >>>>>> 


    Michael

    http://www.mksolutions.info





    _______________________________________________
    Astlinux-users mailing list
    Astlinux-users@lists.sourceforge.net
    https://lists.sourceforge.net/lists/listinfo/astlinux-users

    Donations to support AstLinux are graciously accepted via PayPal to 
pay...@krisk.org.


_______________________________________________
Astlinux-users mailing list
Astlinux-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/astlinux-users

Donations to support AstLinux are graciously accepted via PayPal to 
pay...@krisk.org.

Reply via email to