Send netdisco-users mailing list submissions to
[email protected]
To subscribe or unsubscribe via the World Wide Web, visit
https://lists.sourceforge.net/lists/listinfo/netdisco-users
or, via email, send a message with subject or body 'help' to
[email protected]
You can reach the person managing the list at
[email protected]
When replying, please edit your Subject line so it is more specific
than "Re: Contents of netdisco-users digest..."
Today's Topics:
1. Re: scheduled jobs stop (Ricardo Stella)
2. Re: scheduled jobs stop ([email protected])
--- Begin Message ---
Also happening here. I also had exported the DB in order to install on a
new VM with new OS. Had a couple of problems that I posted but had this
same error on the logs.
Noticed all jobs queued for a couple of days and nothing running.
Last message on logs was:
Argument "" isn't numeric in read at
/home/netdisco/perl5/lib/perl5/MCE/Queue.pm line 1439, <__ANONIO__> line 1.
Sereal: Error: Bad Sereal header: Not a valid Sereal document. at offset 1
of input at srl_decoder.c line 580 at
/home/netdisco/perl5/lib/perl5/MCE/Queue.pm line 1445, <__ANONIO__> line 1.
Restarting it seems to get the jobs running again.
On Mon, Jan 27, 2020 at 10:54 AM marco via netdisco-users <
[email protected]> wrote:
> Hi there
>
> I had set up a new ND2 host on debian buster some weeks ago
> for experimental purpose
> I have another ND2 host up and running since years
>
> Software Version
> App::Netdisco 2.44.4
> SNMP::Info 3.70
> DB Schema 61
> PostgreSQL 12.00.1
> Perl 5.28.1
>
> I restore db from another ND2
> and copy deployment.yml
> It worked
>
> But I noticed that it stops running the scheduled jobs after some times
> (days)
> I had to restart netdisco-backend,
>
>
> here some info I collect
>
> from netdisco-backend.log
> ...
> [392] 2020-01-24 15:15:18 debug mgr (2): getting potential jobs for 1
> workers
> [2700] 2020-01-24 15:15:18 debug [172.17.185.50] arpnip - processed
> 373 ARP Cache entries
> [2700] 2020-01-24 15:15:18 debug [172.17.185.50] arpnip - processed 0
> IPv6 Neighbor Cache entries
> [2700] 2020-01-24 15:15:18 info pol (3): wrapping up arpnip
> job(22423168) - status done at Fri Jan 24 16:15:18 2020
> [392] 2020-01-24 15:15:18 debug getsome: cancelled 0E0 duplicate(s) of
> job 22423235
> [392] 2020-01-24 15:15:18 info mgr (2): job 22423235 booked out for
> this processing node
> Argument "PID_2700" isn't numeric in read at
> /home/netdisco/perl5/lib/perl5/MCE/Queue.pm line 477, <__ANONIO__> line
> 31470.
> Sereal: Error: Bad Sereal header: Not a valid Sereal document. at
> offset 1 of input at srl_decoder.c line 580 at
> /home/netdisco/perl5/lib/perl5/MCE/Queue.pm line 480, <__ANONIO__> line
> 31470.
>
> root@deb-netdisco:~# systemctl status netdisco-backend.service
> ● netdisco-backend.service - Netdisco Backend Service
> Loaded: loaded (/etc/systemd/system/netdisco-backend.service; enabled;
> vendor preset: enabled)
> Active: active (running) since Fri 2020-01-24 09:53:03 CET; 3 days ago
> Process: 110 ExecStart=/home/netdisco/bin/netdisco-backend start
> (code=exited, status=0/SUCCESS)
> Main PID: 216 (netdisco-backen)
> Tasks: 2 (limit: 4915)
> Memory: 143.0M
> CGroup: /system.slice/netdisco-backend.service
> └─216 netdisco-backend
>
> gen 24 09:53:02 deb-netdisco systemd[1]: Starting Netdisco Backend
> Service...
> gen 24 09:53:03 deb-netdisco netdisco-backend[110]: Netdisco Backend
> [Started]
> gen 24 09:53:03 deb-netdisco netdisco-backend[110]: config watcher:
> watching /home/netdisco/environments for updates.
> gen 24 09:53:03 deb-netdisco systemd[1]: Started Netdisco Backend
> Service.
> gen 24 10:01:48 deb-netdisco netdisco-backend[110]: --
> /home/netdisco/environments/deployment.yml updated.
> gen 24 10:01:48 deb-netdisco netdisco-backend[110]: config watcher:
> sending TERM to the server (pid:217)...
>
> root@deb-netdisco:~# ps aux | grep netd
> netdisco 216 0.0 0.3 22840 16008 ? S gen24 6:19
> netdisco-backend
> netdisco 281 0.0 0.3 20744 13680 ? S gen24 0:00 perl
> /home/netdisco/bin/netdisco-web start
> netdisco 282 0.0 0.3 22152 16696 ? S gen24 0:47
> starman master --disable-keepalive --user 1001 --group 1001
> /home/netdisco/perl5/bin/netdisco-web-fg
> netdisco 372 0.0 0.0 0 0 ? Z gen24 0:16 [nd2:
> master] <defunct>
> netdisco 373 0.0 2.7 135148 117200 ? S gen24 0:06
> starman worker --disable-keepalive --user 1001 --group 1001
> /home/netdisco/perl5/bin/netdisco-web-fg
> netdisco 374 0.0 2.8 136000 118000 ? S gen24 0:06
> starman worker --disable-keepalive --user 1001 --group 1001
> /home/netdisco/perl5/bin/netdisco-web-fg
> netdisco 375 0.0 2.7 133744 115940 ? S gen24 0:06
> starman worker --disable-keepalive --user 1001 --group 1001
> /home/netdisco/perl5/bin/netdisco-web-fg
> netdisco 376 0.0 2.8 137420 119504 ? S gen24 0:06
> starman worker --disable-keepalive --user 1001 --group 1001
> /home/netdisco/perl5/bin/netdisco-web-fg
> netdisco 377 0.0 2.7 133792 115996 ? S gen24 0:05
> starman worker --disable-keepalive --user 1001 --group 1001
> /home/netdisco/perl5/bin/netdisco-web-fg
> root 3405 0.0 0.0 6096 824 pts/0 S+ 10:59 0:00 grep
> netd
>
>
> after stop and start
> root@deb-netdisco:~# systemctl start netdisco-backend.service
>
> it seems to work again
> [392] 2020-01-24 15:15:18 info mgr (2): job 22423235 booked out for
> this processing node
> Argument "PID_2700" isn't numeric in read at
> /home/netdisco/perl5/lib/perl5/MCE/Queue.pm line 477, <__ANONIO__> line
> 31470.
> Sereal: Error: Bad Sereal header: Not a valid Sereal document. at
> offset 1 of input at srl_decoder.c line 580 at
> /home/netdisco/perl5/lib/perl5/MCE/Queue.pm line 480, <__ANONIO__> line
> 31470.
> [3429] 2020-01-27 10:10:08 warn App::Netdisco 2.044004 backend
> [3429] 2020-01-27 10:10:08 info resolving backend hostname...
> [3433] 2020-01-27 10:10:08 info applying role Scheduler to worker 1
> [3436] 2020-01-27 10:10:08 info applying role Poller to worker 4
> ...
>
>
> _______________________________________________
> Netdisco mailing list
> [email protected]
> https://sourceforge.net/p/netdisco/mailman/netdisco-users/
--
°(((=((===°°°(((================================================
--- End Message ---
--- Begin Message ---
Hi Ricardo
Sorry I forgot to sign my email
By the way I'm Marco
It stopped again yesterday after few hour.
...
[5754] 2020-01-27 17:06:59 debug -> run worker main/wirelessnodes/100
[5754] 2020-01-27 17:06:59 info pol (3): wrapping up macsuck job(22425208)
- status done at Mon Jan 27 18:06:59 2020
[5750] 2020-01-27 17:06:59 debug [172.17.119.6] macsuck - port 1:43 vlan
unknown : 1 nodes
Argument "PID_5754" isn't numeric in abs at
/home/netdisco/perl5/lib/perl5/MCE/Core/Manager.pm line 206, <__ANONIO__> line
32948.
Can't call method "_mce_m_pending" on an undefined value at
/home/netdisco/perl5/lib/perl5/MCE/Queue.pm line 679, <__ANONIO__> line 32949.
I activated debug, it seems that some scheduled jobs (macsuck, discoverall
etc.) cause the error "Argument "PID_####" isn't numeric " and it zombies
netdisco-backend child
ps aux | grep netd
netdisco 3428 0.0 0.3 22840 15848 ? S gen27 2:05
netdisco-backend
netdisco 3429 0.0 0.0 0 0 ? Z gen27 0:15 [nd2:
master] <defunct>
I can't say if it is caused by my new setup/configuration or something else
Marco
> Il 27 gennaio 2020 alle 17.03 Ricardo Stella <[email protected]> ha scritto:
>
>
> Also happening here. I also had exported the DB in order to install on a new
> VM with new OS. Had a couple of problems that I posted but had this same
> error on the logs.
> Noticed all jobs queued for a couple of days and nothing running.
> Last message on logs was:
> Argument "" isn't numeric in read at
> /home/netdisco/perl5/lib/perl5/MCE/Queue.pm line 1439, line 1.
> Sereal: Error: Bad Sereal header: Not a valid Sereal document. at offset 1 of
> input at srl_decoder.c line 580 at
> /home/netdisco/perl5/lib/perl5/MCE/Queue.pm line 1445, line 1.
>
> Restarting it seems to get the jobs running again.
>
>
> On Mon, Jan 27, 2020 at 10:54 AM marco via netdisco-users <
> [email protected]> wrote:
> > Hi there
> >
> > I had set up a new ND2 host on debian buster some weeks ago
> > for experimental purpose
> > I have another ND2 host up and running since years
> >
> > Software Version
> > App::Netdisco 2.44.4
> > SNMP::Info 3.70
> > DB Schema 61
> > PostgreSQL 12.00.1
> > Perl 5.28.1
> >
> > I restore db from another ND2
> > and copy deployment.yml
> > It worked
> >
> > But I noticed that it stops running the scheduled jobs after some times
> > (days)
> > I had to restart netdisco-backend,
> >
> >
> > here some info I collect
> >
> > from netdisco-backend.log
> > ...
> > [392] 2020-01-24 15:15:18 debug mgr (2): getting potential jobs for 1
> > workers
> > [2700] 2020-01-24 15:15:18 debug [172.17.185.50] arpnip - processed
> > 373 ARP Cache entries
> > [2700] 2020-01-24 15:15:18 debug [172.17.185.50] arpnip - processed 0
> > IPv6 Neighbor Cache entries
> > [2700] 2020-01-24 15:15:18 info pol (3): wrapping up arpnip
> > job(22423168) - status done at Fri Jan 24 16:15:18 2020
> > [392] 2020-01-24 15:15:18 debug getsome: cancelled 0E0 duplicate(s) of
> > job 22423235
> > [392] 2020-01-24 15:15:18 info mgr (2): job 22423235 booked out for
> > this processing node
> > Argument "PID_2700" isn't numeric in read at
> > /home/netdisco/perl5/lib/perl5/MCE/Queue.pm line 477, line 31470.
> > Sereal: Error: Bad Sereal header: Not a valid Sereal document. at
> > offset 1 of input at srl_decoder.c line 580 at
> > /home/netdisco/perl5/lib/perl5/MCE/Queue.pm line 480, line 31470.
> >
> > root@deb-netdisco:~# systemctl status netdisco-backend.service
> > ● netdisco-backend.service - Netdisco Backend Service
> > Loaded: loaded (/etc/systemd/system/netdisco-backend.service; enabled;
> > vendor preset: enabled)
> > Active: active (running) since Fri 2020-01-24 09:53:03 CET; 3 days ago
> > Process: 110 ExecStart=/home/netdisco/bin/netdisco-backend start
> > (code=exited, status=0/SUCCESS)
> > Main PID: 216 (netdisco-backen)
> > Tasks: 2 (limit: 4915)
> > Memory: 143.0M
> > CGroup: /system.slice/netdisco-backend.service
> > └─216 netdisco-backend
> >
> > gen 24 09:53:02 deb-netdisco systemd[1]: Starting Netdisco Backend
> > Service...
> > gen 24 09:53:03 deb-netdisco netdisco-backend[110]: Netdisco Backend
> > [Started]
> > gen 24 09:53:03 deb-netdisco netdisco-backend[110]: config watcher:
> > watching /home/netdisco/environments for updates.
> > gen 24 09:53:03 deb-netdisco systemd[1]: Started Netdisco Backend
> > Service.
> > gen 24 10:01:48 deb-netdisco netdisco-backend[110]: --
> > /home/netdisco/environments/deployment.yml updated.
> > gen 24 10:01:48 deb-netdisco netdisco-backend[110]: config watcher:
> > sending TERM to the server (pid:217)...
> >
> > root@deb-netdisco:~# ps aux | grep netd
> > netdisco 216 0.0 0.3 22840 16008 ? S gen24 6:19
> > netdisco-backend
> > netdisco 281 0.0 0.3 20744 13680 ? S gen24 0:00 perl
> > /home/netdisco/bin/netdisco-web start
> > netdisco 282 0.0 0.3 22152 16696 ? S gen24 0:47
> > starman master --disable-keepalive --user 1001 --group 1001
> > /home/netdisco/perl5/bin/netdisco-web-fg
> > netdisco 372 0.0 0.0 0 0 ? Z gen24 0:16 [nd2:
> > master]
> > netdisco 373 0.0 2.7 135148 117200 ? S gen24 0:06
> > starman worker --disable-keepalive --user 1001 --group 1001
> > /home/netdisco/perl5/bin/netdisco-web-fg
> > netdisco 374 0.0 2.8 136000 118000 ? S gen24 0:06
> > starman worker --disable-keepalive --user 1001 --group 1001
> > /home/netdisco/perl5/bin/netdisco-web-fg
> > netdisco 375 0.0 2.7 133744 115940 ? S gen24 0:06
> > starman worker --disable-keepalive --user 1001 --group 1001
> > /home/netdisco/perl5/bin/netdisco-web-fg
> > netdisco 376 0.0 2.8 137420 119504 ? S gen24 0:06
> > starman worker --disable-keepalive --user 1001 --group 1001
> > /home/netdisco/perl5/bin/netdisco-web-fg
> > netdisco 377 0.0 2.7 133792 115996 ? S gen24 0:05
> > starman worker --disable-keepalive --user 1001 --group 1001
> > /home/netdisco/perl5/bin/netdisco-web-fg
> > root 3405 0.0 0.0 6096 824 pts/0 S+ 10:59 0:00 grep
> > netd
> >
> >
> > after stop and start
> > root@deb-netdisco:~# systemctl start netdisco-backend.service
> >
> > it seems to work again
> > [392] 2020-01-24 15:15:18 info mgr (2): job 22423235 booked out for
> > this processing node
> > Argument "PID_2700" isn't numeric in read at
> > /home/netdisco/perl5/lib/perl5/MCE/Queue.pm line 477, line 31470.
> > Sereal: Error: Bad Sereal header: Not a valid Sereal document. at
> > offset 1 of input at srl_decoder.c line 580 at
> > /home/netdisco/perl5/lib/perl5/MCE/Queue.pm line 480, line 31470.
> > [3429] 2020-01-27 10:10:08 warn App::Netdisco 2.044004 backend
> > [3429] 2020-01-27 10:10:08 info resolving backend hostname...
> > [3433] 2020-01-27 10:10:08 info applying role Scheduler to worker 1
> > [3436] 2020-01-27 10:10:08 info applying role Poller to worker 4
> > ...
> >
> >
> > _______________________________________________
> > Netdisco mailing list
> > [email protected]
> > https://sourceforge.net/p/netdisco/mailman/netdisco-users/
>
> --
> °((( = (( ===°°° ((( ================================================
--- End Message ---
_______________________________________________
Netdisco mailing list - Digest Mode
[email protected]
https://lists.sourceforge.net/lists/listinfo/netdisco-users