Re: Upgrading Shepherd services

2024-05-17 Thread Ludovic Courtès
Hi Felix,

Felix Lechner via "Development of GNU Guix and the GNU System
distribution."  skribis:

> I have a lot of custom Shepherd services.  Every so often I make a
> mistake that stalls the step in 'guix deploy' that upgrades Shepherd
> services, but without any error messages.
>
> Unfortunately, I can also no longer run 'herd status', which likewise
> hangs, or 'reboot'.  How may I debug such issues in my operating-system
> declaration, please?

The standard service constructors are de-constructors cannot block
shepherd entirely (at least not AFAIK).  So my suggestion would be to
first look at any service you’re using that has a custom ‘start’ or
‘stop’ method doing weird things; make sure none of them can block.

Another thing: when shepherd is blocked, try “sudo strace -p1” so see
what syscall it’s waiting to complete (that’s the likely problem).

In addition, check the last lines of /var/log/messages so see what
shepherd was trying to do before blocking.

HTH!

Ludo’.



Re: [PATCH 0/1] Possible fix of the Cuirass build with the latest Guix

2024-05-17 Thread Ludovic Courtès
Hi,

Rodion Goritskov  skribis:

> Got some problems due to the change re-exports in (guix utils).
> Also, should I send patches for Cuirass here or to the guix-patches?

Good catch!  I see that Maxim fixed these issues just recently; it
should be fine now.

Thanks,
Ludo’.



Re: `make check` fails when trying to build from Git

2024-05-17 Thread Ludovic Courtès
Hi Ashvith,

Ashvith Shetty  skribis:

> while setting up the build environment: executing 
> `/home/ashvith/Desktop/guix/test-tmp/store/a2k16z6jzwzvvg00bhf4mf9v0k65r7kq-guile-bootstrap-2.0/bin/guile':
>  Exec format error

What does this command return:

  file 
/home/ashvith/Desktop/guix/test-tmp/store/a2k16z6jzwzvvg00bhf4mf9v0k65r7kq-guile-bootstrap-2.0/bin/guile

?

Thanks for testing!

Ludo’.



Re: Upgrading Shepherd services

2024-05-17 Thread Attila Lendvai
> I have a lot of custom Shepherd services. Every so often I make a
> mistake that stalls the step in 'guix deploy' that upgrades Shepherd
> services, but without any error messages.
> 
> Unfortunately, I can also no longer run 'herd status', which likewise
> hangs, or 'reboot'. How may I debug such issues in my operating-system
> declaration, please?


Ludo,

this is the kind of issue for which extensive logging is needed. i.e. there's 
no self-contained reproducer (or is there, Felix?), and it requires a live 
environment to experience it.

and i suspect that i may even have fixed this in one of the commits that cleans 
up shepherd's error handling. one of the issues i remember is that an exception 
from the start (or stop?) GEXP of a service sometimes brought shepherd into a 
non-responsive state (without any sign of it in its logs).

Felix,

i'm planning to rebase my branch on Ludo's devel branch. it's not trivial 
because Ludo continues hacking shepherd, but i'll hopefully do it in the next 
few days. after that you may give it a try and see if you experience this issue 
again, and if you do then you can have plenty of logs to give you a clue 
why/how it happens.

if you do have a reproducer, then i'd be interested in adding it as a test in 
the shepherd codebase.

https://codeberg.org/attila-lendvai-patches/shepherd/commits/branch/various

-- 
• attila lendvai
• PGP: 963F 5D5F 45C7 DFCD 0A39
--
“It is humiliating to realize that when you drive yourself underground, when 
you fake who you are, often you do so for people you do not even like or 
respect.”
— Nathaniel Branden (1930–2014)