** Description changed:

- From time to time (sporadically and very rarely - maybe in one or two
- attempt out of twenty) I face a situation where the installer system (on
- s390x) does not boot-up completely.
+ [impact]
+ When there is a serial console configured, there was a unit cycle
+ 
+ serial-getty@$TTY.service -> cloud-final.service -> multi-user.target ->
+ getty.target -> serial-getty@$TTY.service
+ 
+ (or something like that)
+ 
+ Depending on which unit systemd kills to resolve the cycle, this can
+ result in cloud-init never completing which leads to the subiquity
+ server waiting forever for it and nothing useful can be done (other than
+ restarting and hoping for better luck next time).
+ 
+ Because subiquity itself waits for cloud-init (and this has been true
+ for a long time now) there is no need for serial-getty@$TTY.service to
+ wait on cloud-final.service.
+ 
+ [regression potential]
+ This change results in shuffling the systemd units around a fair bit, but the 
new arrangement has been tested in devel for a few months now and works well 
there. It's also much more straightforward than the current setup.
+ 
+ [test case]
+ This is a bit tricky as it's an intermittent failure. Basically, boot the 
live installer with a serial console configured a bunch of times and (10?) 
check that the installer starts up properly each time.
+ 
+ [original description]
+ From time to time (sporadically and very rarely - maybe in one or two attempt 
out of twenty) I face a situation where the installer system (on s390x) does 
not boot-up completely.
  
  This happened to me in the past already, but since it happened only one or 
twice I thought it's due to resource constraints on the system or so.
  But since I faced it now again on LPAR (before it was on z/VM), I'm opening 
this ticket now.
  
  In the latest case I used the focal daily live image from the 20th of
  July with installer 20.06.1 (but that also happened with previous
  versions).
  
  The situation is like this:
  
  In boot of the installer ends here (LPAR):
  ...
  "[  128.200711] cloud-init[1375]: The key's randomart image is:"
  "[  128.200735] cloud-init[1375]: +--[ED25519 256]--+"
  "[  128.200758] cloud-init[1375]: |o       .....ooo |"
  "[  128.200781] cloud-init[1375]: |.= .   . +. o.  .|"
  "[  128.200804] cloud-init[1375]: |+ * . . * o.o .  |"
  "[  128.200826] cloud-init[1375]: |.= o . = = = +   |"
  "[  128.200849] cloud-init[1375]: |o   + . S o +    |"
  "[  128.200876] cloud-init[1375]: |   + o = . .     |"
  "[  128.200900] cloud-init[1375]: |    o + .        |"
  "[  128.200925] cloud-init[1375]: |     o.=. E      |"
  "[  128.200947] cloud-init[1375]: |     .+.o+o.     |"
  "[  128.200977] cloud-init[1375]: +----[SHA256]-----+"
  "[  138.898906] cloud-init[2217]: Cloud-init v. 
20.2-45-g5f7825e2-0ubuntu1~20.04."
  "1 running 'modules:config' at Wed, 22 Jul 2020 11:27:39 +0000. Up 138.77 
seconds"
  .
  "[  138.898966] cloud-init[2217]: Set the following 'random' passwords"
  "[  138.899001] cloud-init[2217]: installer:aecmewaoicnai"
  
  or another example (z/VM):
  ...
- ¬   93.463680| cloud-init¬1282|: +--¬ED25519 256|--+ 
- ¬   93.463713| cloud-init¬1282|: !Eo=o  ....       ! 
- ¬   93.463749| cloud-init¬1282|: !.Bo.o  ... o     ! 
- ¬   93.463782| cloud-init¬1282|: !**.*... o =      ! 
- ¬   93.463818| cloud-init¬1282|: !*=O o. o . .     ! 
- ¬   93.463849| cloud-init¬1282|: !**++    S        ! 
- ¬   93.463886| cloud-init¬1282|: !§o+..            ! 
- ¬   93.463918| cloud-init¬1282|: !+*o.             ! 
- ¬   93.463954| cloud-init¬1282|: !.o.              ! 
- ¬   93.463988| cloud-init¬1282|: !.                ! 
- ¬   93.464028| cloud-init¬1282|: +----¬SHA256|-----+ 
+ ¬   93.463680| cloud-init¬1282|: +--¬ED25519 256|--+
+ ¬   93.463713| cloud-init¬1282|: !Eo=o  ....       !
+ ¬   93.463749| cloud-init¬1282|: !.Bo.o  ... o     !
+ ¬   93.463782| cloud-init¬1282|: !**.*... o =      !
+ ¬   93.463818| cloud-init¬1282|: !*=O o. o . .     !
+ ¬   93.463849| cloud-init¬1282|: !**++    S        !
+ ¬   93.463886| cloud-init¬1282|: !§o+..            !
+ ¬   93.463918| cloud-init¬1282|: !+*o.             !
+ ¬   93.463954| cloud-init¬1282|: !.o.              !
+ ¬   93.463988| cloud-init¬1282|: !.                !
+ ¬   93.464028| cloud-init¬1282|: +----¬SHA256|-----+
  ¬  104.841438| cloud-init¬2004|: Cloud-init v. 
20.2-45-g5f7825e2-0ubuntu1ß20.04.
  1 running 'modules:config' at Mon, 20 Jul 2020 10:46:38 +0000. Up 104.63 
seconds
- . 
- ¬  104.841490| cloud-init¬2004|: Set the following 'random' passwords 
- ¬  104.841516| cloud-init¬2004|: installer:U9NJDuvXFw6X2fxG7pP8 
+ .
+ ¬  104.841490| cloud-init¬2004|: Set the following 'random' passwords
+ ¬  104.841516| cloud-init¬2004|: installer:U9NJDuvXFw6X2fxG7pP8
  
  But it is not complete at this point.
  A completed boot of the installer system ends like this:
  
  "It is possible to connect to the installer over the network, which"
  "might allow the use of a more capable terminal and can offer more languages"
  "than can be rendered in the Linux console."
  
- 
  "To connect, SSH to installer@<IP address>."
- 
  
  "The password you should use is "ydnjdnciu"kZ4tR4vRvPxHPerCNU8g""."
  
  "The host key fingerprints are:"
  
  "RSA     SHA256:n+6TJsfdCBII2PO89GMU10mG1oFvEIFBT2v6uPN0Jz0"
  "ECDSA   SHA256:VcDS5ac8xswXxFEghjo1ZIcue38AM6HJg0poIxdeeec"
  "ED25519 SHA256:lb/DVVhj1obDPhfo3M8oPqeAyduvlLcPFJCC8ZaiCJY"
  
  "Ubuntu 20.04 LTS ubuntu-server sclp_line0"
- 
  
  In such an above situation I also can't reach the subiquity UI:
  
  fheimes@T570:~$ ssh-keygen -f "/home/fheimes/.ssh/known_hosts" -R s1lp14
  # Host s1lp14 found: line 165
  /home/fheimes/.ssh/known_hosts updated.
  Original contents retained as /home/fheimes/.ssh/known_hosts.old
  fheimes@T570:~$ ssh installer@s1lp14
  The authenticity of host 's1lp14 (10.245.236.14)' can't be established.
  ECDSA key fingerprint is SHA256:AoYrJhWOFaaEtUun/aZAT8/VgNP+yq//Jhg7vzUwvxE.
  Are you sure you want to continue connecting (yes/no/[fingerprint])? yes
  Warning: Permanently added 's1lp14,10.245.236.14' (ECDSA) to the list of 
known hosts.
- installer@s1lp14's password: 
+ installer@s1lp14's password:
  Welcome to Ubuntu 20.04 LTS (GNU/Linux 5.4.0-40-generic s390x)
  
-  * Documentation:  https://help.ubuntu.com
-  * Management:     https://landscape.canonical.com
-  * Support:        https://ubuntu.com/advantage
+  * Documentation:  https://help.ubuntu.com
+  * Management:     https://landscape.canonical.com
+  * Support:        https://ubuntu.com/advantage
  
-   System information as of Wed Jul 22 11:28:32 UTC 2020
+   System information as of Wed Jul 22 11:28:32 UTC 2020
  
-   System load:    0.44      Memory usage: 4%   Processes:       180
-   Usage of /home: unknown   Swap usage:   0%   Users logged in: 0
+   System load:    0.44      Memory usage: 4%   Processes:       180
+   Usage of /home: unknown   Swap usage:   0%   Users logged in: 0
  
  0 updates can be installed immediately.
  0 of these updates are security updates.
- 
  
  The programs included with the Ubuntu system are free software;
  the exact distribution terms for each program are described in the
  individual files in /usr/share/doc/*/copyright.
  
  Ubuntu comes with ABSOLUTELY NO WARRANTY, to the extent permitted by
  applicable law.
  
- 
  Hence even gathering the logs is unfortunately not easily possible.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1888497

Title:
  Booting the install system does not always succeed, hence a remote ssh
  login is not always possible

To manage notifications about this bug go to:
https://bugs.launchpad.net/subiquity/+bug/1888497/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to