Re: 11-STABLE system unbootable after update

2019-05-19 Thread Scott Bennett
Cyrus Rahman  wrote:

> When you upgrade the kernel you need to upgrade any loadable kernel
> modules at the same time.

 Yes.  For a time, listing them in /etc/src.conf would get them
rebuilt automatically as a sort of epilogue to building a kernel, but then
something changed quite a while back, and I had to comment that line out
and return to doing them manually.  Often they still work without being
rebuilt, and I got careless this time.  Sigh.
>
> Recompile any kmods from ports and you should be ok.

 Done, but the next reboot will be into the previous boot environment,
and it now seems unlikely that I will use this one again rather than
making a new one from a more up-to-date revision.
>
> STABLE doesn't always work.  It's good for adventurous people to try
> it so the bugs get found out, but occasionally it is painful.  Upgrade
> again right after the release in a few weeks.  Usually caution is in
> order just before things get to the BETA stage.  I upgraded just now
> do help test things and discovered your bug, which I posted about a
> few posts before yours on the list.

 Ah.  I suppose I will see that while catching up on my latest email
backlog caused by six days or so with no working system.  Thanks again
for your reply to my call for help.
>
> If the description I posted is similar to yours, go ahead and reply to
> my message on the list, and perhaps go to bugs.freebsd.org and search
> out 'loader', and add any information you might have (or at least
> document the fact that you were affected).

 Will do.
>
> You can quote me on the list, I simply wanted to have you try things
> out before putting my suggestions on it.  Over the years I have grown
> weary of unnecessary noise.
>
 Okay.  I've cc'ed the list this time.
 FWIW, once of the things I have been wishing for in trying new
revisions of 11-STABLE is a fix for the failure of the kernel to honor
the vm.max_wired sysctl variable.  The crash that gave me an opportunity
to try the broken revision was another case of the kernel having pagefixed
so much real memory that it was not only causing paging/swapping when it
should have, but I think the kernel itself couldn't get page frames it
needed fast enough in some situation.  I don't know whether this bug has
been found and fixed yet, though, so I have temporarily returned to
setting vm.kmem_size_max, which does seem to be honored.
 Anyway, thanks for bailing me out yesterday.  The system is now
somewhat usable while I satisfy my curiousity and should be fully usable
upon reversion, which will probably happen tomorrow (Monday) night.


  Scott Bennett, Comm. ASMELG, CFIAG
**
* Internet:   bennett at sdf.org   *xor*   bennett at freeshell.org  *
**
* "A well regulated and disciplined militia, is at all times a good  *
* objection to the introduction of that bane of all free governments *
* -- a standing army."   *
*-- Gov. John Hancock, New York Journal, 28 January 1790 *
**
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: 11-STABLE system unbootable after update

2019-05-19 Thread Scott Bennett
Matt Garber  wrote:

> On Sun, May 19, 2019 at 9:00 AM Scott Bennett  wrote:
>
> >
> >  Many, many thanks to the person who responded with the solution to
> > get past the
> > loader crash!!  My system is now getting work done again, and the rest of
> > the new
> > problems can be dealt with on a running system.
>
>
> Out of curiosity, you mentioned in your previous email you created a new
> boot environment for this upgrade? since additional issues remain beyond
> the loader, have you considered rolling back to the known-good BE and
> attempting the entire process again (with another, separate BE) in a week
> or so? Especially since that would hopefully allow you to continue your
> other work without any additional issues or oddities to sort through in the
> meantime?
>
 Yes, and I will likely do so, but not tonight.  I am still exploring
what is new/changed in this revision (besides a broken zfsloader), and I do
have mprime back to work for the time being, so reverting can wait another
day.  I have reverted a few times in the past, but was always able to start
that from the boot menu.  This time really threw me until I received a reply
to my plea for help telling me how to use the previous loader to get to the
boot menu.  I had already successfully run the r347183 kernel in single-user
mode, so I figured I could do so again.  What broke the zfsloader was the
installworld step.
 After reverting to r345498 I will try bringing my source tree up to date,
which would be quite a bit later than this broken r347183, and then run a
fresh buildworld and buildkernel anyway.  If all that fails, then I'll just
go back to r345498 again and sit it out until 11.3-RELEASE happens before
trying again.


  Scott Bennett, Comm. ASMELG, CFIAG
**
* Internet:   bennett at sdf.org   *xor*   bennett at freeshell.org  *
**
* "A well regulated and disciplined militia, is at all times a good  *
* objection to the introduction of that bane of all free governments *
* -- a standing army."   *
*-- Gov. John Hancock, New York Journal, 28 January 1790 *
**
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: 11-STABLE system unbootable after update

2019-05-19 Thread Matt Garber
On Sun, May 19, 2019 at 9:00 AM Scott Bennett  wrote:

>
>  Many, many thanks to the person who responded with the solution to
> get past the
> loader crash!!  My system is now getting work done again, and the rest of
> the new
> problems can be dealt with on a running system.


Out of curiosity, you mentioned in your previous email you created a new
boot environment for this upgradeā€¦ since additional issues remain beyond
the loader, have you considered rolling back to the known-good BE and
attempting the entire process again (with another, separate BE) in a week
or so? Especially since that would hopefully allow you to continue your
other work without any additional issues or oddities to sort through in the
meantime?


--
Matt
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: 11-STABLE system unbootable after update

2019-05-19 Thread Scott Bennett
 On Sat, 18 May 2019 08:02:20 -0500 Scott Bennett  wrote:

> I have been running 11.2-STABLE for a while at r345498.  Last weekend it 
> crashed,
>so I took the opportunity to install the most recent build I had lying around, 
>r347182.
>I created a new boot environment and installed the r347182 kernel into it, 
>shut the
>system down, and rebooted.  The new kernel came up and appeared to be working 
>okay, so
>I continued with the mergemaster -p -F, make installworld, and mergemaster -F, 
>then
>shut it down again, and rebooted.  It asked for the GELI key for the boot 
>pool, which
>I then entered.  The spinning slash cursor appeared and may have changed for 
>one frame
>or so, and then I got a message beginning with "BTX" and followed by several 
>lines of
>hexadecimal, and then it stopped.  I tried it again just to be sure, and the 
>result was
>exactly the same.
> Does anyone know whether the PMBR boot block or the loader in the 
> freebsd-boot
>partition changed between r345498 and r347182?  I found no warning in 
>/usr/src/UPDATING
 ^ I wrote the revision down wrong 
in my
   notes.  It was really r347183.

>about installworld potentially leaving a wasted system, so I don't have a 
>clear idea
>of what went wrong, much less whether I missed some instruction somewhere 
>about source
>updates.  If anyone can lend me a clue here, I would greatlyappreciate it.  I 
>only had
>one working machine, and now it is only working in a "rescue" mode by booting 
>from a
>DVD.  (Probably needless to say, but I will burn new DVDs with up-to-date 
>stuff as soon
>as my system is working the way it is supposed to again.)
> This motherboard is nearly 11 years old and does not boot from USB (in 
> spite of
>the BIOS menus say), so at the moment I am logged into SDF by running a long 
>out-of-date
>TrueOS installer DVD, which happens to be a pain to get to boot all the way, 
>but I've
>figured how to make it do it rather than get stuck with a logo on the screen 
>that never
>goes away.  Unfortunately, it includes no software to burn a CD or DVD, so I 
>cannot
>make a new bootable disk for the time being.  I will check email much later 
>today or
>this evening.

 So far I've received one reply, which was not copied to this list, yet the 
person
responding suggested something to try and also adked that I post the result to 
the list.
I would have done both anyway, but the respondent may have desired anonymity on 
the list,
so I am not quoting the message I received.
 The suggestion was to wait about a second after entering the GELI 
passphrase and
then hit the space bar on my keyboard.  At the resulting prompt, I should enter 
the path
given in the prompt, but with ".old" appended.  I did that, and YES!!!  It 
worked and
proceeded until I had a boot menu.  I opted for single-user mode and then 
responded to
further requests for GELI passphrases until eventually I had a root shell.  
Being unable
to reach the boot menu was a problem hadn't previously even crossed my mind.  I 
certainly
hope that doing updates from source in the future will not cause this same 
booby trap
again.
 At that point I renamed /boot/zfsloader to /boot/zfsloader.bad.r347183 and
/boot/zfsloader.old to /boot/zfsloader.  I also added a hard link to the latter 
as
/boot/zfsloader.good.r345498.
 All is still not well, however.  In multi-user mode, startx turns the 
screen black
and switches its power setting to standby.  After that it remains unresponsive 
until I
log in via a different vt and send SIGHUP to xorg.  After a rather lengthy 
delay (30-60
seconds, at a guess) it returns to the login session on the console vt.  I have 
now
commented out the "kld_list="/boot/modules/radeonkms.ko" line in 
/etc/rc.conf.local in
hopes that the next boot will get the scfb driver to take it instead of the 
radeonkms
driver from graphics/drm-next-kmod.  If someone knows, is this a case where 
rebuilding and
reinstalling graphics/drm-next-kmod?  If so, then I will do that, but I see 
that the
Makefile still appears to use the same distribution file.
 Many, many thanks to the person who responded with the solution to get 
past the
loader crash!!  My system is now getting work done again, and the rest of the 
new
problems can be dealt with on a running system.


  Scott Bennett, Comm. ASMELG, CFIAG
**
* Internet:   bennett at sdf.org   *xor*   bennett at freeshell.org  *
**
* "A well regulated and disciplined militia, is at all times a good  *
* objection to the introduction of that bane of all free governments *
* -- a standing army."   *
*-- Gov. John Hancock, New York Journal, 28 January 1790 *

11-STABLE system unbootable after update

2019-05-18 Thread Scott Bennett
 I have been running 11.2-STABLE for a while at r345498.  Last weekend it 
crashed,
so I took the opportunity to install the most recent build I had lying around, 
r347182.
I created a new boot environment and installed the r347182 kernel into it, shut 
the
system down, and rebooted.  The new kernel came up and appeared to be working 
okay, so
I continued with the mergemaster -p -F, make installworld, and mergemaster -F, 
then
shut it down again, and rebooted.  It asked for the GELI key for the boot pool, 
which
I then entered.  The spinning slash cursor appeared and may have changed for 
one frame
or so, and then I got a message beginning with "BTX" and followed by several 
lines of
hexadecimal, and then it stopped.  I tried it again just to be sure, and the 
result was
exactly the same.
 Does anyone know whether the PMBR boot block or the loader in the 
freebsd-boot
partition changed between r345498 and r347182?  I found no warning in 
/usr/src/UPDATING
about installworld potentially leaving a wasted system, so I don't have a clear 
idea
of what went wrong, much less whether I missed some instruction somewhere about 
source
updates.  If anyone can lend me a clue here, I would greatlyappreciate it.  I 
only had
one working machine, and now it is only working in a "rescue" mode by booting 
from a
DVD.  (Probably needless to say, but I will burn new DVDs with up-to-date stuff 
as soon
as my system is working the way it is supposed to again.)
 This motherboard is nearly 11 years old and does not boot from USB (in 
spite of
the BIOS menus say), so at the moment I am logged into SDF by running a long 
out-of-date
TrueOS installer DVD, which happens to be a pain to get to boot all the way, 
but I've
figured how to make it do it rather than get stuck with a logo on the screen 
that never
goes away.  Unfortunately, it includes no software to burn a CD or DVD, so I 
cannot
make a new bootable disk for the time being.  I will check email much later 
today or
this evening.
 Thanks in advance for any helpful ideas!


  Scott Bennett, Comm. ASMELG, CFIAG
**
* Internet:   bennett at sdf.org   *xor*   bennett at freeshell.org  *
**
* "A well regulated and disciplined militia, is at all times a good  *
* objection to the introduction of that bane of all free governments *
* -- a standing army."   *
*-- Gov. John Hancock, New York Journal, 28 January 1790 *
**
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"