Re: dump trying to access incorrect block numbers?

2017-07-08 Thread Rodney W. Grimes
> On 07/08/17 12:28, Rodney W. Grimes wrote:
> 
> > Since it has been speculated that this is occuring during the
> > creation of the snapshot, could you try just creating a snapshot
> > using mksnap_ffs and see if any errors occur?
> 
> After a short pause with disk activity, the same sorts of errors are 
> logged when using "mksnap_ffs /.snap2" where .snap2 did  not previously 
> exist,

Ok, so this simplifies what needs to be looked at, this is no
longer a dump(8) issue, but now a snapshot ufs issue.

This should be much easier for more people to try out.  

-- 
Rod Grimes rgri...@freebsd.org
___
freebsd-current@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to "freebsd-current-unsubscr...@freebsd.org"


Re: dump trying to access incorrect block numbers?

2017-07-08 Thread Mark Millard
[A normal multi-user boot's fsck activity can do
 fsck -B activity that gets the problem.]

On 2017-Jul-8, at 9:45 AM, Mark Millard  wrote:

> [I add notes about a problem that happens after the
> "fsck -B". Also  forgot to mention: production style
> kernel world builds were in use. And a tried a
> powerpc64 build and it works the same.]
> 
> On 2017-Jul-7, at 11:09 PM, Mark Millard  wrote:
> 
>> [This note has more information than one sent with extra text
>> in the subject but with a partially different "to" list.]
>> 
>> Peter Jeremy peter at rulingia.com wrote on
>> Sat Jul 8 02:00:47 UTC 2017 :
>> 
>>> When did you first notice this (what SVN revision)?
>>> Do you know what the last good SVN revision was?
>>> Is this a new or old filesystem?
>>> Is the filesystem mounted/active or not when you dump it?
>>> What are the relevant parameters for the filesystem on ada0s3a?
>>> Are you running softupdates, journalling etc?
>>> Which dump(8) phase is reporting the errors?
>>> What are the exact dump and fsck commands you ran?
>> 
>> I can add a little information with some contrast
>> and only "fsck -B" in use (with an unclean file
>> system from a prior crash), no dump use. Still:
>> a snapshot is involved in the below.
>> 
>> Unfortunately two problems with major consequences
>> for my involved context limit the svn range that I
>> can cover for the activity, the problem version
>> ranges being:
>> 
>> -r319722 through -r320651 (fixed by -r320652)
>> (actually this is why I had used "boot -s" 
>> in what I report later: I could get to a
>> shell prompt that way instead of crashing
>> before any login prompt; the crashes left
>> the file system in need of repair)
>> 
>> -r320509 through -r320561 (fixed by -r320570)
>> 
>> So I was using -r320570 to avoid one of the
>> two problems.
>> 
>> 
>> 
>> Context: 32-bit powerpc FreeBSD used on PowerMac G5
>> so-called "Quad-core". (So big-endian as well.)
>> Softupdates, no journalling. Long-in-use file
>> system having lots of FreeBSD versions updates
>> and port rebuilds over the time.
>> 
>> The following is from now, not from the time of the
>> example messages:
>> 
>> # dumpfs / | more
>> magic   19540119 (UFS2) timeFri Jul  7 22:53:34 2017
>> superblock location 65536   id  [  ]
>> ncg 158 size25165823blocks  24372006
>> bsize   32768   shift   15  mask0x8000
>> fsize   4096shift   12  mask0xf000
>> frag8   shift   3   fsbtodb 3
>> minfree 8%  optim   timesymlinklen 120
>> maxbsize 32768  maxbpg  4096maxcontig 4 contigsumsize 4
>> nbfree  2130375 ndir65518   nifree  11769796nffree  425065
>> bpg 20032   fpg 160256  ipg 80128   unrefs  0
>> nindir  4096inopb   128 maxfilesize 2252349704110079
>> sbsize  4096cgsize  32768   csaddr  5048cssize  4096
>> sblkno  24  cblkno  32  iblkno  40  dblkno  5048
>> cgrotor 127 fmod0   ronly   0   clean   0
>> metaspace 6408  avgfpdir 64 avgfilesize 16384
>> flags   soft-updates trim 
>> fsmnt   /
>> volname FBSDG4Srootfs   swuid   0   providersize25165823
>> . . .
>> 
>> 
>> 
>> What I had done that produced the messages was:
>> 
>> > leaves root (only) file system not marked clean
>> so fsck -B will actually do something below>
>> 
>> boot -s (so: single user mode)
>> # The next 3 lines are the content of a generic, manually-run script.
>> mount -u /
>> mount -a -t ufs (but there is no other file system)
>> swapon -a   (there is a swap partition)
>> #
>> fsck -B
>> 
>> That "fsck -B" caused the same kinds of lines
>> reported by Michael Butler, happening as fsck
>> makes a snapshot for the background processing
>> to use. (I have camera pictures and could type
>> in some of the lines if needed.)
>> 
>> After those lines was text like (typed in from
>> an example camera picture):
>> 
>> ** //.snap/fsck_snapshot
>> ** Last Mount on /
>> ** Root file system
>> ** Phase 1 - Check Blocks and Sizes
>> ** Phase 2 - Check Pathnames
>> ** Phase 3 - Check Connectivity
>> ** Phase 4 - Check Reference Counts
>> ** Phase 5 - Check Cyl groups
>> Reclaimed: 0 directories, 1 files, 22680 fragments
>> 780914 files, 4797127 used, 19552199 free (443479 frags, 3288590 blocks, 
>> 1.8% fragmentation)
>> 
>> * FILE SYSTEM MARKED CLEAN *
> 
> [I forgot or mention that the context was a
> production style kernel and world build,
> no invariants or other such.]
> 
> Since I'm running a patched -r320570 for the
> issue:
> 
> -r319722 through -r320651 (fixed by -r320652)
> 
> I went back and forced a power-off without
> shutdown and did the sequence:
> 
> boot -s (so: single user mode)
> # The next 3 lines are the content of a generic, manually-run script.
> mount -u /
> mount -a -t ufs (but there is no other file system)
> swapon -a   (there is a swap partition)
> #
> fsck -B
> 
> but always waited briefly after the fsck -B finished.
> 
> Like before the 

Re: dump trying to access incorrect block numbers?

2017-07-08 Thread Mark Millard
[I add notes about a problem that happens after the
"fsck -B". Also  forgot to mention: production style
kernel world builds were in use. And a tried a
powerpc64 build and it works the same.]

On 2017-Jul-7, at 11:09 PM, Mark Millard  wrote:

> [This note has more information than one sent with extra text
> in the subject but with a partially different "to" list.]
> 
> Peter Jeremy peter at rulingia.com wrote on
> Sat Jul 8 02:00:47 UTC 2017 :
> 
>> When did you first notice this (what SVN revision)?
>> Do you know what the last good SVN revision was?
>> Is this a new or old filesystem?
>> Is the filesystem mounted/active or not when you dump it?
>> What are the relevant parameters for the filesystem on ada0s3a?
>> Are you running softupdates, journalling etc?
>> Which dump(8) phase is reporting the errors?
>> What are the exact dump and fsck commands you ran?
> 
> I can add a little information with some contrast
> and only "fsck -B" in use (with an unclean file
> system from a prior crash), no dump use. Still:
> a snapshot is involved in the below.
> 
> Unfortunately two problems with major consequences
> for my involved context limit the svn range that I
> can cover for the activity, the problem version
> ranges being:
> 
> -r319722 through -r320651 (fixed by -r320652)
> (actually this is why I had used "boot -s" 
> in what I report later: I could get to a
> shell prompt that way instead of crashing
> before any login prompt; the crashes left
> the file system in need of repair)
> 
> -r320509 through -r320561 (fixed by -r320570)
> 
> So I was using -r320570 to avoid one of the
> two problems.
> 
> 
> 
> Context: 32-bit powerpc FreeBSD used on PowerMac G5
> so-called "Quad-core". (So big-endian as well.)
> Softupdates, no journalling. Long-in-use file
> system having lots of FreeBSD versions updates
> and port rebuilds over the time.
> 
> The following is from now, not from the time of the
> example messages:
> 
> # dumpfs / | more
> magic   19540119 (UFS2) timeFri Jul  7 22:53:34 2017
> superblock location 65536   id  [  ]
> ncg 158 size25165823blocks  24372006
> bsize   32768   shift   15  mask0x8000
> fsize   4096shift   12  mask0xf000
> frag8   shift   3   fsbtodb 3
> minfree 8%  optim   timesymlinklen 120
> maxbsize 32768  maxbpg  4096maxcontig 4 contigsumsize 4
> nbfree  2130375 ndir65518   nifree  11769796nffree  425065
> bpg 20032   fpg 160256  ipg 80128   unrefs  0
> nindir  4096inopb   128 maxfilesize 2252349704110079
> sbsize  4096cgsize  32768   csaddr  5048cssize  4096
> sblkno  24  cblkno  32  iblkno  40  dblkno  5048
> cgrotor 127 fmod0   ronly   0   clean   0
> metaspace 6408  avgfpdir 64 avgfilesize 16384
> flags   soft-updates trim 
> fsmnt   /
> volname FBSDG4Srootfs   swuid   0   providersize25165823
> . . .
> 
> 
> 
> What I had done that produced the messages was:
> 
>  leaves root (only) file system not marked clean
> so fsck -B will actually do something below>
> 
> boot -s (so: single user mode)
> # The next 3 lines are the content of a generic, manually-run script.
> mount -u /
> mount -a -t ufs (but there is no other file system)
> swapon -a   (there is a swap partition)
> #
> fsck -B
> 
> That "fsck -B" caused the same kinds of lines
> reported by Michael Butler, happening as fsck
> makes a snapshot for the background processing
> to use. (I have camera pictures and could type
> in some of the lines if needed.)
> 
> After those lines was text like (typed in from
> an example camera picture):
> 
> ** //.snap/fsck_snapshot
> ** Last Mount on /
> ** Root file system
> ** Phase 1 - Check Blocks and Sizes
> ** Phase 2 - Check Pathnames
> ** Phase 3 - Check Connectivity
> ** Phase 4 - Check Reference Counts
> ** Phase 5 - Check Cyl groups
> Reclaimed: 0 directories, 1 files, 22680 fragments
> 780914 files, 4797127 used, 19552199 free (443479 frags, 3288590 blocks, 1.8% 
> fragmentation)
> 
> * FILE SYSTEM MARKED CLEAN *

[I forgot or mention that the context was a
production style kernel and world build,
no invariants or other such.]

Since I'm running a patched -r320570 for the
issue:

-r319722 through -r320651 (fixed by -r320652)

I went back and forced a power-off without
shutdown and did the sequence:

boot -s (so: single user mode)
# The next 3 lines are the content of a generic, manually-run script.
mount -u /
mount -a -t ufs (but there is no other file system)
swapon -a   (there is a swap partition)
#
fsck -B

but always waited briefly after the fsck -B finished.

Like before the following happens as it tries to trim:
(typed in from camera picture)

panic: ffs_blkfree_cq: freeing free block
cpuid = 2 (varies, of course)
time = (varies)
KDB: stack backtrace
(stack addresses can vary: just an example here)
0xd23b17e0: at kdb_backtrace+0x5c
0xd23b1850: at vpanic+0x1e8

Re: dump trying to access incorrect block numbers?

2017-07-08 Thread Rodney W. Grimes
> On 07/07/17 21:53, Peter Jeremy wrote:
> > On 2017-Jul-07 10:44:36 -0400, Michael Butler  
> > wrote:
> >> Recent builds doing a backup (dump) cause nonsensical errors in syslog:
> > 
> > I can't directly offer any ideas but some more background might help:
> > When did you first notice this (what SVN revision)?
> 
> I was stuck on SVN r319721 on the i386 machine while the socket/union 
> issue was addressed. That version did not display the problem.
> 
> > Do you know what the last good SVN revision was?
> > Is this a new or old filesystem?
> 
> old - it's been years since this system was rebuilt.
> 
> > Is the filesystem mounted/active or not when you dump it?
> 
> Mounted and active.
> 
> > What are the relevant parameters for the filesystem on ada0s3a?
> 
> imb@toshi:/home/imb> dumpfs /
> 
> magic   19540119 (UFS2) timeFri Jul  7 22:43:49 2017
> superblock location 65536   id  [ 56c8bf68 1a8b12b5 ]
> ncg 516 size82575360blocks  79978821
> bsize   32768   shift   15  mask0x8000
> fsize   4096shift   12  mask0xf000
> frag8   shift   3   fsbtodb 3
> minfree 8%  optim   timesymlinklen 120
> maxbsize 32768  maxbpg  4096maxcontig 4 contigsumsize 4
> nbfree  3965346 ndir98169   nifree  40196026nffree  453383
> bpg 20035   fpg 160280  ipg 80256   unrefs  0
> nindir  4096inopb   128 maxfilesize 2252349704110079
> sbsize  4096cgsize  32768   csaddr  5056cssize  12288
> sblkno  24  cblkno  32  iblkno  40  dblkno  5056
> cgrotor 253 fmod0   ronly   0   clean   0
> metaspace 6408  avgfpdir 64 avgfilesize 16384
> flags   soft-updates
> fsmnt   /
> volname swuid   0   providersize82575360
> 
>   [ .. ]
> 
> > Are you running softupdates, journalling etc?
> 
> soft-updates only.
> 
> > Which dump(8) phase is reporting the errors?
> 
> The errors occur before the "date of the last level x dump" message - 
> presumably, this is while creating the snapshot.
> 
> > What are the exact dump and fsck commands you ran?
> 
> /sbin/dump 0Lauf - -C 32 /
> 
> none of the following report any (unexpected) errors:
> 
> fsck -f /
> fsck -f -r /
> fsck -f -Z /
> 
> > 
> >> I now have two UFS-based systems showing the same symptoms - what's up
> >> with this?
> > 
> > Was there anything you did on either filesystem that might have triggered 
> > it?
> 
> Other than update the kernel, no.

Since it has been speculated that this is occuring during the
creation of the snapshot, could you try just creating a snapshot
using mksnap_ffs and see if any errors occur?


-- 
Rod Grimes rgri...@freebsd.org
___
freebsd-current@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to "freebsd-current-unsubscr...@freebsd.org"


Re: dump trying to access incorrect block numbers?

2017-07-08 Thread Michael Butler

On 07/08/17 12:28, Rodney W. Grimes wrote:


Since it has been speculated that this is occuring during the
creation of the snapshot, could you try just creating a snapshot
using mksnap_ffs and see if any errors occur?


After a short pause with disk activity, the same sorts of errors are 
logged when using "mksnap_ffs /.snap2" where .snap2 did  not previously 
exist,


Michael


___
freebsd-current@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to "freebsd-current-unsubscr...@freebsd.org"


pkg, repos and cached certificates

2017-07-08 Thread Boris Samorodov
Hi All,

I use own pkg base repository. Today I tried to install packages
on a new system. However, pkg refused to use my repo, since it's
certificate is expired (several days). That's good.

However, pkg happily update packages at other systems. So, seems
that pkg does not perform checks / validates known certificates.
Which seems not as good.

Thanks,
-- 
WBR, bsam
___
freebsd-current@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to "freebsd-current-unsubscr...@freebsd.org"


[memstick install] auto-zfs error

2017-07-08 Thread Boris Samorodov
Hi All,

I tied to install a new FreeBSD-amd-12 system from official USB
installation memstick.img. Auto-UFS (GPT) installs fine and the system
boots fine. However, ZFS-Auto install succeeds, but is not loaded.
At the very beginning it gives something like "gpt sector  error,
gpt sector 1 error, can't find zroot..."

Is it a known error / should I give more (precise) errors?

I tried two recent images with the same result:
ftp://ftp.freebsd.org/pub/FreeBSD/snapshots/ISO-IMAGES/12.0/FreeBSD-12.0-CURRENT-amd64-20170703-r320599-memstick.img
ftp://ftp.freebsd.org/pub/FreeBSD/snapshots/ISO-IMAGES/12.0/FreeBSD-12.0-CURRENT-amd64-20170626-r320360-memstick.img

Thanks all,
-- 
WBR, bsam
___
freebsd-current@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to "freebsd-current-unsubscr...@freebsd.org"


Re: dump trying to access incorrect block numbers?

2017-07-08 Thread Mark Millard
[This note has more information than one sent with extra text
in the subject but with a partially different "to" list.]

Peter Jeremy peter at rulingia.com wrote on
Sat Jul 8 02:00:47 UTC 2017 :

> When did you first notice this (what SVN revision)?
> Do you know what the last good SVN revision was?
> Is this a new or old filesystem?
> Is the filesystem mounted/active or not when you dump it?
> What are the relevant parameters for the filesystem on ada0s3a?
> Are you running softupdates, journalling etc?
> Which dump(8) phase is reporting the errors?
> What are the exact dump and fsck commands you ran?

I can add a little information with some contrast
and only "fsck -B" in use (with an unclean file
system from a prior crash), no dump use. Still:
a snapshot is involved in the below.

Unfortunately two problems with major consequences
for my involved context limit the svn range that I
can cover for the activity, the problem version
ranges being:

-r319722 through -r320651 (fixed by -r320652)
(actually this is why I had used "boot -s" 
in what I report later: I could get to a
shell prompt that way instead of crashing
before any login prompt; the crashes left
the file system in need of repair)

-r320509 through -r320561 (fixed by -r320570)

So I was using -r320570 to avoid one of the
two problems.



Context: 32-bit powerpc FreeBSD used on PowerMac G5
so-called "Quad-core". (So big-endian as well.)
Softupdates, no journalling. Long-in-use file
system having lots of FreeBSD versions updates
and port rebuilds over the time.

The following is from now, not from the time of the
example messages:

# dumpfs / | more
magic   19540119 (UFS2) timeFri Jul  7 22:53:34 2017
superblock location 65536   id  [  ]
ncg 158 size25165823blocks  24372006
bsize   32768   shift   15  mask0x8000
fsize   4096shift   12  mask0xf000
frag8   shift   3   fsbtodb 3
minfree 8%  optim   timesymlinklen 120
maxbsize 32768  maxbpg  4096maxcontig 4 contigsumsize 4
nbfree  2130375 ndir65518   nifree  11769796nffree  425065
bpg 20032   fpg 160256  ipg 80128   unrefs  0
nindir  4096inopb   128 maxfilesize 2252349704110079
sbsize  4096cgsize  32768   csaddr  5048cssize  4096
sblkno  24  cblkno  32  iblkno  40  dblkno  5048
cgrotor 127 fmod0   ronly   0   clean   0
metaspace 6408  avgfpdir 64 avgfilesize 16384
flags   soft-updates trim 
fsmnt   /
volname FBSDG4Srootfs   swuid   0   providersize25165823
. . .



What I had done that produced the messages was:



boot -s (so: single user mode)
# The next 3 lines are the content of a generic, manually-run script.
mount -u /
mount -a -t ufs (but there is no other file system)
swapon -a   (there is a swap partition)
#
fsck -B

That "fsck -B" caused the same kinds of lines
reported by Michael Butler, happening as fsck
makes a snapshot for the background processing
to use. (I have camera pictures and could type
in some of the lines if needed.)

After those lines was text like (typed in from
an example camera picture):

** //.snap/fsck_snapshot
** Last Mount on /
** Root file system
** Phase 1 - Check Blocks and Sizes
** Phase 2 - Check Pathnames
** Phase 3 - Check Connectivity
** Phase 4 - Check Reference Counts
** Phase 5 - Check Cyl groups
Reclaimed: 0 directories, 1 files, 22680 fragments
780914 files, 4797127 used, 19552199 free (443479 frags, 3288590 blocks, 1.8% 
fragmentation)

* FILE SYSTEM MARKED CLEAN *


===
Mark Millard
markmi at dsl-only.net

___
freebsd-current@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to "freebsd-current-unsubscr...@freebsd.org"