Re: [9fans] fossil caching venti errors

2009-04-08 Thread Richard Miller
I've been seeing corruption of fossil /archive data, at a rate of about once a week. The symptom is that venti dir entries for /archive copies of some frequently referenced directories in my main fossil fs (generally it's /usr, /usr/miller or /mail/box/miller) contain incorrect venti scores.

Re: [9fans] fossil caching venti errors

2009-04-08 Thread ron minnich
I'm having a continuous problem, symptom being failures in archWalk, but had assumed it was a hard disk getting ready to die. fossil: diskReadRaw failed: /dev/sdC0/fossil: score 0x0021cac2: part=label block 2214594: illegal block address archive(0, 0x4d385643): cannot find block: error reading

Re: [9fans] fossil caching venti errors

2009-04-08 Thread C H Forsyth
I'm having a continuous problem, symptom being failures in archWalk, but had assumed it was a hard disk getting ready to die. i see those on one of my old drives, but i don't think it's the drive. i thought it might have been caused by a power failure catching out fossil or venti, but if you've

Re: [9fans] fossil caching venti errors

2009-04-08 Thread Richard Miller
archive(0, 0x4d385643): cannot find block: error reading block 0x0021cac2 archWalk 0x4d385643 failed; ptr is in 0x2075 offset 3 0x4d385643 is supposedly a block number, which seems unlikely unless you have a 10 terabyte disk. Looks like you have a VtEntry or pointer block with a corrupted

Re: [9fans] fossil caching venti errors

2009-04-08 Thread Richard Miller
ls: 0216/mail/box/cinap_lenrek/mbox: '0216/mail' does not exist ls: 0323/mail/box/cinap_lenrek/mbox: '0323/mail/box' does not exist oops... :( Schade. Is this a uni- or multi-processor?

Re: [9fans] fossil caching venti errors

2009-04-08 Thread Steve Simon
cd /n/dump/2009 for (i in *) { test -d $i$home/tmp || ls -d $i$home/tmp } for (i in *) { test -f $i/mail/box/$user/mbox || ls $i/mail/box/$user/mbox } no problems here, and my server is a dual cpu PIII. I last built a kernel on the 11th of feb so if this is a very recent but I may have

Re: [9fans] fossil caching venti errors

2009-04-08 Thread cinap_lenrek
uniprocessor machine -- cinap

Re: [9fans] fossil caching venti errors

2009-04-08 Thread Richard Miller
uniprocessor machine That eliminates some lines of investigation, thank you. could it be that the archiver for some reason is looking at rolled back blocks? I'll put in a console message when blocks are rolled back and see if there's a correlation.

Re: [9fans] fossil caching venti errors

2009-03-30 Thread Pavel Klinkovsky
md5sum: error reading /n/sources/plan9/sys/src/boot/pc/ether82563.c: venti i/o error or wrong score, block 72804bdc64d1a772cd4b2eaeda6f1e1b8f175b21 I have got similar problem with 'pull'. And the last update even ruined binaries of NDB... :-( Pavel

Re: [9fans] fossil caching venti errors

2009-03-30 Thread lucio
never mind. i think it's not a sign of the problem we were discussing, but possibly something is simply down. Then the error message needs some tidying up. It happened to me too and coincided with a total failure for replica. Sigh! ++L

Re: [9fans] fossil caching venti errors

2009-03-30 Thread erik quanstrom
On Mon Mar 30 15:10:25 EDT 2009, lu...@proxima.alt.za wrote: never mind. i think it's not a sign of the problem we were discussing, but possibly something is simply down. Then the error message needs some tidying up. It happened to me too and coincided with a total failure for replica.

Re: [9fans] fossil caching venti errors

2009-03-28 Thread Charles Forsyth
i've seen that just recently, but thought it might have been a failing (very old) drive, or a power failure beyond the endurance of the UPS. if neither of those are true in your case, it might be worth a deeper look. i also found there is a persistent problem that `check fix' won't fix. (since

Re: [9fans] fossil caching venti errors

2009-03-28 Thread david bulkow
This is likely too large a hammer, but when this happens I rebuild the venti index so that I can get past the issue. I see this more under Plan 9 than p9p. The block in error always exists in an arena and a checkarenas reports no errors. The problem usually persists across reboots until I

Re: [9fans] fossil caching venti errors

2009-03-28 Thread erik quanstrom
On Sat Mar 28 06:54:35 EDT 2009, fors...@terzarima.net wrote: i've seen that just recently, but thought it might have been a failing (very old) drive, or a power failure beyond the endurance of the UPS. if neither of those are true in your case, it might be worth a deeper look. i also found

Re: [9fans] fossil caching venti errors

2009-03-28 Thread Nathaniel W Filardo
On Sat, Mar 28, 2009 at 11:11:25AM +, Charles Forsyth wrote: i've seen that just recently, but thought it might have been a failing (very old) drive, or a power failure beyond the endurance of the UPS. if neither of those are true in your case, it might be worth a deeper look. i also

Re: [9fans] fossil caching venti errors

2009-03-28 Thread erik quanstrom
AFAIK the disk is doing just fine. Moreover, even during the period when fossil is complaining, venti/read on 9fs's score works just fine. So I don't believe the fault is venti's. i don't believe that conclusion is warranted. /sys/src/cmd/fossil/cache.c:683,684 is where this condition gets

Re: [9fans] fossil caching venti errors

2009-03-28 Thread Nathaniel W Filardo
On Sat, Mar 28, 2009 at 01:31:01PM -0400, erik quanstrom wrote: it would be interesting to know if the score of the block returned by venti/read is correct. venti/read should be doing this check automatically since libventi/client.c builds with int ventidoublechecksha1 = 1; by default. --nwf;

Re: [9fans] fossil caching venti errors

2009-03-28 Thread erik quanstrom
sorry if this is a dupe. my original reply seems to have gone missing due to a local mishap. venti/read should be doing this check automatically since libventi/client.c builds with int ventidoublechecksha1 = 1; by default. yet you're reporting that fossil thinks the score does not match.

[9fans] fossil caching venti errors

2009-03-27 Thread Nathaniel W Filardo
Entertaining. Indented lines are fossil console; nonindented are at a normal CPU prompt. cpu% 9fs 9fs: venti i/o error or wrong score, block 558b88fbae4e0aa894c614fb3eeccf4d2f7492ca main: venti tcp!xxx.xxx.xxx.xxx!x cpu% 9fs 9fs: venti i/o error or wrong score, block