Re: filesystem corruption ?

2003-03-21 Thread Bernd Schubert
On Friday 21 March 2003 08:32, you wrote:
 Hello!

 On Thu, Mar 20, 2003 at 07:23:48PM +0100, Bernd Schubert wrote:
   Hm, interesting.
   And what are the differences? How big are they?
 
  Since it are binaries files, a colleague had the idea to use hexdump and
  diff, so the command for the attached file was:
  diff (hexdump /worka/gdb) (hexdump /usr/bin/gdb)|sort -k 2 gdb.diff
  So the lines beginning with '' are from working gdb and lines beginning
  with '' are from corrupted gdb. When you look into the diff-file you
  will see, that only some bits per line have changed.

 I see.
 Basically you have two pages of data corrupted.
 And the corruption indeed looks like bit corruption.
 How about rebooting that box and checking if corruption pattern changes?
 Also I'd recommend you to run memtext86 for some time as this looks like
 bad memory pattern.

All of our machines have to pass a full memtest86 checking before we intend to 
use them - this machine is about 3 weeks old, of course it also had to run 
this test and furthermore it has ECC-memory.


   Any events happening between morning backup and time of problem
   discovery?
 
  Except, that I recompiled a kernel and we installed some programs using
  aptitude (its a debian system), nothing happend to the filesystem. There
  was also no reboot, no crash, etc.
  Update: The corruption probably happend at 15:48, since at this time also
  a xchat on one of the clients crashed and this was noticed by us at
  first. The xchat binary was also affected by the corruption.

 So, the beam of X-rays run through the memory module corrupting some bits?

There is the 'Environmental Physics Institut' in the floor below us and since 
we currently have an extremely high hardware failure rate, I have been joking 
for some time that they might be causing it (I believe they are indeed using 
x-ray beams). I should really ask them if their constructions are shielded 
properly ;-)

 ;) This stuff should not have been written to disk, so probably
 plain reboot should fix everything? Can you test that?

Yes of course, if something goes wrong we still have our fall back machine :-)

I will report in the afternoon if it worked.

Best regards,
Bernd


Re: filesystem corruption ?

2003-03-21 Thread Bernd Schubert
Hi,

 So, the beam of X-rays run through the memory module corrupting some bits?
 ;) This stuff should not have been written to disk, so probably
 plain reboot should fix everything? Can you test that?


indeed after rebooting everything is fine again. We will run another memtest86 
during the weekend, though I really don't believe we will find a problem.

Though this machine will be replaced by a real server in a few month, I'm 
still rather worried what happend. Even if its 'only' a hardware memory 
problem this means lots of trouble for us -- on the one hand it seems not to 
be memtest86 detectable and on the other hand our programs really do need 
working memory, but of course this is not of your concern.


Thanks for your help,
Bernd


Re: filesystem corruption ?

2003-03-21 Thread Oleg Drokin
Hello!

On Fri, Mar 21, 2003 at 02:01:38PM +0100, Bernd Schubert wrote:
  So, the beam of X-rays run through the memory module corrupting some bits?
  ;) This stuff should not have been written to disk, so probably
  plain reboot should fix everything? Can you test that?
 indeed after rebooting everything is fine again. We will run another memtest86 

So on-disk corruption is out of question.

 during the weekend, though I really don't believe we will find a problem.

Ask those physics guys to run some X-ray experiments while you are running memtest86 ;)

 Though this machine will be replaced by a real server in a few month, I'm 
 still rather worried what happend. Even if its 'only' a hardware memory 
 problem this means lots of trouble for us -- on the one hand it seems not to 
 be memtest86 detectable and on the other hand our programs really do need 

Well, it may be not detectable because no high-enerty beams are running around at
the time of test.

 working memory, but of course this is not of your concern.

I've learn in the school that if you put some bit amount of plumbum in between
some area and source of radiation, chances are radiation that will reach the
protected area will be of much lesser strenght.
In fact you might go to those guys and ask them what matherial (and how much of it)
is best suited to shield against stuff they generate.

Bye,
Oleg


Re: reiserfsck --rebuild-tree bug (loops forever)

2003-03-21 Thread Baldur Norddahl
Hi,

3.6.5 asked me to run --rebuild-sb and then --rebuild-tree. That worked,
and the filesystem is repaired.

Oleg Drokin suggested that I might be running a version that was not
compiled on the same server. I was actually using the version that
debian provided for me.

The namesys.com web site is still advertizing 3.6.4 as the latest
version, maybe that should be corrected.

Thanks,

Baldur

On Fri, 2003-03-21 at 10:37, Philippe Gramoullé wrote:
 Hi,
 
 3.6.4 is not the latest reiserfsck version.
 
 3.6.5 [1] is the latest and several bugs have been fixed since.
 
 Could you please retry with reiserfsck 3.6.5 ?
 
 Thanks,
 
 Philippe
 
 [1] Reiserfprogs : ftp://ftp.namesys.com/pub/reiserfsprogs/reiserfsprogs-3.6.5.tar.gz
 
 
 On 21 Mar 2003 10:31:24 +0100
 Baldur Norddahl [EMAIL PROTECTED] wrote:
 
   | dawn:~# reiserfsck -V
   |  reiserfsck 3.6.4, 2002
-- 
Baldur Norddahl [EMAIL PROTECTED]



Re: filesystem corruption ?

2003-03-21 Thread Bernd Schubert
 I've learn in the school that if you put some bit amount of plumbum in
 between some area and source of radiation, chances are radiation that will
 reach the protected area will be of much lesser strenght.
 In fact you might go to those guys and ask them what matherial (and how
 much of it) is best suited to shield against stuff they generate.

We already discussed during the lunch time to order somthing like this for our 
systems ;-) (would be a rather strange order for a usual computer company, 
wouldn't it ?)
But in fact, I'm now really going to contact  the those guys and ask if they 
have some stuff to detect their beams.

Have a nice weekend,
Bernd


[SPAM]

2003-03-21 Thread
 Start SpamAssassin results
18.40 points, 8 required;
*  0.9 -- From: ends in numbers
*  0.3 -- Message-Id has no @ sign
*  1.0 -- BODY: Body includes 8 consecutive 8-bit characters
*  4.0 -- BODY: Written in an undesired language
*  1.0 -- BODY: Message is 60% to 70% HTML
*  3.2 -- BODY: Character set indicates a foreign language
*  0.3 -- BODY: HTML mail with non-white background
*  0.3 -- BODY: HTML font color is red
*  0.4 -- BODY: HTML font color is green
*  0.0 -- BODY: HTML included in message
*  0.2 -- BODY: JavaScript code
*  0.4 -- BODY: Frontpage used to create the message
*  0.2 -- BODY: Includes a URL link to send an email
*  0.6 -- URI: Includes a link to a likely spammer email address
*  2.1 -- A foreign language charset used in headers
*  1.0 -- Forged mail pretending to be from MS Outlook
*  1.0 -- Message only has text/html MIME parts
*  1.0 -- From an address that is all numbers (non-phone)
*  0.5 -- A foreign language charset used in HTML markup

 End of SpamAssassin results

The original message did not contain plain text, and may be unsafe to
open with some email clients; in particular, it may contain a virus,
or confirm that your address can receive spam.  If you wish to view
it, it may be safer to save it to a file and open it with an editor.

---BeginMessage---
Title: 







  
  

  










  


  
  


[EMAIL PROTECTED] 

GO  
 
 
 
 
 
 
 
 
 
 
---End Message---


Re: filesystem corruption ?

2003-03-21 Thread Russell Coker
On Fri, 21 Mar 2003 14:07, Oleg Drokin wrote:
 I've learn in the school that if you put some bit amount of plumbum in

It's better known in English as lead.

The problem with lead is that it's poisonous and soft.  Having to wash your 
hands after touching your computer could get annoying.

Other metals such as copper and steel will reduce the radiation and can also 
be used for protection against mechanical damage.

The best way to reduce radiation is by distance.  The inverse-square law 
applies, so moving the computer further away from the experiment will reduce 
the radiation more easily than anything else you may do.  One thing to 
consider is disk-less X-term machines for if you need to operate a computer 
from near the experiment, so if the X-term crashed from radiation then your 
server with the data should continue running correctly.

-- 
http://www.coker.com.au/selinux/   My NSA Security Enhanced Linux packages
http://www.coker.com.au/bonnie++/  Bonnie++ hard drive benchmark
http://www.coker.com.au/postal/Postal SMTP/POP benchmark
http://www.coker.com.au/~russell/  My home page



reiserfsck block is in the tree already error

2003-03-21 Thread Alex Malinovich
I've been trying to rebuild the tree on a reiserfs partition that seems
to have gotten corrupted during a resize operation. Unfortunately, I've
gone from having one or two directories not accessible to having the
whole partition not accessible now. Everything seems to go ok until pass
2. At this point I start getting this error:

do_pass_2: The block (4901823) is in the tree already. Should not
happen.

This keeps coming up as fast as the terminal will display it. It seems
to be an endless loop as leaving it on for 4 hours resulted in an
essentially infinite number of error messages with no corresponding disk
activity. I'm running reiserfsck with the following command line:

reiserfsck -z -S --rebuild-sb --rebuild-tree /dev/hda3

I'm pretty sure that -z and --rebuild-sb are more or less unnecessary,
but I'm trying to be thorough. Running just

reiserfsck --rebuild-tree /dev/hda3

results in a similar error message. I don't remember the exact wording
of it, but it is along the lines of, marked as a leaf but is not a
leaf.

Any suggestions are very welcome. I'd certainly hate to lose all of this
data.

p.s. My system is Debian GNU/Linux Unstable, running a 2.4.20 kernel.

-- 
Alex Malinovich
Support Free Software, delete your Windows partition TODAY!
Encrypted mail preferred. You can get my public key from any of the
pgp.net keyservers. Key ID: A6D24837


signature.asc
Description: This is a digitally signed message part


Re: reiserfsck block is in the tree already error

2003-03-21 Thread Vitaly Fertman

Hi, 

On Friday 21 March 2003 20:05, Alex Malinovich wrote:
 I've been trying to rebuild the tree on a reiserfs partition that seems
 to have gotten corrupted during a resize operation. Unfortunately, I've
 gone from having one or two directories not accessible to having the
 whole partition not accessible now. Everything seems to go ok until pass
 2. At this point I start getting this error:

 do_pass_2: The block (4901823) is in the tree already. Should not
 happen.

It seems that reiserfsck was built on another computer, which has some 
other libs version, not statically. Would you try to rebuild it and try 
again.

-- 

Thanks,
Vitaly Fertman


[SPAM] ** You are approved.

2003-03-21 Thread zp26ciix
 Start SpamAssassin results
13.80 points, 8 required;
*  1.0 -- Subject talks about being approved
*  1.3 -- From: does not include a real name
*  1.0 -- Bulk email software fingerprint (eGroups) found in headers
*  1.4 -- BODY: Incorporates a tracking ID number
*  0.0 -- BODY: HTML included in message
*  1.0 -- BODY: Message is 40% to 50% HTML
*  0.2 -- BODY: HTML font color is missing hash (
*  0.4 -- RAW: MIME section missing boundary
*  1.4 -- RAW: Message text disguised using base-64 encoding
*  0.6 -- URI: Uses %-escapes inside a URL's hostname
*  1.0 -- URI: Completely unnecessary %-escapes inside a URL
*  2.5 -- Date: is 24 to 48 hours after Received: date
*  1.0 -- Message-Id is fake (in Outlook Express format)
*  1.0 -- Message only has text/html MIME parts

 End of SpamAssassin results

The original message did not contain plain text, and may be unsafe to
open with some email clients; in particular, it may contain a virus,
or confirm that your address can receive spam.  If you wish to view
it, it may be safer to save it to a file and open it with an editor.

---BeginMessage---


  albumin , water-soluble blood protein  
Your home refinance loan is approved!
To get your approved amount go
here.

To be excluded from further notices go
here.
  albumin , water-soluble blood protein  

1gate

8451KaEE0-223THUC8417XagP7-821YQee4045Vnyt3-248l44---End Message---


Re: datalogging and quota patches port to 2.4.21-pre6

2003-03-21 Thread Manuel Krause
On 03/20/2003 04:13 PM, Oleg Drokin wrote:
Hello!

   Ok, so I ported Chris' patches to 2.4.21-pre6 (should appear on your
   kernel.org mirror soon, have appeared in bk already).
   Patches are at
   ftp://namesys.com/pub/reiserfs-for-2.4/testing/data-logging-and-quota-2.4.21-pre6
   They are intend to replace similar named patches from Chris' ftp directory at
   ftp://ftp.suse.com/pub/people/mason/patches/data-logging/2.4.21
   Also if you want to try these patches with 2.4.21-pre5-ac3,
   you only need to apply my 2.4.21-pre6 versions of 
05-2.4.21-pre6-data-logging-36.diff.gz
   and 08-2.4.21-pre6-reiserfs-quota-26.diff.gz (but I have not tried this 
configuration yet).
   Enjoy.

Bye,
Oleg
Hi Oleg,

GRIN !!!  I really would like to enjoy these, but you seem to be far 
away in future. Still no official patch seems to be out -- at least 
for me on here. (Yes, I'll wait and would not use bk.)

Have your previously pending patches been included into 2.4.21-pre6
(they are: 02-trivial1.diff
   03-more-mount-checks.diff
   01-journal-overflow-fix.diff
 fromftp://ftp.namesys.com/pub/reiserfs-for-2.4/testing :
  iget5_locked_for_2.4.21-pre5-datalogging.diff.gz
 and posted to our ML @ Wed, 12 Mar 2003 10:46:42 +0300 :
direct-io-fix-II.diff
) ?
My question only results from beeing unable to get them all applied to 
2.4.21-pre5 with data-logging without rejects and me feeling unsafe to 
adjust them manually.

Thanks,

Manuel

--
If we'll never find peace, freedom and democracy with our souls -- how 
can our world? We only need to begin. Yes, first step first, in peace...



Re: reiserfsck block is in the tree already error

2003-03-21 Thread Alex Malinovich
On Fri, 2003-03-21 at 11:14, Vitaly Fertman wrote:
 Hi, 
 
 On Friday 21 March 2003 20:05, Alex Malinovich wrote:
  I've been trying to rebuild the tree on a reiserfs partition that seems
  to have gotten corrupted during a resize operation. Unfortunately, I've
  gone from having one or two directories not accessible to having the
  whole partition not accessible now. Everything seems to go ok until pass
  2. At this point I start getting this error:
 
  do_pass_2: The block (4901823) is in the tree already. Should not
  happen.
 
 It seems that reiserfsck was built on another computer, which has some 
 other libs version, not statically. Would you try to rebuild it and try 
 again.

I just downloaded the source and built it and it worked! Thank you VERY
much!

-- 
Alex Malinovich
Support Free Software, delete your Windows partition TODAY!
Encrypted mail preferred. You can get my public key from any of the
pgp.net keyservers. Key ID: A6D24837


signature.asc
Description: This is a digitally signed message part