[CentOS] XFS-filesystem corrupted by defragmentation Was: Performance problems with XFS on Centos 5.4

2010-04-13 Thread Bernhard Gschaider

Before trying to defragment my whole filesystem (see the attached mail
for the whole story) I figured: let's try it on a single file first.

So I did

 xfs_bmap /raid/Temp/someDiskimage.iso
[output shows 101 extents and 1 hole]

Then I defragmented the file
 xfs_fsr /raid/Temp/someDiskimage.iso
extents before:101 after:3 DONE

 xfs_bmap /raid/Temp/someDiskimage.iso
[output shows 3 extents and 1 hole]

And now comes the bummer: I wanted to check the fragmentation of the
whole filesystem (just to be sure):

 xfs_db -r /dev/mapper/VolGroup00-LogVol04
xfs_db: unexpected XFS SB magic number 0x
xfs_db: read failed: Invalid argument
xfs_db: data size check failed
cache_node_purge: refcount was 1, not zero (node=0x2a25c20)
xfs_db: cannot read root inode (22)

THAT output was definitely not there when I did this the last time, and
therefore the new fragmentation report does not make me happy either:

xfs_db> frag
actual 0, ideal 0, fragmentation factor 0.00%

The file-system is still mounted and working and I don't dare to do
anything about it (am in a mild state of panic) because I think it
might not come back if I do.

Any suggestions most welcome (am googling myself before I do anything
about it).

I swear to god: I did not do anything else with the xfs_*-commands
than the stuff mentioned above

Bernhard

---BeginMessage---

 On Fri, 9 Apr 2010 10:59:02 -0400
 RW == Ross Walker rswwal...@gmail.com wrote:

RW On Apr 9, 2010, at 9:59 AM, Bernhard Gschaider
RW bgschaid_li...@ice-sf.at wrote:

 Hi!
 
 During the last few weeks I have experienced performance problems
 with a large XFS file-system. Sometimes, for instance, ls is
 painfully slow. Immediately afterwards, ls on the same directory
 is instantaneous. I used strace on this ls and found that during
 the first ls the lstat calls need approx. 0.02s each, while
 during the second ls they are two orders of magnitude faster.
 
 Googling around I stumbled upon some messages similar to this one
 
 http://www.opensubscriber.com/message/linux-...@oss.sgi.com/1355060.html
 
 which have in common that a) they're from around 2006 and b) they
 suggest increasing a mount option, ihashsize. This mount option
 is listed as deprecated in the current kernel-doc.
 
 So my question: does anyone have experience with this kind of
 performance problem? Do you think it is an XFS problem, or are
 there some other tuning parameters in the kernel that could be
 modified, for instance via /proc?
 
 The reason why I'm asking here is that it is a production
 file-system, so I would be very unpopular if I experimented too
 much (a couple of reboots is OK ;) )
 
 Bernhard
 
 PS: the situation got worse during the last weeks as the
 file-system grew in size, so the possibility that some kind of
 buffer is now too small and I'm experiencing some kind of
 thrashing seems very likely to me

RW Are you defragging the file system regularly?

Oops. Never occurred to me (fragmentation is so Windoze).
Had a look:

xfs_db> frag
actual 6349355, ideal 4865683, fragmentation factor 23.37%

This seems significant.
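
The percentage can be cross-checked from the two extent counts. The following is a minimal sketch, assuming the factor is computed as (actual - ideal) / actual, which is consistent with the figures quoted above; the function name frag_factor is made up for this illustration.

```shell
#!/bin/sh
# Recompute the fragmentation factor from the two extent counts that
# the frag command prints.
# Assumption: factor = (actual - ideal) / actual, expressed in percent.
frag_factor() {
    actual=$1
    ideal=$2
    awk -v a="$actual" -v i="$ideal" 'BEGIN {
        # Avoid a division by zero when no extents were counted at all.
        if (a <= 0) { print "0.00%"; exit }
        printf "%.2f%%\n", (a - i) * 100 / a
    }'
}

frag_factor 6349355 4865683
```

With the counts from this message the sketch prints 23.37%, the same value that xfs_db reported.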

RW How much memory do you have in the system and how big is the
RW file system?

Memory on the system is 4 GB (2 dual-core Xeons). The filesystem is
3.5 TB, of which 740 GB are used. That is the maximum amount used
during the one year the filesystem has been in use (which is why the
high fragmentation amazes me).

RW What are the XFS parameters for the file system?

Is this sufficient?

% xfs_info /raid
meta-data=/dev/VolGroup00/LogVol05 isize=256    agcount=32, agsize=29434880 blks
         =                         sectsz=512   attr=0
data     =                         bsize=4096   blocks=941916160, imaxpct=25
         =                         sunit=0      swidth=0 blks, unwritten=1
naming   =version 2                bsize=4096
log      =internal                 bsize=4096   blocks=32768, version=1
         =                         sectsz=512   sunit=0 blks, lazy-count=0
realtime =none                     extsz=4096   blocks=0, rtextents=0


RW What is the storage setup?

The filesystem is on an LVM volume which sits on a RAID 5 (hardware
RAID) drive.

RW Need the info.

So the way forward would be to use xfs_fsr on that drive. I read
some horror stories about lost files; are these to be taken seriously?
(I mean, they were in some Ubuntu forums ;) )

Any other thoughts on parameters?

Thanks for your time

Bernhard
___
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos

---End Message---


Re: [CentOS] XFS-filesystem corrupted by defragmentation Was: Performance problems with XFS on Centos 5.4

2010-04-13 Thread Bernhard Gschaider

 On Tue, 13 Apr 2010 11:58:39 +0100
 JP == James Pearson jame...@moving-picture.com wrote:

JP Bernhard Gschaider wrote:
 Before trying to defragment my whole filesystem (see the attached
 mail for the whole story) I figured: let's try it on a single file first.

JP Might be better to ask on the XFS list: x...@oss.sgi.com -
JP see:

JP http://oss.sgi.com/mailman/listinfo/xfs

Thank you. I did. I just figured that if this is a problem specific to
the version of the xfs-utils that comes with CentOS, somebody here might
have encountered it too.


Re: [CentOS] XFS-filesystem corrupted by defragmentation Was: Performance problems with XFS on Centos 5.4

2010-04-13 Thread Bernhard Gschaider

Just to close this thread and remove any doubt that it might have
raised about XFS: the problem was a PEBCAK[1].
It was pointed out to me on the XFS list that the device I used for
xfs_db was inconsistent with the info from xfs_info (I was blindly
copying the device from the output of df).

Footnotes: 
[1]  PEBCAK: Problem exists between chair and keyboard
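
One way to avoid this kind of mix-up is to look up the device for a mount point mechanically instead of copying it by eye. The following is a minimal sketch, assuming the mount table is readable at /proc/mounts; the function name mount_source is made up for this illustration.

```shell
#!/bin/sh
# Print the device that backs a mount point, read from a mounts table
# (default: /proc/mounts). The last matching entry wins, because a later
# mount shadows an earlier one on the same directory.
mount_source() {
    mnt=$1
    table=${2:-/proc/mounts}
    awk -v m="$mnt" '
        $2 == m { dev = $1 }
        END     { if (dev != "") print dev; else exit 1 }
    ' "$table"
}

# Usage (read-only check of the filesystem mounted on /raid):
#   dev=$(mount_source /raid) && xfs_db -r -c frag "$dev"
```

In this thread xfs_info reported LogVol05 for /raid while xfs_db was pointed at LogVol04, which is exactly the kind of mismatch such a lookup would avoid.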



Re: [CentOS] Performance problems with XFS on Centos 5.4

2010-04-12 Thread Bernhard Gschaider

 On Fri, 9 Apr 2010 10:59:02 -0400
 RW == Ross Walker rswwal...@gmail.com wrote:

RW On Apr 9, 2010, at 9:59 AM, Bernhard Gschaider
RW bgschaid_li...@ice-sf.at wrote:

 Hi!
 
 During the last few weeks I have experienced performance problems
 with a large XFS file-system. Sometimes, for instance, ls is
 painfully slow. Immediately afterwards, ls on the same directory
 is instantaneous. I used strace on this ls and found that during
 the first ls the lstat calls need approx. 0.02s each, while
 during the second ls they are two orders of magnitude faster.
 
 Googling around I stumbled upon some messages similar to this one
 
 http://www.opensubscriber.com/message/linux-...@oss.sgi.com/1355060.html
 
 which have in common that a) they're from around 2006 and b) they
 suggest increasing a mount option, ihashsize. This mount option
 is listed as deprecated in the current kernel-doc.
 
 So my question: does anyone have experience with this kind of
 performance problem? Do you think it is an XFS problem, or are
 there some other tuning parameters in the kernel that could be
 modified, for instance via /proc?
 
 The reason why I'm asking here is that it is a production
 file-system, so I would be very unpopular if I experimented too
 much (a couple of reboots is OK ;) )
 
 Bernhard
 
 PS: the situation got worse during the last weeks as the
 file-system grew in size, so the possibility that some kind of
 buffer is now too small and I'm experiencing some kind of
 thrashing seems very likely to me

RW Are you defragging the file system regularly?

Oops. Never occurred to me (fragmentation is so Windoze).
Had a look:

xfs_db> frag
actual 6349355, ideal 4865683, fragmentation factor 23.37%

This seems significant.

RW How much memory do you have in the system and how big is the
RW file system?

Memory on the system is 4 GB (2 dual-core Xeons). The filesystem is
3.5 TB, of which 740 GB are used. That is the maximum amount used
during the one year the filesystem has been in use (which is why the
high fragmentation amazes me).

RW What are the XFS parameters for the file system?

Is this sufficient?

% xfs_info /raid
meta-data=/dev/VolGroup00/LogVol05 isize=256    agcount=32, agsize=29434880 blks
         =                         sectsz=512   attr=0
data     =                         bsize=4096   blocks=941916160, imaxpct=25
         =                         sunit=0      swidth=0 blks, unwritten=1
naming   =version 2                bsize=4096
log      =internal                 bsize=4096   blocks=32768, version=1
         =                         sectsz=512   sunit=0 blks, lazy-count=0
realtime =none                     extsz=4096   blocks=0, rtextents=0


RW What is the storage setup?

The filesystem is on an LVM volume which sits on a RAID 5 (hardware
RAID) drive.

RW Need the info.

So the way forward would be to use xfs_fsr on that drive. I read
some horror stories about lost files; are these to be taken seriously?
(I mean, they were in some Ubuntu forums ;) )

Any other thoughts on parameters?

Thanks for your time

Bernhard


[CentOS] Performance problems with XFS on Centos 5.4

2010-04-09 Thread Bernhard Gschaider

Hi!

During the last few weeks I have experienced performance problems with a
large XFS file-system. Sometimes, for instance, ls is painfully slow.
Immediately afterwards, ls on the same directory is instantaneous. I used
strace on this ls and found that during the first ls the lstat calls need
approx. 0.02s each, while during the second ls they are two orders of
magnitude faster.

Googling around I stumbled upon some messages similar to this one

http://www.opensubscriber.com/message/linux-...@oss.sgi.com/1355060.html

which have in common that
a) they're from around 2006
b) they suggest increasing a mount option, ihashsize. This mount
   option is listed as deprecated in the current kernel-doc

So my question: does anyone have experience with this kind of
performance problem? Do you think it is an XFS problem, or are there
some other tuning parameters in the kernel that could be modified, for
instance via /proc?

The reason why I'm asking here is that it is a production file-system,
so I would be very unpopular if I experimented too much (a couple of
reboots is OK ;) )

Bernhard

PS: the situation got worse during the last weeks as the file-system
grew in size, so the possibility that some kind of buffer is now too
small and I'm experiencing some kind of thrashing seems very likely to
me
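
The slow-first, fast-second behaviour described in this message can be observed by timing two consecutive listings of the same directory. The following is a minimal sketch with whole-second resolution only; the function name time_ls_twice is made up for this illustration, and the strace invocation in the comment assumes that ls on the system in question really uses lstat (newer systems may use a different stat call).

```shell
#!/bin/sh
# Time two consecutive listings of the same directory. With a cold cache
# the first run has to fetch the inodes from disk, the second run is
# normally answered from memory.
time_ls_twice() {
    dir=$1
    for run in first second; do
        start=$(date +%s)
        ls -l "$dir" > /dev/null
        end=$(date +%s)
        echo "$run run: $((end - start))s"
    done
}

# Per-call timings, as described in the message, can be seen with:
#   strace -T -e trace=lstat ls -l /some/directory > /dev/null
```

A large difference between the two runs points at metadata being read from disk on the first pass rather than at ls itself.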


[CentOS] Repository for clean and safe installation of python2.6 (and other python-software)

2009-11-30 Thread Bernhard Gschaider

Hi!

I want to test some software on my CentOS 5 machine that is not too
happy with the system Python (2.4). So when I install another Python on
my machines I want it to be

a) clean: in other words, it should be an RPM
b) safe: it should not remove the system Python (in order not to break
   the system scripts in subtle ways) but be accessible, for
   instance, as python2.6 (while python would still be python2.4)

I've looked around (even in places other than the usual suspects, RPMforge
and EPEL) for repositories that could have such packages.

Before I start rolling my own RPMs: is anyone aware of a repository
that has such packages? (An added bonus would be if it had RPMs of
PyQt4; I didn't find those for CentOS or RHEL either.)

Bernhard


Re: [CentOS] Repository for clean and safe installation of python2.6 (and other python-software)

2009-11-30 Thread Bernhard Gschaider

 On Mon, 30 Nov 2009 11:24:24 -0500
 JP == Jim Perrin jper...@gmail.com wrote:

JP On Mon, Nov 30, 2009 at 10:10 AM, Bernhard Gschaider
JP bgschaid_li...@ice-sf.at wrote:

 a) clean: in other words, it should be an RPM
 b) safe: it should not remove the system Python (in order not to break
    the system scripts in subtle ways) but be accessible, for
    instance, as python2.6 (while python would still be python2.4)

 Before I start rolling my own RPMs: is anyone aware of a
 repository that has such packages? (An added bonus would be if
 it had RPMs of PyQt4; I didn't find those for CentOS or RHEL
 either.)


JP http://iuscommunity.org/

Brilliant. Exactly the kind of information you don't get by googling
around. Thanks

JP I've heard good things about this repo, but as with everything
JP else, use at your own risk. If it eats your dog and runs off
JP with your girlfriend, I am not responsible.

I understand. But if I GET a dog I can hold you responsible? ;)

Bernhard




Re: [CentOS] Upgrade to 5.4 with an existing XFS-filesystem

2009-10-22 Thread Bernhard Gschaider

 On Tue, 20 Oct 2009 12:36:06 -0700
 AY == Akemi Yagi amy...@gmail.com wrote:

AY On Tue, Oct 20, 2009 at 11:51 AM, Bernhard Gschaider
AY bgschaid_li...@ice-sf.at wrote:

 I've got a fileserver currently running under 5.3 with the
 /home-partition being an XFS-filesystem. I use the kmod-xfs
 from extras. It works great ;)
 
 Now: as I understand it from the release-notes the 5.4 kernel
 has XFS already built-in. Right? Or is it just a kmod-package
 (technology preview)?
 
 Now my question is: are there any recommendations for an
 upgrade-procedure? I mean, I can probably manage, but I'll want
 to minimize downtime.

 BTW: when doing yum list updates I don't see any
 kernel*-packages in the list. Is this because the last kernel
 from the 5.3-updates has the same build number (164, I think)?
 And is the 5.4-base-kernel the same as the latest
 5.3-updates-kernel?

AY The -164 kernel is indeed from 5.4 and has xfs as a built-in
AY kernel module.  If you are already running this kernel, that
AY indicates all is well and no further action is needed.

AY Could you show us the output returned by:

AY ls -l `find /lib/modules -name xfs.ko`

Thanks for the hint: I did it (I'll spare you the listing). The
-164 kernel is the first one where, according to rpm -qf <path>, the
module is owned by the kernel package. All the other instances of
xfs.ko point to a module owned by kmod-xfs.
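
The ownership check described above can be run over all installed kernels in one go. The following is a minimal sketch, assuming an RPM-based system; the function name xfs_module_owners is made up for this illustration.

```shell
#!/bin/sh
# For every xfs.ko below a modules tree, print its path and the RPM that
# owns it (a kernel package for the built-in module, kmod-xfs for the
# add-on module).
xfs_module_owners() {
    root=${1:-/lib/modules}
    find "$root" -name xfs.ko 2>/dev/null | while read -r mod; do
        if command -v rpm >/dev/null 2>&1; then
            # rpm -qf exits non-zero when no package owns the file.
            owner=$(rpm -qf "$mod" 2>/dev/null) || owner="not owned by any package"
        else
            owner="rpm not available"
        fi
        printf '%s: %s\n' "$mod" "$owner"
    done
}

# Usage: xfs_module_owners            (scans /lib/modules)
```

Comparing the line for the running kernel (see uname -r) with the others shows whether the in-kernel module or the kmod-xfs one is in use.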

So obviously I'm not using the kmod-xfs anymore (I'm relieved that the
last kernel-update worked without a clash)

Thanks again for clearing that up

Bernhard

BTW: yes. It is an x86_64 machine.




[CentOS] Upgrade to 5.4 with an existing XFS-filesystem

2009-10-20 Thread Bernhard Gschaider

Hi!

I've got a fileserver currently running under 5.3 with the
/home-partition being an XFS-filesystem. I use the kmod-xfs from
extras. It works great ;)

Now: as I understand it from the release-notes the 5.4 kernel has XFS
already built-in. Right? Or is it just a kmod-package (technology
preview)?

Now my question is: are there any recommendations for an
upgrade-procedure? I mean, I can probably manage, but I'll want to
minimize downtime.

Bernhard

BTW: when doing yum list updates I don't see any kernel*-packages
in the list. Is this because the last kernel from the 5.3-updates has
the same build number (164, I think)? And is the 5.4-base-kernel the
same as the latest 5.3-updates-kernel?




[CentOS] Install CentOS on Partition(!)-images as a Xen-guest

2009-09-15 Thread Bernhard Gschaider

Hi!

Some weeks ago the book "Running Xen" was recommended to me on this
list. A nice book, but it gave me a crazy idea: one of the
recommendations (which sounds very reasonable to me) was, instead of
having a single file that serves as a disk image, to have two
files: one of them (which is mapped to hda1) serves as a partition
that holds the data, the other one (which is mapped to hda2) is the swap
partition. The text in the book (admittedly it's a bit vague there)
led me to the conclusion that to the Xen machine they will look like
two partitions of a single drive hda. The advantage should be that it
is much easier to extend the data partition (take the machine
offline, extend the image with dd, extend the filesystem on the image).
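
The extend step mentioned in parentheses could look roughly like this. It is a minimal sketch, not the procedure from the book: the function name extend_image and the file name root.img are made up for this illustration, and the filesystem step assumes an ext3 image with e2fsprogs installed.

```shell
#!/bin/sh
# Grow a raw partition image by appending zero-filled space.
# Appending with >> never touches the data already in the image.
extend_image() {
    img=$1
    add_mib=$2
    [ -f "$img" ] || { echo "no such image: $img" >&2; return 1; }
    dd if=/dev/zero bs=1048576 count="$add_mib" 2>/dev/null >> "$img"
}

# The ext3 filesystem inside the image keeps its old size until it is
# resized. With the guest shut down this could be done with:
#   extend_image root.img 1024
#   e2fsck -f root.img && resize2fs root.img
```

Because the image holds a bare filesystem rather than a partitioned disk, no partition table has to be edited after the file has grown.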

I created the two files as described in the book, formatted them with
mkswap and mkfs.ext3 (as described in the book) and added them to the
configuration file for the host. Now when I start the installation of
the host machine, right at the beginning I get the message "/dev/hda1 has
a loop partition layout. To use this disk for the installation of
CentOS it must be initialized." When I don't allow formatting I get
caught in a loop ("Is a loop partition. Initialize?"); when I allow
formatting I get a partition on each of the devices (hda11 and hda21)
and I would be allowed to go on with the installation, but IMHO this
would defeat the purpose of the exercise.

So my question:
 - is there some error in thinking on my side?
 - is this a situation that the installer can't deal with? (in other
   words: would it work if I copied a complete installation into the root
   partition-image and then booted that?)
 - or is there a reason that using partition images is not very
   popular? (Googling around did not reveal anything useful)
 
Bernhard




Re: [CentOS] virt-manager crashes Host during installation of guest

2009-08-24 Thread Bernhard Gschaider

 On Thu, 20 Aug 2009 01:47:07 +0200
 BG == Bernhard Gschaider bgschaid_li...@ice-sf.at wrote:

BG Thanks to everyone who took the time to answer in this thread.

BG I'm just writing this message to give this thread some closure
BG and am not expecting any answers

I found the cause of my problem. I'll answer my own posting,
because none of the follow-ups led to the solution, the reason being
that I withheld some information vital for the diagnosis:

The filesystem which the disk image for the virtual machine resides on
is XFS!

Googling around I found indications that
Xen and XFS are not the best of friends, but nobody hinted at
something as drastic as completely freezing the host.

Anyway. When I create the image on an ext3 partition and point the
configuration to it (no other changes) the installation works without
problems.

Bernhard

PS: There were speculations in the replies, that this was my first
Xen-experience. I want to stress that I have already a running
Xen-machine, that's why I was surprised that it didn't work this time
(but I set that one up a year ago, so technically I am starting anew
with Xen and the speculations are right)

 On Wed, 19 Aug 2009 10:29:33 -0700 (PDT)
 IM == Ian Murray murra...@yahoo.co.uk wrote:

 This (and other replies) leads me to two possible culprits:
  - either the graphical console over X11 is not a good idea (but I
    can't imagine that, it shouldn't shoot the kernel)
  - I always installed as a paravirtualized machine. Could it be that
    the install kernel on the 5.3 media is not aware of this and somehow
    manages to shoot the host? (I noticed that most recipes on the
    net, including http://wiki.centos.org/HowTos/Xen/InstallingCentOSDomU,
    never talk about paravirtualized, so I assume they use a fully
    virtualized guest)
 
 I will try these later today (when people have left the office and
 no one will complain about server downtime)
 
 Bernhard
 
 BTW: Just one fundamental question: as the upstream OS vendor is
 switching its virtualization to KVM anyway, is it a good idea to
 forget Xen and use KVM (in other words: is it stable enough for
 production)?

BG I tried removing both suspects by

BG  - following the Wiki-Howto to the letter (especially using
BG    the Xen-install-kernels)
BG  - instead of going over the network I worked directly at the
BG    machine (although I totally agree that a VNC-session
BG    shouldn't be able to shoot the machine)

BG but the problem is still there. When I start the configured
BG machine that points to an install-kernel with

BG xm create newGuest -c

BG I see the kernel boot up until it comes to the message

BG Write protecting the kernel read-only data

BG where it hangs for some seconds, then the screen goes blank
BG and the machine reboots.

BG I'm starting to suspect that it is somehow hardware-related
BG (it is a Fujitsu-Siemens Synergy server with a
BG RAID-controller) and I will investigate in that direction


-- 
---
DI Bernhard F.W. Gschaider
---
EMail:  bernhard.gschai...@ice-sf.at
WWW  : www.ice-sf.at
Jabber : bgsch...@jabber.org
Tel:+43(3842)98282-42   Fax:+43(3842)98282-02
---




Re: [CentOS] virt-manager crashes Host during installation of guest

2009-08-24 Thread Bernhard Gschaider

 On Mon, 24 Aug 2009 20:43:57 +0300
 PK == Pasi Kärkkäinen pa...@iki.fi wrote:

PK On Mon, Aug 24, 2009 at 06:04:06PM +0200, Bernhard Gschaider
PK wrote:
  On Thu, 20 Aug 2009 01:47:07 +0200
  BG == Bernhard Gschaider bgschaid_li...@ice-sf.at wrote:
 
BG Thanks to everyone who took the time to answer in this thread.

BG I'm just writing this message to give this thread some closure
BG and am not expecting any answers
 I found the cause of my problem. I'll answer my own posting,
 because none of the follow-ups led to the solution, the reason
 being that I withheld some information vital for the diagnosis:
 
 The filesystem which the disk image for the virtual machine
 resides on is XFS!
 
 Googling around I found indications
 that Xen and XFS are not the best of friends, but nobody hinted
 at something as drastic as completely freezing the host.
 
 Anyway. When I create the image on an ext3 partition and point
 the configuration to it (no other changes) the installation
 works without problems.
 

PK Are you running 32bit or 64bit Xen host/dom0?

PK XFS is known to have problems on 32bit kernels, while it
PK should work on 64bit kernels. (this is even without Xen).

Everything is 64 bit. So that can't be the explanation.

PK -- Pasi


Re: [CentOS] virt-manager crashes Host during installation of guest

2009-08-19 Thread Bernhard Gschaider

Thanks for the replies so far.

 On Wed, 19 Aug 2009 11:06:08 +0100
 MMG == Marcelo M Garcia marcelo.maia.gar...@googlemail.com wrote:

MMG Bernhard Gschaider wrote:
 Hi!
 
 I have the following problem: I have a server (CentOS 5.3
 x86_64) on which I want to install a virtual Xen machine
 (CentOS 5.3 x86_64). I ssh from my workstation (CentOS 5.3
 x86_64, do you see the pattern ;) ) to that server and
 start virt-manager. I create a new guest (paravirtualized)
 and point it to the server with the installation files (CentOS
 5.3, but I already said that). The manager creates the disk
 image and then opens the graphical console for
 installation. Sometime around the point where the installation
 program wants me to select the keyboard, the graphical console
 freezes. The server is completely dead (no console, no disk
 activity, no ping; only a reset will revive it).
 
 My question: am I doing something stupid? I figured
 "They're all the same system, this must work".
 
 I don't want to play around with it too much as the server is
 also our file-server and people start complaining.
 
 So any hint will be greatly appreciated (otherwise I'll have to
 setup another machine for the guests)
 

MMG I use the virt-manager, but I always use a kickstart to do
MMG the installation and I never had problems.

This (and other replies) leads me to two possible culprits:
 - either the graphical console over X11 is not a good idea (but I
   can't imagine that, it shouldn't shoot the kernel)
 - I always installed as a paravirtualized machine. Could it be that
   the install kernel on the 5.3 media is not aware of this and somehow
   manages to shoot the host? (I noticed that most recipes on the
   net, including http://wiki.centos.org/HowTos/Xen/InstallingCentOSDomU,
   never talk about paravirtualized, so I assume they use a fully
   virtualized guest)

I will try these later today (when people have left the office and no one
will complain about server downtime)

Bernhard

BTW: Just one fundamental question: as the upstream OS vendor is
switching its virtualization to KVM anyway, is it a good idea to
forget Xen and use KVM (in other words: is it stable enough for
production)?




Re: [CentOS] virt-manager crashes Host during installation of guest

2009-08-19 Thread Bernhard Gschaider

Thanks to everyone who took the time to answer in this thread.

I'm just writing this message to give this thread some closure and am
not expecting any answers 

 On Wed, 19 Aug 2009 10:29:33 -0700 (PDT)
 IM == Ian Murray murra...@yahoo.co.uk wrote:

 This (and other replies) leads me to two possible culprits:
  - either the graphical console over X11 is not a good idea (but I
    can't imagine that, it shouldn't shoot the kernel)
  - I always installed as a paravirtualized machine. Could it be that
    the install kernel on the 5.3 media is not aware of this and somehow
    manages to shoot the host? (I noticed that most recipes on the
    net, including http://wiki.centos.org/HowTos/Xen/InstallingCentOSDomU,
    never talk about paravirtualized, so I assume they use a fully
    virtualized guest)
 
 I will try these later today (when people have left the office and
 no one will complain about server downtime)
 
 Bernhard
 
 BTW: Just one fundamental question: as the upstream OS vendor is
 switching its virtualization to KVM anyway, is it a good idea to
 forget Xen and use KVM (in other words: is it stable enough for
 production)?

IM Sorry for thread mucking. I did not receive this email, but
IM took it from a response.

IM The Xen wiki describes a paravirtual install. The config file
IM would have a line like builder=hvm if it was fully
IM virtualised guest. AFAIK the graphical view is just a VNC
IM session, so I would be surprised if that managed to trash your
IM kernel. More likely it's something that the guest is doing
IM that is causing the issue. You could always prepare your
IM guests on a different machine and transfer them
IM later. Ofcourse, if you had a command of xm, that is.
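
The check described above (does the guest configuration contain a
builder=hvm line?) can be sketched as a small shell helper. This is only
an illustration: the function name xen_guest_type is made up, and the
path /etc/xen/newGuest assumes the guest configuration sits in the
default directory that xm searches.

```shell
# Report "hvm" when an xm guest configuration sets builder to hvm,
# otherwise "pv" (a guest without that line is paravirtualized).
xen_guest_type() {
    if grep -Eq "^[[:space:]]*builder[[:space:]]*=[[:space:]]*[\"']hvm[\"']" "$1"; then
        echo hvm
    else
        echo pv
    fi
}

# Example call (assumed location of the configuration file):
#   xen_guest_type /etc/xen/newGuest
```

Commented-out builder lines are ignored because the pattern only matches
when the line starts with the keyword.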

I tried removing both suspects by

 - following the Wiki-Howto to the letter (especially using the
   Xen-install-kernels)
 - instead of going over the network I worked directly at the machine
   (although I totally agree that a VNC-session shouldn't be able to shoot
   the machine)

but the problem is still there. When I start the configured machine
that points to an install-kernel with

xm create newGuest -c

I see the kernel boot up until it comes to the message

Write protecting the kernel read-only data

where it hangs for some seconds, then the screen goes blank and the
machine reboots.

I'm starting to suspect that it is somehow hardware-related (it is a
Fujitsu-Siemens Synergy server with a RAID-controller) and I will
investigate in that direction

IM As I said before, I would recommend the xen list for this
IM specific issue.

Will look there too, thanks

IM As for the Xen vs whatever issue, I was disappointed when it
IM became clear that Upstream was going to push another
IM technology, having spent last year or two trying to learn Xen
IM (and I am no expert, at all). Having said that, I've heard of
IM issues with speed with KVM and I haven't had any such issues
IM with Xen. My only issue with Xen is that the official releases
IM are based on quite an old kernel, which is fine for CentOS, bc
IM it is the same as the vanilla kernel. Anecdotally, a lot of
IM issues on the xen list IMHO seem to arise from ppl using later
IM patched kernels, which perhaps isn't the best route for
IM stability.

As I'm using the latest kernel that comes with the 5.3-updates and the
machine has nothing but the standard-5.3 stuff on it, I don't think
this is the case

Bernhard




[CentOS] virt-manager crashes Host during installation of guest

2009-08-18 Thread Bernhard Gschaider

Hi!

I have the following problem: I have a server (CentOS 5.3 x86_64) on
which I want to install a virtual Xen-machine (CentOS 5.3 x86_64), I
ssh from my workstation (Centos 5.3 x86_64  do you see the pattern ;) )
to that server and start the virt-manager. I create a new Guest
(Paravirtualized) and point it to the server with the installation
files (CentOS 5.3, but I already said that). The manager creates the
disk image and then opens the graphical console for
installation. Sometime around the point where the installation program
wants me to select the keyboard, the graphical console freezes. The
server is completely dead (no console, no disk activity, no ping, only
a reset will repair it)

My question: am I doing something stupid? I figured "They're all
the same system, this must work".

I don't want to play around with it too much as the server is also our
file-server and people start complaining.

So any hint will be greatly appreciated (otherwise I'll have to setup
another machine for the guests)

Bernhard

-- 
---
DI Bernhard F.W. Gschaider
---
EMail:  bernhard.gschai...@ice-sf.at
WWW  : www.ice-sf.at
Jabber : bgsch...@jabber.org
Tel:+43(3842)98282-42   Fax:+43(3842)98282-02
---




[CentOS] Preventing hour-long fsck on ext3-filesystem

2009-05-14 Thread Bernhard Gschaider

Hi!

I'm just in the process of setting up a new fileserver for our
company. I'm installing CentOS 5.3 (64 bit) on it.

One of the problems with it is that it has a 3.5TB filesystem for
the user data, which I formatted during setup as ext3. Now my
experience with our current fileserver is that a fsck of its 0.5TB
ext3 filesystem needs approx. half an hour to complete (and kicks in
every so many reboots or every 180 days). My estimate is that for the
larger filesystem (and the faster machine) the fsck would need well
over an hour (being optimistic). I dread the day when I have to reboot
the server and wait for 2 hours or more just because the system
thought it would be a prudent thing to check the filesystem.

My question:

 - is there another stable filesystem (XFS, ReiserFS ...) in the
   centosplus-kernel where this could be avoided (fsck is faster) and
   that is as safe as ext3?
 - Or would it be better to switch off automatic checking with tune2fs?
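
For the tune2fs option, a minimal sketch of what switching off the
automatic checks could look like. The device name is a placeholder and
show_check_schedule is a helper invented for this illustration; -c and
-i are the tune2fs options for the mount-count and the time-interval
trigger.

```shell
# Keep only the fields of `tune2fs -l` output that control periodic checks.
show_check_schedule() {
    grep -E '^(Mount count|Maximum mount count|Last checked|Check interval|Next check after):'
}

# Placeholder device name; substitute the real volume.
DEVICE=/dev/VolGroup00/LogVolData

# Inspect the current schedule (needs root):
#   tune2fs -l "$DEVICE" | show_check_schedule
# Turn off both triggers (-c 0: mount count, -i 0: time interval):
#   tune2fs -c 0 -i 0 "$DEVICE"
```

With both triggers off, a check only happens when it is started by hand
or when the filesystem is marked as needing one after an error.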

Any opinion/experience welcome. I looked around a bit but couldn't
find a good answer

Bernhard

PS: Sorry for the stupid question, but I'm only part-time admin and
testing this myself would take weeks, I guess



Re: [CentOS] Preventing hour-long fsck on ext3-filesystem

2009-05-14 Thread Bernhard Gschaider

Thank you all for your quick answers (you guys must have started
typing BEFORE I hit the Send-button).

The general consensus seems to be "If you can start anew: use
XFS". This leaves one question: as XFS is not included in the
standard-kernel, which option offers the smoothest sailing
(especially during kernel-updates):

 - kernel from centosplus
 - kmod-xfs from centosplus
 - kmod-xfs from extras 

Bernhard

 On Thu, 14 May 2009 11:57:49 -0400
 BLB == Brent L Bates blba...@vigyan.com wrote:

BLB  I strongly recommend XFS over ext[23] ANY day.  XFS is
BLB faster, more robust, and more dependable than ext.  I've used
BLB it for years and it is rock solid.  I've had it work through
BLB failing disk drives and numerous system crashes (caused by
BLB faulty memory).  It takes a licking and keeps on ticking.
BLB :-) No need to `fsck' the drive.  If there are any file
BLB system problems, one can run xfs_check with a live system.
BLB It isn't recommended as it can give false positives for a
BLB live running file system, but it can help if needed.
BLB xfs_repair has to be run on an unmounted file system,
BLB however, I've almost never needed to use xfs_check or
BLB xfs_repair.  XFS has over a decade and petabytes of use
BLB behind it.  I wouldn't use any other file system.

BLB --

BLB   Brent L. Bates (UNIX Sys. Admin.)  M.S. 912 Phone:(757)
BLB 865-1400, x204 NASA Langley Research Center FAX:(757)
BLB 865-8177 Hampton, Virginia 23681-0001 Email:
BLB b.l.ba...@larc.nasa.gov http://www.vigyan.com/~blbates/





[CentOS] Services take a long time during initialization (LDAP-authentication)

2008-10-02 Thread Bernhard Gschaider

Hi!

I've got a CentOS4.7-machine as a fileserver. The machine also serves
as the LDAP-server against which all regular users are
authenticated. The problem I am having is that the machine takes a
very long time to boot. Most of the services start up rather quickly,
but three services take quite a long time (extending the boot time to
over half an hour, which is too long)

The three services are (in that order):

 - NFS statd
 - LDAP
 - NFS quota

As soon as the LDAP-Service (and sshd) has started I can remotely log
into the machine: the rpc.quotad has started but takes no CPU-time (so
I guess it must be waiting for something)

Some other observations I had:
 - a restart of LDAP takes several minutes
 - if the LDAP-service is down, a "su ldap" (ldap IS in /etc/passwd)
   seems to hang. The ldap-service-script does such a su to check the
   configuration, so I guess that is what takes the service so long

Could anyone give me a hint what the problem could be and how to fix
it? Is any more information required (don't want to spam you with the
nsswitch.conf and the whole pam.d-stuff)?

Bernhard


Re: [CentOS] Job Scheduling suggestions

2008-09-11 Thread Bernhard Gschaider

 On Wed, 10 Sep 2008 19:54:44 -0700
 JRP == John R Pierce John wrote:

JRP Mag Gam wrote:
 At my university we have 50 computers in the lab. We would like
 to use a scheduler to schedule our fluid models, and I was
 wondering what is a good suggestion?
 

JRP you might look at using one of the scientific clustering
JRP packages, like Oscar, which implements and manages an MPI
JRP cluster.  this of course assumes your fluid model software is
JRP written to use MPI

If you're going for cluster software you might consider

http://www.rocksclusters.org/

It's even based on CentOS.

Here at our place we have a dedicated cluster with Rocks. In addition
I took the SGE-rpm from the distribution and installed it on our
regular (CentOS) workstations so that jobs can be scheduled on these
machines too


Re: [CentOS] Problem with nvidia-drv-x11 when upgrading to CentOS 5.2

2008-07-01 Thread Bernhard Gschaider

 On Tue, 1 Jul 2008 10:40:38 -0700
 M == MHR  [EMAIL PROTECTED] wrote:

M On Wed, Jun 25, 2008 at 1:55 AM, Bernhard Gschaider
M [EMAIL PROTECTED] wrote:
 Hi John!
 
 As I understand it the nvidia-x11-drv IS the nvidia.com-driver
 just repackaged in such a way that it automatically recompiles
 itself if a new kernel is installed. For the
 vanilla-nvidia-drivers I had to write a script for that and the
 number of workstation here is not big enough that it justifies
 that (I did that, but it never worked 100% and testing a boot
 time script is a pain in the a##) Obviously the script in the
 RPM-package doesn't know how to behave during an upgrade (let's
 see how it fares when the first 5.2-kernel-update comes along)
 

M I may be mistaken on this, but IIRC, the driver does not
M recompile itself automatically at all.  I used dkms once a
M great many kernels ago, and the driver has been recompiled for
M me for every single update since then (4.4, I think), including
M my update to 5.2.  I didn't see any reference to this in this
M thread at all, so I'm assuming that you are not using dkms.

I didn't mention it, but the package recompiles the driver using DKMS.

I'm fully aware, that binary-only-packages won't work

M For the person who uninstalled, updated and reinstalled the
M driver, I didn't see any mention of a recompile - did you do
M that?

As said above: DKMS should take care of that - and it always did

M Now, if your driver doesn't work after an explicit recompile,
M that's a different problem.



Force reinstallation of packages Was: [CentOS] Root-filesystem remounts as read-only during 5.2 upgrade (system completely shoot)

2008-06-26 Thread Bernhard Gschaider

 On Wed, 25 Jun 2008 23:53:16 +0200
 BG == Bernhard Gschaider [EMAIL PROTECTED] wrote:

 On Wed, 25 Jun 2008 14:39:22 -0400
 WLM == William L Maltby [EMAIL PROTECTED] wrote:

WLM On Wed, 2008-06-25 at 20:27 +0200, Bernhard Gschaider wrote:
 snip

 Is there a way to say: Hey RPM, have a look whether really
 the files in your database are on the disk) ?

WLM Use rpm's verify option. I forget the exact syntax: I'm sorry
WLM to have to sentence you to the rpm manpage dungeon. :-(

Sorry. Stupid question again: and if I find inconsistencies, then the
only way to force rpm to correct them would be something like

yum remove offendingPackage
yum install offendingPackage

or the equivalent rpm-commands?

Currently the machine behaves quite strangely:
 - Boots OK
 - Lets users log in and most applications work
 - Firefox works only for root
 - yumex hangs at starting
 - man rpm says "XXX WARNING: old character encoding and/or character set"

All this leads me to the conclusion that only some selected packages
are corrupt (and I don't want to reinstall the machine). Would
installing/repairing from DVD help?

Bernhard

BG Thanks. I was looking for the keyword check in man-pages
BG (shows you that half the art in searching is knowing for
BG what)



Re: Force reinstallation of packages Was: [CentOS] Root-filesystem remounts as read-only during 5.2 upgrade (system completely shoot)

2008-06-26 Thread Bernhard Gschaider

 On Thu, 26 Jun 2008 06:51:35 -0400
 WLM == William L Maltby [EMAIL PROTECTED] wrote:

WLM WARNING! Due to my background, I don't often read man pages
WLM like I used to. So there may be some inaccuracies or
WLM ambiguities below.

WLM On Thu, 2008-06-26 at 11:58 +0200, Bernhard Gschaider wrote:
 snip

 Sorry. Stupid question again: and if I find inconsistencies,
 then the

WLM Keep in mind that *some* inconsistencies are expected. Local
WLM config files being one good example. You must look at the
WLM codes displayed in the output, and possibly the files, to be
WLM sure it is really a discrepancy.

I know. I compared with the verify-output from a working machine. For
my theory (there are different rpm-packages on the disk than in the
rpm-database) to be right there should be a large number of files
with wrong MD5-sums. And there is only a handful for which this is the
case (and they seem mostly harmless)
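
As a rough illustration of that comparison: in the output of "rpm -Va"
a "5" in the flag column marks a changed MD5 sum and a "c" in the
following column marks a configuration file. The helper name
changed_non_config is made up for this sketch, and it assumes file names
without spaces.

```shell
# Read `rpm -V` output on stdin and print files whose checksum differs
# from the rpm database, skipping files marked as configuration ("c"),
# where local changes are expected.
changed_non_config() {
    awk '$1 ~ /5/ && $2 != "c" { print $NF }'
}

# Typical use (as root, so that every file can be read):
#   rpm -Va | changed_non_config
```

Running the same pipeline on a healthy machine and on the suspect one
gives two short lists that are easy to compare.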
 
 only way to force rpm to correct them would be something like
 
 yum remove offendingPackage
 yum install offendingPackage
 
 or the equivalent rpm-commands?

WLM Not the only way, but probably the safest. However, that may
WLM try to also remove some dependencies, depending on the
WLM package you're trying to remove.

Yep. That's what I was afraid of

WLM I seem to recall a force parameter that is available for
WLM rpm and yum.  Although normally disparaged, this is a perfect
WLM situation for its use.

It exists in RPM, but in yum it is notoriously absent

 Currently the machine behaves quite strange:
  - Boots OK
  - Lets users log in and most applications work
  - Firefox works only for root
  - yumex hangs at starting

WLM Depending on your time-frame, this may be a symptom of the
WLM load on the servers you access. Yesterday A.M. I saw *BIG*
WLM delays downloading the xml(?) files. But I use yum CLI, so I
WLM see the blood-n-guts on the screen. BIAS GUIs suck... in
WLM general/BIAS

yum works. The problem according to an strace yum seems to be that
it is poll-ing on something, but I don't know on what, because I don't
get the arguments to that call, because it never finishes (the last
line just says "poll(")

 - man rpm says XXX WARNING: old character encoding and/or
 character set
 
 All this leads me to the conclusion that there are only some
 selected packages corrupt (and I don't want to reinstall the
 machine). Would Installing/Repairing from DVD help?

WLM Maybe. But some of the rpms might be on your system from the
WLM update activities. Do an updatedb and then a locate
WLM .rpm. You may see some in /var/cache/yum. Subdirs under it
WLM might have what you need.
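
A sketch of how the two suggestions (the cached packages and rpm's
force-style options) might be combined. find_cached_rpm and CACHE_DIR
are names invented for this example; whether packages are still in the
cache depends on the keepcache setting in yum.conf. --replacepkgs and
--replacefiles are the rpm options that reinstall over an existing
database entry.

```shell
# Print cached package files for a given package name.
# CACHE_DIR defaults to the yum cache directory mentioned above.
find_cached_rpm() {
    find "${CACHE_DIR:-/var/cache/yum}" -type f -name "$1-[0-9]*.rpm" 2>/dev/null
}

# Reinstall in place, without removing the package (and its dependants)
# first. Run as root; offendingPackage is the placeholder name used above:
#   rpm -Uvh --replacepkgs --replacefiles $(find_cached_rpm offendingPackage)
```

The name pattern requires a digit after the package name, so a search
for libgcc does not also pick up packages such as libgcc-devel.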

I'll try that. If it doesn't help I'll have to scratch the machine and
install anew.

Thanks
Bernhard


Re: [CentOS] Problem with nvidia-drv-x11 when upgrading to CentOS 5.2

2008-06-26 Thread Bernhard Gschaider

 On Thu, 26 Jun 2008 11:25:42 -0400
 J == John  [EMAIL PROTECTED] wrote:

J Keep me Posted on what happens please. John

You mean whether the nvidia-x11-drv package works within 5.2 during
kernel-updates? Today a new kernel-update arrived and two of my
machines recompiled the driver without problem (so it seems to be a
5.1-to-5.2 upgrade problem)

Bernhard

J -Original Message- From: [EMAIL PROTECTED]
J [mailto:[EMAIL PROTECTED] On Behalf Of Bernhard
J Gschaider Sent: Wednesday, June 25, 2008 4:55 AM To: CentOS
J mailing list Subject: Re: [CentOS] Problem with nvidia-drv-x11
J when upgrading to CentOS 5.2


 On Wed, 25 Jun 2008 01:13:11 -0400 J == John  [EMAIL PROTECTED] 
 wrote:

J -Original Message- From: [EMAIL PROTECTED]
J [mailto:[EMAIL PROTECTED] On Behalf Of Marko
J A. Jennings Sent: Tuesday, June 24, 2008 3:18 PM To:
J centos@centos.org Subject: Re: [CentOS] Problem with
J nvidia-drv-x11 when upgrading to CentOS 5.2

J On Tue, June 24, 2008 3:08 pm, Bernhard Gschaider wrote:
 Hi!
 
 I know that the nvidia-drv-x11-package is from RPMForge, but I
 thought I might mention it here.
 

J problem snipped

J --- Have you tried
J an older version of the driver from rpmforge? Do that First and
J see how that turns out. I myself am a fan of the Nvidia Drivers
J from nvidia.com. Meantime CentOS 5.2 is Very New and there may
J very well be issues with driver Stability yet to be.

J http://wiki.centos.org/HardwareList/Nvidia_Graphics This is a
J link of how to install the nvidia driver from
J nvidia.com. **WARNING** It is not finished but there should be
J enough info there to get to rolling in the right direction. You
J need to be aware it is a draft only. CentOS 5.2 will also be
J included in it within the next 30 days as time allows me.

J Alternatively, after you get one Nvidia.com driver going on one
J machine, you could copy the built kernel module over to other
J machines providing they have the same card. Warning, I do not
J condone such as that. Good Luck...

J Hi John!

J As I understand it the nvidia-x11-drv IS the nvidia.com-driver
J just repackaged in such a way that it automatically recompiles
J itself if a new kernel is installed. For the
J vanilla-nvidia-drivers I had to write a script for that and the
J number of workstation here is not big enough that it justifies
J that (I did that, but it never worked 100% and testing a boot
J time script is a pain in the a##) Obviously the script in the
J RPM-package doesn't know how to behave during an upgrade (let's
J see how it fares when the first 5.2-kernel-update comes along)



Re: [CentOS] Problem with nvidia-drv-x11 when upgrading to CentOS 5.2

2008-06-25 Thread Bernhard Gschaider

 On Wed, 25 Jun 2008 01:13:11 -0400
 J == John  [EMAIL PROTECTED] wrote:

J -Original Message- From: [EMAIL PROTECTED]
J [mailto:[EMAIL PROTECTED] On Behalf Of Marko
J A. Jennings Sent: Tuesday, June 24, 2008 3:18 PM To:
J centos@centos.org Subject: Re: [CentOS] Problem with
J nvidia-drv-x11 when upgrading to CentOS 5.2

J On Tue, June 24, 2008 3:08 pm, Bernhard Gschaider wrote:
 Hi!
 
 I know that the nvidia-drv-x11-package is from RPMForge, but I
 thought I might mention it here.
 

problem snipped

J --- Have you tried
J an older version of the driver from rpmforge? Do that First and
J see how that turns out. I myself am a fan of the Nvidia Drivers
J from nvidia.com. Meantime CentOS 5.2 is Very New and there may
J very well be issues with driver Stability yet to be.

J http://wiki.centos.org/HardwareList/Nvidia_Graphics This is a
J link of how to install the nvidia driver from
J nvidia.com. **WARNING** It is not finished but there should be
J enough info there to get to rolling in the right direction. You
J need to be aware it is a draft only. CentOS 5.2 will also be
J included in it within the next 30 days as time allows me.

J Alternatively, after you get one Nvidia.com driver going on one
J machine, you could copy the built kernel module over to other
J machines providing they have the same card. Warning, I do not
J condone such as that. Good Luck...

Hi John!

As I understand it the nvidia-x11-drv IS the nvidia.com-driver just
repackaged in such a way that it automatically recompiles itself if a
new kernel is installed. For the vanilla-nvidia-drivers I had to write
a script for that and the number of workstations here is not big enough
to justify that (I did that, but it never worked 100% and
testing a boot time script is a pain in the a##).
Obviously the script in the RPM-package doesn't know how to behave
during an upgrade (let's see how it fares when the first
5.2-kernel-update comes along)

Bernhard


Re: [CentOS] udevd can't reach LDAP-server during boot

2008-06-25 Thread Bernhard Gschaider

Thanks for the answers

 On Tue, 24 Jun 2008 15:51:37 -0400
 MS == Meenoo Shivdasani [EMAIL PROTECTED] wrote:

MS On Tue, Jun 24, 2008 at 3:44 PM, Johnny Hughes
MS [EMAIL PROTECTED] wrote:
 There is a BUG with nss_ldap:
 
 https://bugzilla.redhat.com/show_bug.cgi?id=448014
 
 We have this bug listed in our release notes:
 
 
http://wiki.centos.org/Manuals/ReleaseNotes/CentOS5.2#head-447967c60eb305ef2c5dbbc3f4e8b3c4c5170632
 
I read that, but I didn't connect it to my bug, because I didn't think
that anything to do with bash is relevant at that time during the boot
process. But obviously I was wrong (and by the way: that problem also
occurs with 5.1, so I don't think that it is a bash 3.2 problem)

 You can try the nss_ldap from our testing repo for this bug:
 
 http://dev.centos.org/centos/5/
 
 This may help with the problem.

MS FWIW, my experience with a variant of this problem and
MS nss_ldap from the testing repo was that boot would hang
MS indefinitely at udevd unless I limited the nss_reconnect_*
MS values.  I ended up changing to bind_policy soft as a
MS workaround.
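
The workaround mentioned here is a single line, "bind_policy soft", in
the nss_ldap configuration. A small sketch for checking what a machine
currently uses, assuming the configuration file is /etc/ldap.conf (an
assumption about the setup, not something stated in the thread);
ldap_bind_policy is a made-up helper name.

```shell
# Print the active bind_policy from an nss_ldap configuration file,
# or "unset" when there is no uncommented bind_policy line.
ldap_bind_policy() {
    awk '$1 == "bind_policy" { v = $2 } END { print (v == "" ? "unset" : v) }' "$1"
}

# Example call (assumed path):
#   ldap_bind_policy /etc/ldap.conf
```

If the file contains several bind_policy lines, the helper reports the
last one it finds.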

So the bottom line is
 - it is possible to shave 2 min off the boot process
 - there is a good chance to make the machine unbootable (and then
   play around until it works)
Well, I'm a lazy coward ;) but thanks

Bernhard


[CentOS] No plugin-directory for firefox in CentOS 5.2

2008-06-25 Thread Bernhard Gschaider

I noticed that the directory
/usr/lib/firefox-3.0b5/plugins/
is NOT created on CentOS 5.2. Therefore, for instance, the flash-plugin
is not installed for the new firefox.
(After manually creating the directory and setting the right symlink
it works all right)


Re: [CentOS] No plugin-directory for firefox in CentOS 5.2

2008-06-25 Thread Bernhard Gschaider

 On Wed, 25 Jun 2008 15:03:26 +0100
 KS == Karanbir Singh [EMAIL PROTECTED] wrote:

KS Bernhard Gschaider wrote:
 I noticed, that the directory /usr/lib/firefox-3.0b5/plugins/
 is NOT created on CentOS 5.2. Therefor for instance the
 flash-plugin is not installed for the new firefox (After
 manually creating the directory and setting the right symlink
 it works all right)

KS installing as a user, should the flash-plugin not go into your
KS ~/.mozilla dir ?

I was going for a system-wide install

KS Also, sitewide plugins are better in /usr/lib/mozilla/plugins/
KS which comes from the xulrunner rpm.

You're right. I wasn't aware of this directory. The funny thing
is: the flash-plugin did a symbolic link to the .so there, but firefox
doesn't recognize it (It also doesn't find the java-plugin that is
installed there)

KS Atleast, thats the way I look at things, what am I missing
KS here ?

In the .mozilla/plugin it works, but that's not what I want.

For the time being I will put a symlink to the
/usr/lib/mozilla-directory into the firefox-3-directory (now all the
plugins are recognized)

Bernhard


[CentOS] Root-filesystem remounts as read-only during 5.2 upgrade (system completely shoot)

2008-06-25 Thread Bernhard Gschaider

Judging from the frequency of my messages here one could think that
I'm too stupid to upgrade a workstation to 5.2 (but the servers I've
tried work without problem)

OK. The problem: I've tried to upgrade a 5.1-x86_64-workstation to 5.2.
During the upgrade, immediately after (according to
/var/log/messages) upgrading the two (64 & 32-bit) libgcc-packages, an
EXT3-error occurs and the root-partition gets remounted in
read-only-mode. Consequently ALL following package upgrades throw
errors saying this or that can't be done because it can't write to
/etc, /usr but in the end yum says everything is OK.

(Side info: /var is on a different partition, so it is still
writeable)

At reboot the machine wants a manual fsck which throws a lot of
errors.
Then the machine reboots with a lot of error messages (basically
because it can't find /lib64/libgcc_s.so.1, nothing a symlink can't
fix). After that I try to fix things by manually upgrading the
libgcc-packages (otherwise yum won't run). "rpm -q" for selected
packages shows that the majority of the packages is still in 5.1-state.
Trying to "yum upgrade" again fails because the machine can't find any
servers (but network functionality seems OK)

So I try to reboot. And now the really strange thing happens: A simple
rpm -q rpm says that no rpm is installed. Rebuilding the
rpm-database doesn't help.

So I'm a bit stuck here with a machine that is in limbo.

My questions: what could have been the cause for this? (The machine was
working OK before that.) The only thing I feel a little guilty about
was deinstalling the nvidia-kernel-driver and not rebooting, but this
can't f### up the file-system, can it?

Any suggestions what I can do to get the machine into a normal state
again (apart from reinstalling from scratch)?


Re: [CentOS] No plugin-directory for firefox in CentOS 5.2

2008-06-25 Thread Bernhard Gschaider

 On Wed, 25 Jun 2008 17:49:22 +0100
 KS == Karanbir Singh [EMAIL PROTECTED] wrote:

KS Bernhard Gschaider wrote: Also, sitewide plugins are better in
KS /usr/lib/mozilla/plugins/ which comes from the xulrunner rpm.
 You're right. I wasn't aware of this directory. The funny thing
 is: the flash-plugin did a symbolic link to the .so there, but
 firefox doesn't recognize it (It also doesn't find the
 java-plugin that is installed there)

KS That might be a real bug, I wonder if you can try this on a
KS clean VM install or a real machine install and make sure that
KS the plugins from /usr/lib/mozilla/plugins are really not seen
KS by a fresh firefox-3 install ( I'd guess doing a useradd or a
KS user with no ~/.mozilla would also be suiteable to test this )

OK. Sorry to have bothered you. It seems to be a problem of that
specific user (me). The others on the same machines see the
/usr/lib/mozilla-plugins without a problem.

I was using a Firefox 3 installed from a tarball into /opt before. I
guess it must have fiddled around with some settings in ~/.mozilla (but
obviously not the fault of the RPMs)

Bernhard



Re: [CentOS] Root-filesystem remounts as read-only during 5.2 upgrade (system completely shoot)

2008-06-25 Thread Bernhard Gschaider

OK. I managed to beat the machine into submission. But a slight
uncertainty remains.

 On Wed, 25 Jun 2008 18:34:26 +0200
 BG == Bernhard Gschaider [EMAIL PROTECTED] wrote:

BG Judging from the frequency of my messages here one could think
BG that I'm too stupid to upgrade a workstation to 5.2 (but the
BG servers I've tried work without problem)

BG OK. The problem: I've tried to upgrade a
BG 5.1-x86_64-workstation to 5.2.  During the upgrade immidiatly
BG after (according to the /var/log/messages) upgrading the two
BG (64  32-bit) libgcc-packages an EXT3-error occurs and the
BG root-partition gets remounted in read-only-mode. Consequently
BG ALL following package upgrades throw errors saying this or
BG that can't be done because he can't write to /etc, /usr but in
BG the end yum says everything is OK.

BG (Side info: /var is on a different partition, so it is still
BG writeable)

That is my problem: Did that step of the upgrade leave the
rpm-database in a state that is not in tune with what is actually on
the disk?

BG At reboot the machine wants a manual fsck which throws a lot
BG of errors. Then the machine reboots with with a lot of error
BG messages (basically because it can't find /lib64/libgcc_s.so.1
BG (nothing a symlink can't fix). After that I try to fix things
BG by manually upgrading the libgcc-packages (otherwise yum won'T
BG run). rpm -q for selected packages shows that the majority
BG of the packages is still in 5.1-state.  Trying to yum
BG upgrade again fail because the machine can't find any servers
BG (but network functionality seems OK)

The problem was that the centos-release was removed at the start of
the upgrade (and that is needed by yum to determine which $releasever
to use)

BG So I try to reboot. And now the really strange thing happens:
BG A simple rpm -q rpm says that no rpm is
BG installed. Rebuilding the rpm-database doesn't help.

BG So I'm a bit stuck here with a machine that is in limbo.

By manually reinstalling the rpm.rpm and some other packages I managed
to kickstart the upgrade again (and it seems to have succeeded)

Only problem: in the second upgrade run fewer packages were listed as
due to be updated (roughly half). So I'm not sure: Are they marked
as upgraded in the rpm-database while in reality the old versions are
still on the disk?

Is there a way to say: "Hey RPM, have a look whether the files in
your database are really on the disk"?

BG My questions: what could have been the cause for this (the
BG machine was working OK before that). The only thing I feel a
BG little guilty about was deinstalling the nvidia-kernel-driver
BG and not reboot, but this can't f### up the file-system, can
BG it?

That question remains. But it is academic

BG Any suggestions what I can do to get the machine into a normal
BG state again (apart from reinstall from scratch)

As I said: solved

Thanks for listening


[CentOS] Problem with evolution while upgrading to CentOS 5.2

2008-06-24 Thread Bernhard Gschaider

Hi!

When trying to prepare a workstation (so that the real yum
upgrade doesn't have to download the packages) with

yum upgrade --downloadonly

it fails with these messages:


Error: Missing Dependency: libegroupwise-1.2.so.12 is needed by package evolution
Error: Missing Dependency: libgtkhtml-3.8.so.15 is needed by package evolution
Error: Missing Dependency: libexchange-storage-1.2.so.2 is needed by package evolution
Error: Missing Dependency: libexchange-storage-1.2.so.2()(64bit) is needed by package evolution
Error: Missing Dependency: libegroupwise-1.2.so.12()(64bit) is needed by package evolution


On the test-machine I solved this by uninstalling evolution (I don't
use it anyway), but this is not an option for the other
machines.

Could anyone give me a hint as to what is wrong?

Bernhard


[CentOS] Problem with nvidia-drv-x11 when upgrading to CentOS 5.2

2008-06-24 Thread Bernhard Gschaider

Hi!

I know that the nvidia-drv-x11-package is from RPMForge, but I thought
I might mention it here.

I have a machine with an NVidia card and, for ease of use, I
use the rpmforge-package for that.

After upgrading, the problem was that during the reboot, as soon as the
machine hit RHGB (which implicitly uses that driver), the machine
froze. I booted into the old kernel, removed rhgb and rebooted. Now the
machine froze when starting X. Same procedure: boot the old kernel, remove
nvidia-drv-x11, reboot into the new kernel and reinstall
nvidia-drv-x11. After a reboot everything works as it used to.

The cause of the problem could have been (sorry, I didn't do any
thorough forensics) that the nvidia.ko used by the new kernel
was a symbolic link from the weak-updates directory into the old
kernel's extra directory. Is it possible that this only works between
kernels with the same x in CentOS 5.x?

For the other machines I am upgrading I plan to

1. deinstall nvidia-x11-drv
2. do a regular yum upgrade
3. reinstall nvidia-x11-drv after reboot

Is there any better way to do it?

Bernhard

PS: the main purpose of this message is to alert others to this
problem, not to get an answer, although that would be nice.


[CentOS] udevd can't reach LDAP-server during boot

2008-06-24 Thread Bernhard Gschaider

Hi!

I'm using CentOS 5.1 (x86_64) machines which authenticate using
LDAP. At the start of booting I get messages like this:

udevd[1158]: nss_ldap: failed to bind to LDAP server ldaps://ldap.server.example.com/: Can't contact LDAP server
udevd[1158]: nss_ldap: reconnecting to LDAP server...
udevd[1158]: nss_ldap: could not connect to any LDAP server as (null) - Can't contact LDAP server
udevd[1158]: nss_ldap: failed to bind to LDAP server ldaps://ldap.server.example.com/: Can't contact LDAP server
udevd[1158]: nss_ldap: reconnecting to LDAP server (sleeping 1 seconds)...

This escalates to 2, 4, 8, 16, 32, 64 seconds. After that
(i.e. after timing out for about 2 minutes) booting continues without
problems, so this is not really a showstopper, but it is inconvenient.
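
The "about 2 minutes" can be checked by adding up the sleep intervals. The sketch below assumes the delays are exactly the doubling series from the log (1 up to 64 seconds) and ignores the time each connection attempt itself takes.

```shell
# Sum the sleep intervals reported by nss_ldap: 1, 2, 4, ... 64 seconds.
total=0
delay=1
while [ "$delay" -le 64 ]; do
    total=$((total + delay))
    delay=$((delay * 2))
done
echo "$total seconds"   # prints "127 seconds", a bit over 2 minutes
```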

Googling around revealed some fixes for Debian/Ubuntu. The bottom line
is that udevd needs some user/group that is not in the local files but
only on the LDAP-server. The fix usually was to add this user (nvram,
scanner ...) or group locally. The difference is that on those systems,
after the last attempt, something like

udevd[1158]: lookup_group: error resolving group 'rdma': Illegal seek

is printed out, making it easy to find the right group/user. Is there
a way to get CentOS to behave similarly, making it easier to find
the culprit?
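
Without such a log message, one possible way to narrow down the candidates is to compare the names used in the udev rules with the local files. This is only a sketch under assumptions: the helper name find_nonlocal_udev_names is made up, the rules are assumed to use the OWNER="name" and GROUP="name" syntax, and the rules are assumed to live in /etc/udev/rules.d.

```shell
# Hypothetical helper: list OWNER/GROUP names referenced in udev rules
# that are absent from the given local passwd/group files.
# Args: rules directory, passwd file, group file.
find_nonlocal_udev_names() {
    rules_dir=$1; passwd_file=$2; group_file=$3
    cat "$rules_dir"/*.rules 2>/dev/null |
        grep -v '^[[:space:]]*#' |                 # skip comment lines
        grep -oE '(OWNER|GROUP)="[^"]+"' |         # e.g. GROUP="rdma"
        sort -u |
        while IFS='=' read -r key value; do
            name=${value#\"}; name=${name%\"}      # strip the quotes
            case $key in
                OWNER) file=$passwd_file ;;
                GROUP) file=$group_file ;;
            esac
            # Report names that have no entry in the local file.
            grep -q "^$name:" "$file" || echo "$key $name"
        done
}

# Typical use:
#   find_nonlocal_udev_names /etc/udev/rules.d /etc/passwd /etc/group
```

Every name it prints is one that udevd would have to resolve through LDAP during boot, so those are the candidates for adding locally.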

Bernhard


Re: [CentOS] Problem with nvidia-drv-x11 when upgrading to CentOS 5.2

2008-06-24 Thread Bernhard Gschaider
 On Tue, 24 Jun 2008 15:17:56 -0400 (EDT)
 MAJ == Marko A Jennings [EMAIL PROTECTED] wrote:

MAJ On Tue, June 24, 2008 3:08 pm, Bernhard Gschaider wrote:
 Hi!
 
 I know that the nvidia-drv-x11-package is from RPMForge, but I
 thought I might mention it here.
 
 I have a machine with a NVidia-card and because of the ease of
 use I use the rpmforge-package for that.
 
 After upgrading the problem was that during rebooting as soon
 as the machine hit the RHGB (which implicitly uses that driver)
 the machine froze. I booted into the old kernel removed rhgb,
 rebooted. Now the machine froze when starting X. Same
 procedure: Old Kernel. Remove nvidia-drv-x11. Reboot into the
 new kernel and reinstall nvidia-drv-x11. After a reboot
 everything works as it used to.
 
 The cause of the problem could have been (sorry, I didn't do
 any thorough forensic) that the nvidia.ko that was used in the
 new kernel was a symbolic link from the
 weak-updates-directory into the old kernel
 extra-directory. Is it possible that this only works inside
 of kernels with the same x in CentOS 5.x ?
 
 For the other machines I am upgrading I plan to
 
 1. deinstall nvidia-x11-drv
 2. do a regular yum upgrade
 3. reinstall nvidia-x11-drv after reboot

MAJ That is exactly what I did and my machine hangs again after
MAJ nvidia-x11-drv is installed.

I THINK (but I'm not sure) I manually had to remove the
weak-updates/nvidia.ko and/or the extra/nvidia.ko before doing the
reinstall (don't know if that makes a difference). My guess would be

1. deinstall the package in the old kernel
2. remove these links and files in the new kernel
3. boot into the new kernel
4. reinstall the package (that should take some time, because as far as I
can tell this involves recompiling the module)
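
To see which links step 2 is about, something like the following could be used. It is a sketch only: the helper name show_weak_update_links is made up, and the usual location /lib/modules/<kernel version>/weak-updates is an assumption on my part.

```shell
# Hypothetical helper: print each symlinked module under a weak-updates
# directory together with the target stored in the link.
# Arg: a weak-updates directory.
show_weak_update_links() {
    find "$1" -type l -name '*.ko' | sort | while read -r link; do
        echo "$link -> $(readlink "$link")"
    done
}

# Typical use for the running kernel:
#   show_weak_update_links "/lib/modules/$(uname -r)/weak-updates"
```

If the nvidia.ko shown there points into the extra directory of the old kernel, that is the link I mean.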


Bernhard

PS: Does your hang look the same, i.e. the screen turns pitch black?



Re: [CentOS] Problem with evolution while upgrading to CentOS 5.2

2008-06-24 Thread Bernhard Gschaider

 On Tue, 24 Jun 2008 14:25:11 -0500
 JH == Johnny Hughes [EMAIL PROTECTED] wrote:

JH Bernhard Gschaider wrote:
 Hi!
 
 When trying to prepare a workstation with (so that the real yum
 upgrade doesn't have to download the packages)
 
 yum upgrade --downloadonly
 
 it fails with these messages:
 
 Error: Missing Dependency: libegroupwise-1.2.so.12 is needed by package evolution
 Error: Missing Dependency: libgtkhtml-3.8.so.15 is needed by package evolution
 Error: Missing Dependency: libexchange-storage-1.2.so.2 is needed by package evolution
 Error: Missing Dependency: libexchange-storage-1.2.so.2()(64bit) is needed by package evolution
 Error: Missing Dependency: libegroupwise-1.2.so.12()(64bit) is needed by package evolution
 
 On the test-machine I solved this by deinstalling evolution (I
 don't use it anyway), but this is not an option for the other
 machines.
 
 Could anyone give me a hint what is wrong?

JH evolution should be upgraded as part of the upgrade ... 

Thought so, too ;)

JH but your system seems to want to keep the old version of evolution
JH while trying to upgrade the other packages that are listed.

JH This must be caused by something in your configuration that is
JH removing evolution from the update set.

JH This something can either be an exclude=evolution in your
JH CentOS-Base.repo file or some kind of misconfiguration of the
JH priorities or protectbase plugins.

Didn't figure out what the problem was, but a rigorous

 yum upgrade -y --disablerepo=* --enablerepo=base --enablerepo=update --downloadonly

seems to do the trick.

Thanks for the hint
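
For anyone who wants to follow up on the exclude= suggestion, a quick check could look like this. It is a sketch; the helper name find_yum_excludes is made up, and the file locations in the usage line are the usual ones as far as I know.

```shell
# Hypothetical helper: print exclude= lines (with file name and line
# number) found in the given yum configuration files.
find_yum_excludes() {
    grep -nH -E '^[[:space:]]*exclude[[:space:]]*=' "$@" 2>/dev/null
}

# Typical use:
#   find_yum_excludes /etc/yum.conf /etc/yum.repos.d/*.repo
```

No output means none of the given files contains an exclude line, which would point towards the plugins instead.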




Re: [CentOS] FireFox

2008-05-29 Thread Bernhard Gschaider

 On Tue, 27 May 2008 21:51:07 -0400
 WLM == William L Maltby [EMAIL PROTECTED] wrote:

WLM On Tue, 2008-05-27 at 17:19 -0700, MHR wrote:
 On Tue, May 27, 2008 at 5:02 PM, Robert Spangler
 [EMAIL PROTECTED] wrote:
 
  Can I use one out of the Fedora's repos?  If so, which repo?
 
 
 You can just pull down the latest version from mozilla.org -
[snipped]

WLM I chose not to uninstall the distributed one. I unpacked the
WLM 3.0rc1 tarball in the $HOME of a user and run it from there.

[snipped recipe]

WLM This lets all other users get the box-stock version while I
WLM test the new one. So far it's looking pretty good.

Question: if I do this, will I be able to move back to the
stock 1.5 version without problems? In other words: is the stuff like
bookmarks, history etc. that is written to disk backward-compatible?