Re: [Lustre-discuss] ne2scan and fsfind

2011-02-17 Thread DEGREMONT Aurelien
Hello

I advise you to have a look at Robinhood, which has a ton of features to 
handle Lustre file management:
http://robinhood.sf.net/ http://robinhood.sourceforge.net

And if you think this does not fit your needs I will be very interested 
to know why.

Regards,

Aurélien Degrémont

Andrus, Brian Contractor a écrit :

 Hello all!

 I have been googling around trying to find some good tools to handle 
 the large fileystems we have grown here and see reference to ‘ne2scan’ 
 and ‘fsfind’.

 They sound great, but I am not finding a place to get them.

 Does anyone out there know where these gems are hidden?

 Brian Andrus

 ITACS/Research Computing

 Naval Postgraduate School

 Monterey, California

 voice: 831-656-6238

 

 ___
 Lustre-discuss mailing list
 Lustre-discuss@lists.lustre.org
 http://lists.lustre.org/mailman/listinfo/lustre-discuss
   

___
Lustre-discuss mailing list
Lustre-discuss@lists.lustre.org
http://lists.lustre.org/mailman/listinfo/lustre-discuss


[Lustre-discuss] Kernel Panic error after lustre 2.0 installation

2011-02-17 Thread Arya Mazaheri
Hi there,
I have got an error after installing lustre 2.0 on the MGS server with RAID
controller card.
The server OS is centOS 5.4 x86_64 and has 1.2TB storage which has
configured by RAID 1+0.
After installing lustre rpm packages and rebooting machine, I face with the
errors below at linux startup:

Unable to access resume device (LABEL=SWAP-sda3)
mount: could not find filesystem 'dev/root'
setuproot: moving /dev failed: No such file or directory
setuproot: error mounting /proc: No such file or directory
setuproot: error mounting /sys: No such file or directory
swirchroot: mount failed: No such file or directory
Kernel Panic - not syncing: Attempted to kill init!

I have no problem with the original kernel installed by centos. I guessed
this may be related to RAID controller card driver which may not loaded by
the patched lustre kernel. so I have added the driver into the initrd.img
file. But it didn't solve the problem.

Should I install the lustre by building the source? Or any other clue to
this problem?

Thanks in advance...
___
Lustre-discuss mailing list
Lustre-discuss@lists.lustre.org
http://lists.lustre.org/mailman/listinfo/lustre-discuss


Re: [Lustre-discuss] Kernel Panic error after lustre 2.0 installation

2011-02-17 Thread Arya Mazaheri
The driver name is arcmsr.ko and I extracted it from driver.img included
in RAID controller's CD. The following text file may clarify better:

ftp://areca.starline.de/RaidCards/AP_Drivers/Linux/DRIVER/RedHat/FedoraCore/Redhat-Fedora-core8/1.20.0X.15/Intel/readme.txt

Please tell me, if you need more information about this issue...

On Thu, Feb 17, 2011 at 11:33 PM, Brian J. Murrell br...@whamcloud.comwrote:

 On Thu, 2011-02-17 at 23:26 +0330, Arya Mazaheri wrote:
  Hi there,

 Hi,

  Unable to access resume device (LABEL=SWAP-sda3)
  mount: could not find filesystem 'dev/root'
  setuproot: moving /dev failed: No such file or directory
  setuproot: error mounting /proc: No such file or directory
  setuproot: error mounting /sys: No such file or directory
  swirchroot: mount failed: No such file or directory
  Kernel Panic - not syncing: Attempted to kill init!
 
  I have no problem with the original kernel installed by centos. I
  guessed this may be related to RAID controller card driver which may
  not loaded by the patched lustre kernel.

 That seems like a reasonable conclusion given the information available.

  so I have added the driver into the initrd.img file.

 Where did you get the driver from?  What is the name of the driver?

  But it didn't solve the problem.

 Depending on where it came from, yes, it might not.

  Should I install the lustre by building the source?

 That may be required, but not necessarily required.  We need more
 information.

 b.



 ___
 Lustre-discuss mailing list
 Lustre-discuss@lists.lustre.org
 http://lists.lustre.org/mailman/listinfo/lustre-discuss




signature.asc
Description: PGP signature
___
Lustre-discuss mailing list
Lustre-discuss@lists.lustre.org
http://lists.lustre.org/mailman/listinfo/lustre-discuss


Re: [Lustre-discuss] Kernel Panic error after lustre 2.0 installation

2011-02-17 Thread Kevin Van Maren
Yep.  All you have to do is rebuild the driver for the Lustre kernel.

First, bring the system back up with the non-Lustre kernel.



See the bottom of the readme:

# cd /usr/src/linux/drivers/scsi/arcmsr
(suppose /usr/src/linux is the soft-link for 
/usr/src/kernel/2.6.23.1-42.fc8-i386)
# make -C /lib/modules/`uname -r`/build CONFIG_SCSI_ARCMSR=m 
SUBDIRS=$PWD modules
# insmod arcmsr.ko

Except instead of uname -r substitute the lustre kernel's 'uname -r', 
as you want to build for the Lustre kernel.  Be sure you have the Lustre 
kernel-devel RPM installed.

Note that the insmod will not work (you already have it for the 
running kernel, and the one you built for the Lustre kernel will not 
work).  You will need to rebuild the initrd for the Lustre kernel (see 
the other instructions in the readme, using the Lustre kernel).

Kevin


Arya Mazaheri wrote:
 The driver name is arcmsr.ko and I extracted it from driver.img 
 included in RAID controller's CD. The following text file may clarify 
 better:

 ftp://areca.starline.de/RaidCards/AP_Drivers/Linux/DRIVER/RedHat/FedoraCore/Redhat-Fedora-core8/1.20.0X.15/Intel/readme.txt

 Please tell me, if you need more information about this issue...

 On Thu, Feb 17, 2011 at 11:33 PM, Brian J. Murrell 
 br...@whamcloud.com mailto:br...@whamcloud.com wrote:

 On Thu, 2011-02-17 at 23:26 +0330, Arya Mazaheri wrote:
  Hi there,

 Hi,

  Unable to access resume device (LABEL=SWAP-sda3)
  mount: could not find filesystem 'dev/root'
  setuproot: moving /dev failed: No such file or directory
  setuproot: error mounting /proc: No such file or directory
  setuproot: error mounting /sys: No such file or directory
  swirchroot: mount failed: No such file or directory
  Kernel Panic - not syncing: Attempted to kill init!
 
  I have no problem with the original kernel installed by centos. I
  guessed this may be related to RAID controller card driver which may
  not loaded by the patched lustre kernel.

 That seems like a reasonable conclusion given the information
 available.

  so I have added the driver into the initrd.img file.

 Where did you get the driver from?  What is the name of the driver?

  But it didn't solve the problem.

 Depending on where it came from, yes, it might not.

  Should I install the lustre by building the source?

 That may be required, but not necessarily required.  We need more
 information.

 b.



 ___
 Lustre-discuss mailing list
 Lustre-discuss@lists.lustre.org
 mailto:Lustre-discuss@lists.lustre.org
 http://lists.lustre.org/mailman/listinfo/lustre-discuss


 

 ___
 Lustre-discuss mailing list
 Lustre-discuss@lists.lustre.org
 http://lists.lustre.org/mailman/listinfo/lustre-discuss
   

___
Lustre-discuss mailing list
Lustre-discuss@lists.lustre.org
http://lists.lustre.org/mailman/listinfo/lustre-discuss


Re: [Lustre-discuss] Lustre client error

2011-02-17 Thread Kevin Van Maren
To figure out which OST is which, use e2label /dev/sdX (or e2label 
/dev/mapper/mpath7) which will print the OST index in hex.

If clients run out of space, but there is space left, see Bug 22755 
(mostly fixed in Lustre 1.8.4).

Lustre assigns the OST index at file creation time.  Lustre will avoid 
full OSTs, but once a file is created any growth must be accommodated by 
the initial OST assignment(s).  Deactivating the OST on the MDS will 
prevent new allocations, but they shouldn't be happening anyway.

You can copy/rename some large files to put them on another OST which 
will free up space on the full OST (move will not allocate new space, 
just change the directory name).

Kevin



Jagga Soorma wrote:
 This OST is 100% now with only 12GB remaining and something is 
 actively writing to this volume.  What would be the appropriate thing 
 to do in this scenario?  If I set this to read only on the mds then 
 some of my clients start hanging up.

 Should I be running lfs find -O OST_UID /lustre and then move the 
 files out of this filesystem and re-add them back?  But then there is 
 no gurantee that they will not be written to this specific OST.

 Any help would be greately appreciated.

 Thanks,
 -J

 On Tue, Feb 15, 2011 at 3:05 PM, Jagga Soorma jagg...@gmail.com 
 mailto:jagg...@gmail.com wrote:

 I might be looking at the wrong OST.  What is the best way to map
 the actual /dev/mapper/mpath[X] to what OST ID is used for that
 volume?

 Thanks,
 -J


 On Tue, Feb 15, 2011 at 3:01 PM, Jagga Soorma jagg...@gmail.com
 mailto:jagg...@gmail.com wrote:

 Also, it looks like the client is reporting a different %used
 compared to the oss server itself:

 client:
 reshpc101:~ # lfs df -h | grep -i 0007
 reshpcfs-OST0007_UUID  2.0T  1.7T202.7G   84%
 /reshpcfs[OST:7]

 oss:
 /dev/mapper/mpath72.0T  1.9T   40G  98%
 /gnet/lustre/oss02/mpath7

 Here is how the data seems to be distributed on one of the OSS's:
 --
 /dev/mapper/mpath52.0T  1.2T  688G  65%
 /gnet/lustre/oss02/mpath5
 /dev/mapper/mpath62.0T  1.7T  224G  89%
 /gnet/lustre/oss02/mpath6
 /dev/mapper/mpath72.0T  1.9T   41G  98%
 /gnet/lustre/oss02/mpath7
 /dev/mapper/mpath82.0T  1.3T  671G  65%
 /gnet/lustre/oss02/mpath8
 /dev/mapper/mpath92.0T  1.3T  634G  67%
 /gnet/lustre/oss02/mpath9
 --

 -J


 On Tue, Feb 15, 2011 at 2:37 PM, Jagga Soorma
 jagg...@gmail.com mailto:jagg...@gmail.com wrote:

 I did deactivate this OST on the MDS server.  So how would
 I deal with a OST filling up?  The OST's don't seem to be
 filling up evenly either.  How does lustre handle a OST
 that is at 100%?  Would it not use this specific OST for
 writes if there are other OST available with capacity? 

 Thanks,
 -J


 On Tue, Feb 15, 2011 at 11:45 AM, Andreas Dilger
 adil...@whamcloud.com mailto:adil...@whamcloud.com wrote:

 On 2011-02-15, at 12:20, Cliff White wrote:
  Client situation depends on where you deactivated
 the OST - if you deactivate on the MDS only, clients
 should be able to read.
 
  What is best to do when an OST fills up really
 depends on what else you are doing at the time, and
 how much control you have over what the clients are
 doing and other things.  If you can solve the space
 issue with a quick rm -rf, best to leave it online,
 likewise if all your clients are trying to bang on it
 and failing, best to turn things off. YMMV

 In theory, with 1.8 the full OST should be skipped for
 new object allocations, but this is not robust in the
 face of e.g. a single very large file being written to
 the OST that takes it from average usage to being full.

  On Tue, Feb 15, 2011 at 10:57 AM, Jagga Soorma
 jagg...@gmail.com mailto:jagg...@gmail.com wrote:
  Hi Guys,
 
  One of my clients got a hung lustre mount this
 morning and I saw the following errors in my logs:
 
  --
  ..snip..
  Feb 15 09:38:07 reshpc116 kernel: LustreError: 11-0:
 an error occurred while communicating with
 10.0.250.47@o2ib3. The ost_write operation failed with -28
  Feb 15 09:38:07 reshpc116 kernel: LustreError:
 Skipped 4755836 previous similar messages
  Feb 15 09:48:07 reshpc116 kernel: LustreError: 11-0:
 an 

[Lustre-discuss] Is this setup possible

2011-02-17 Thread vilobh meshram
Hi,

Is this setup possible.I want to install a patchless client on the client
nodes.

Following are details :-

1) Server has no OFED ; client has OFED 1.5.2.

2) Client / Server has Lustre 1.8.3.

3) Client Linux version   : 2.6.30

Server Linux Version : 2.6.18-164.11.1

Is it possible that if I install different Linux version of Client/Server
but same Lustre version on Client/Server the things will work fine.
Also since I want to install patchless client on the client side do I need
to restrict the Lustre setup to some specific version of the Linux Kernel.

Thanks,
Vilobh
___
Lustre-discuss mailing list
Lustre-discuss@lists.lustre.org
http://lists.lustre.org/mailman/listinfo/lustre-discuss


[Lustre-discuss] Fwd: Query regarding Lustre setup

2011-02-17 Thread vilobh meshram
Hi,

Is this setup possible.I want to install a patchless client on the client
nodes.

Following are details :-

1) Server has no OFED ; client has OFED 1.5.2.

2) Client / Server has Lustre 1.8.3.

3) Client Linux version   : 2.6.30

Server Linux Version : 2.6.18-164.11.1

Is it possible that if I install different Linux version of Client/Server
but same Lustre version on Client/Server the things will work fine.
Also since I want to install patchless client on the client side do I need
to restrict the Lustre setup to some specific version of the Linux Kernel.

Thanks,
Vilobh
___
Lustre-discuss mailing list
Lustre-discuss@lists.lustre.org
http://lists.lustre.org/mailman/listinfo/lustre-discuss


[Lustre-discuss] Query regarding Lustre Setup

2011-02-17 Thread Vilobh Meshram
 Hi,

Is this setup possible.I want to install a patchless client on the client
nodes.

Following are details :-

1) Server has no OFED ; client has OFED 1.5.2.

2) Client / Server has Lustre 1.8.3.

3) Client Linux version   : 2.6.30

Server Linux Version : 2.6.18-164.11.1

Is it possible that if I install different Linux version of Client/Server
but same Lustre version on Client/Server the things will work fine.
Also since I want to install patchless client on the client side do I need
to restrict the Lustre setup to some specific version of the Linux Kernel.


Thanks,
Vilobh
**
___
Lustre-discuss mailing list
Lustre-discuss@lists.lustre.org
http://lists.lustre.org/mailman/listinfo/lustre-discuss


Re: [Lustre-discuss] Is this setup possible

2011-02-17 Thread Cliff White
All the nodes have to run the same network type, so they can talk to one
another. If client is runnng
Infiniband, server must also run Infiniband, in most cases. See the Lustre
Manual for information on
Lustre Routing.
Clients and server can run different versions of Lustre.
You need to run a version of the kernel supported by Lustre, even with
patchless clients.
See Lustre release notes for kernel versions.
cliffw


On Thu, Feb 17, 2011 at 2:54 PM, vilobh meshram meshram.vil...@gmail.comwrote:

 Hi,

 Is this setup possible.I want to install a patchless client on the client
 nodes.

 Following are details :-

 1) Server has no OFED ; client has OFED 1.5.2.

 2) Client / Server has Lustre 1.8.3.

 3) Client Linux version   : 2.6.30

 Server Linux Version : 2.6.18-164.11.1

 Is it possible that if I install different Linux version of Client/Server
 but same Lustre version on Client/Server the things will work fine.
 Also since I want to install patchless client on the client side do I need
 to restrict the Lustre setup to some specific version of the Linux Kernel.

 Thanks,
 Vilobh

 ___
 Lustre-discuss mailing list
 Lustre-discuss@lists.lustre.org
 http://lists.lustre.org/mailman/listinfo/lustre-discuss


___
Lustre-discuss mailing list
Lustre-discuss@lists.lustre.org
http://lists.lustre.org/mailman/listinfo/lustre-discuss