Thanks for the update.

On 4/26/07, Greenseid, Joseph M. <[EMAIL PROTECTED]> wrote:

For the sake of closure, I figured I'd come back to this to update on what
I found.

My OS is CentOS 4.4.  I think that it is actually a network issue, not an
NFS issue.  I found some similar problems on the CentOS mailing list
referring to this.

Anyway, in my testing, I threw a 10 packet ping test into the netfs init
script, and though all packets get dropped, it appears to be enough to let
the network sort itself out, so NFS is now mounting the remote file
systems.  It's very bizarre, and I will continue investigating, but at least
it's I've figured out where the problem is and it's mostly working now...

--Joe

________________________________

From: [EMAIL PROTECTED] on behalf of Michael
Edwards
Sent: Mon 4/16/2007 9:44 AM
To: oscar-users@lists.sourceforge.net
Subject: Re: [Oscar-users] NFS file systems not mounting on boot


Have you tried turning off pfilter?  I hadn't noticed you were running
4.2.1, but the pfilter package has problems on some systems.  I don't
think they were with NFS, but since this appears to be some sort of boot
timing issue you might try making a new image which doesn't include it and
push that to the nodes.  You'll need to turn the service off on the head
node as well.

You might try a more specific NFS list, OSCAR doesn't do much with the
stock NFS install to my knowledge so the fact that they are unfamiliar with
OSCAR shouldn't mater much.

How many nodes are you using?  Have you tested with only one node to see
if it is a network bottleneck of some kind?  My brain keeps straying toward
the switch, or service startup order, since everything works once everything
is going.


On 4/16/07, Greenseid, Joseph M. <[EMAIL PROTECTED]> wrote:

       I made this change but it does not seem to help.  The nodes still
don't catch the NFS file systems on boot. Very strange...

       --Joe

       ________________________________

       From: [EMAIL PROTECTED] on behalf of
Greenseid, Joseph M.
       Sent: Fri 4/13/2007 8:43 AM
       To: oscar-users@lists.sourceforge.net;
oscar-users@lists.sourceforge.net
       Subject: Re: [Oscar-users] NFS file systems not mounting on boot



       Thanks for the suggestion.  I'll give it a try and let you know how
it goes.

       --Joe

       ________________________________

       From: [EMAIL PROTECTED] on behalf of
Michael Edwards
       Sent: Thu 4/12/2007 5:54 PM
       To: oscar-users@lists.sourceforge.net
       Subject: Re: [Oscar-users] NFS file systems not mounting on boot


       One thing you can try is to kick the value of the System V startup
script toward the end of the boot process.  I have had occasional problems
where the network startup took a very long time but seemed to go on in the
background after the system said [ok] and there was not any connectivity
until much later in the boot process.

       check in /var/lib/systemimager/images/<imagename>/etc/rc3.d/ for
netfs and nfslock (at least), they should show up as symlinks, something
like S25netfs and S14nfslock.  You can mv the file to something like
S90netfs and S90nfslock.  You may need to play with this, but if nfslock is
broken things should work, just with lots of errors in the log files.

       Don't forget to either cpush the files to the /etc/rc3.d on the
nodes and/or reimage the nodes.

       This is a hack, but I have "had" to do it with autofs on some of my
systems to get things to work.


       On 4/12/07, Greenseid, Joseph M. <[EMAIL PROTECTED]> wrote:

               The head node is not rebooting during this process.  It is
simply up and running; a good example of this is a node installation.

               On the head node, I am running the OSCAR Wizard; I am in
the monitor cluster screen, and the nodes are installing.  Upon completion
of the install, I set them to reboot.  They reboot (with the head node still
up, running the OSCAR Wizard), and no NFS file systems.  When I click the
"complete cluster setup" button after they reboot, it says success, and then
do the "test cluster setup," the new nodes do not have /home mounted, and
that test fails...

               --Joe

               ________________________________

               From: [EMAIL PROTECTED] on behalf
of Michael Edwards
               Sent: Thu 4/12/2007 2:15 PM
               To: oscar-users@lists.sourceforge.net
               Subject: Re: [Oscar-users] NFS file systems not mounting on
boot



               Are you completely booting the head node before you boot
the client
               nodes?  If you don't the client nodes boot much faster than
the head
               (because they run  so few services) and they will generaly
finish
               booting before the nfs server is up, so the mounts fail.

               On 4/12/07, Greenseid, Joseph M. < [EMAIL PROTECTED]>
wrote:
               > I am installing OSCAR 4.2.1 on an IA64 cluster.  When the
nodes reboot (after installation, and also every time after that), they fail
to mount any NFS file systems.  However, once the node has finished booting,
if I do a "mount /home" it mounts instantly.
               >
               > My exports file on my head node says:
               >
               > ~]$ cat /etc/exports
               > #
               > /home 10.2.148.1/255.255.255.0(async,rw,no_root_squash)
               > /share 10.2.148.1/255.255.255.0(async,rw)
               > ~]$
               >
               > My fstab entry on a compute node looks like this:
               >
               > # This file is edited by fstab-sync - see 'man
fstab-sync' for details
               > /dev/sda3       swap    swap    defaults        0       0
               > /dev/sda2       /       ext3    defaults        1       2
               > /dev/sda4       /tmp    ext3    defaults        1       2
               > /dev/sda1       /boot/efi
vfat    defaults        1       2
               > /dev/fd0        /mnt/floppy
auto    noauto,owner    0       0
               > none    /dev/pts        devpts  defaults        0       0
               > none    /proc   proc    defaults        0       0
               > nfs_oscar:/share        /share  nfs     rw      0       0
               > nfs_oscar:/home /home   nfs     rw      0       0
               > none      /dev/shm        tmpfs   defaults        0 0
               >
               >
               > I tried changing nfs_oscar to the head node's eth0 IP
addr, and the file looks like this:
               >
               >
               > # This file is edited by fstab-sync - see 'man
fstab-sync' for details
               > /dev/sda3       swap    swap    defaults        0       0
               > /dev/sda2       /       ext3    defaults        1       2
               > /dev/sda4       /tmp    ext3    defaults        1       2
               > /dev/sda1       /boot/efi
vfat    defaults        1       2
               > /dev/fd0        /mnt/floppy
auto    noauto,owner    0       0
               > none    /dev/pts        devpts  defaults        0       0
               > none    /proc   proc    defaults        0       0
               > 10.2.148.1:/share       /share  nfs     rw      0       0
               > 10.2.148.1:/home        /home   nfs     rw      0       0
               > none      /dev/shm        tmpfs   defaults        0 0
               >
               > However, NFS mounting during boot fails in both cases.
               >
               > I get variations of the error message in my
/var/log/messages log from the failures during boot:
               >
               > Apr 12 13:23:58 compute-15-01 mount: mount: mount to NFS
server 'nfs_oscar' failed:
               > Apr 12 13:23:58 compute-15-01 mount: System Error: No
route to host.
               >
               > or
               >
               > Apr 12 13:24:07 compute-15-01 mount: mount: mount to NFS
server 'nfs_oscar' failed: System Error: Connection refused
               >
               > or
               >
               > Apr 12 13:32:25 compute-15-01 mount: mount: mount to NFS
server '10.2.148.1' failed:
               > Apr 12 13:32:25 compute-15-01 mount: System Error: No
route to host.
               >
               > As I said, in both instances (both IP address and
nfs_oscar hostname in the fstab), once booting was complete, the command
"mount /home" worked perfectly fine.
               >
               > Any idea why it isn't working during boot?
               >
               > Thanks,
               > --Joe
               >
               >
-------------------------------------------------------------------------
               > Take Surveys. Earn Cash. Influence the Future of IT
               > Join SourceForge.net's Techsay panel and you'll get the
chance to share your
               > opinions on IT & business topics through brief
surveys-and earn cash
               >
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
               > _______________________________________________
               > Oscar-users mailing list
               > Oscar-users@lists.sourceforge.net
               > https://lists.sourceforge.net/lists/listinfo/oscar-users
               >


-------------------------------------------------------------------------
               Take Surveys. Earn Cash. Influence the Future of IT
               Join SourceForge.net 's Techsay panel and you'll get the
chance to share your
               opinions on IT & business topics through brief surveys-and
earn cash

http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV<
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV>
               _______________________________________________
               Oscar-users mailing list
                Oscar-users@lists.sourceforge.net <mailto:
Oscar-users@lists.sourceforge.net>
               https://lists.sourceforge.net/lists/listinfo/oscar-users




-------------------------------------------------------------------------
               Take Surveys. Earn Cash. Influence the Future of IT
               Join SourceForge.net's Techsay panel and you'll get the
chance to share your
               opinions on IT & business topics through brief surveys-and
earn cash

http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
               _______________________________________________
               Oscar-users mailing list
               Oscar-users@lists.sourceforge.net
               https://lists.sourceforge.net/lists/listinfo/oscar-users





-------------------------------------------------------------------------
       Take Surveys. Earn Cash. Influence the Future of IT
       Join SourceForge.net's Techsay panel and you'll get the chance to
share your
       opinions on IT & business topics through brief surveys-and earn
cash

http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
       _______________________________________________
       Oscar-users mailing list
       Oscar-users@lists.sourceforge.net
       https://lists.sourceforge.net/lists/listinfo/oscar-users




-------------------------------------------------------------------------
       This SF.net email is sponsored by DB2 Express
       Download DB2 Express C - the FREE version of DB2 express and take
       control of your XML. No limits. Just data. Click to get it now.
       http://sourceforge.net/powerbar/db2/
       _______________________________________________
       Oscar-users mailing list
       Oscar-users@lists.sourceforge.net
       https://lists.sourceforge.net/lists/listinfo/oscar-users






-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Oscar-users mailing list
Oscar-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/oscar-users



-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Oscar-users mailing list
Oscar-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/oscar-users

Reply via email to