So, I changed the internal IP to 192.168.1.10 to prevent any would-be conflicts, ran the start_over script, and went through the usual setup process.

Basically the same thing happened, but I got the iptable and verbose SSH output. Maybe you can help make a little sense out of it.
Here is the iptable output:
=====

Chain INPUT (policy ACCEPT)
target     prot opt source               destination

Chain FORWARD (policy ACCEPT)
target     prot opt source               destination

Chain OUTPUT (policy ACCEPT)
target     prot opt source               destination

=====

Here is the SSH -vvv command when run from oscarnode01 trying to get back into the head node:

=====

[EMAIL PROTECTED] ~]# ssh -vvv 192.168.1.10
OpenSSH_3.9p1, OpenSSL 0.9.7a Feb 19 2003
debug1: Reading configuration data /etc/ssh/ssh_config
debug1: Applying options for *
debug2: ssh_connect: needpriv 0
debug1: Connecting to 192.168.1.10 [192.168.1.10] port 22.
debug1: Connection established.
debug1: permanently_set_uid: 0/0
debug1: identity file /root/.ssh/identity type 0
debug3: Not a RSA1 key file /root/.ssh/id_rsa.
debug2: key_type_from_name: unknown key type '-----BEGIN'
debug3: key_read: missing keytype
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug2: key_type_from_name: unknown key type '-----END'
debug3: key_read: missing keytype
debug1: identity file /root/.ssh/id_rsa type 1
debug3: Not a RSA1 key file /root/.ssh/id_dsa.
debug2: key_type_from_name: unknown key type '-----BEGIN'
debug3: key_read: missing keytype
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug2: key_type_from_name: unknown key type '-----END'
debug3: key_read: missing keytype
debug1: identity file /root/.ssh/id_dsa type 2

debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug2: key_type_from_name: unknown key type '-----END'
debug3: key_read: missing keytype
debug1: identity file /root/.ssh/id_rsa type 1
debug3: Not a RSA1 key file /root/.ssh/id_dsa.
debug2: key_type_from_name: unknown key type '-----BEGIN'
debug3: key_read: missing keytype
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug2: key_type_from_name: unknown key type '-----END'
debug3: key_read: missing keytype
debug1: identity file /root/.ssh/id_dsa type 2

debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug2: key_type_from_name: unknown key type '-----END'
debug3: key_read: missing keytype
debug1: identity file /root/.ssh/id_rsa type 1
debug3: Not a RSA1 key file /root/.ssh/id_dsa.
debug2: key_type_from_name: unknown key type '-----BEGIN'
debug3: key_read: missing keytype
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug2: key_type_from_name: unknown key type '-----END'
debug3: key_read: missing keytype
debug1: identity file /root/.ssh/id_dsa type 2

debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug2: key_type_from_name: unknown key type '-----END'
debug3: key_read: missing keytype
debug1: identity file /root/.ssh/id_rsa type 1
debug3: Not a RSA1 key file /root/.ssh/id_dsa.
debug2: key_type_from_name: unknown key type '-----BEGIN'
debug3: key_read: missing keytype
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug3: key_read: missing whitespace
debug2: key_type_from_name: unknown key type '-----END'
debug3: key_read: missing keytype
debug1: identity file /root/.ssh/id_dsa type 2

=====

Thanks again for your help.

Rob



Michael Edwards wrote:
OSCAR doesn't need a gateway on the head node to work. One way communication generally implies there is a firewall on the head node or other routing problem.

What do you get from "iptables -L" on the head node?

You might try using a different address for the head node than 192.168.0.1 <http://192.168.0.1>, that is a common default address for networking hardware and can cause problems like this occasionally. I have become fond of 10.0.0.x because it isn't used as much.

You could also change the switch address too, if that is the problem.

On 10/26/07, *Robert Ashcraft* < [EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>> wrote:

    Michael,

    Thanks for the response.  My colleague has tried those things and
    they did not seems to help.  The "ssh -vvv" command does not
    provide any output and presumably just hangs somewhere in the
connection process.
    Just as some information...  If we set up the the compute node to
    connect to DCHP over the external MIT network (not through the
    switch), I was able to get two way communication (through the MIT
    network).  This seems to imply that it is some wrong with the
    static IP setup or something related to the switch.  However, the
    one-way communication is puzzling.  I don't think we have a
    gateway specified for the head node internal IP address, only the
    IP (192.168.0.1 <http://192.168.0.1>) and subnet mask
    (255.255.255.0 <http://255.255.255.0>).  Could that be the source
    of any problems?

    We will continue to try to diagnose the problem, but any more
    insight would be welcomed.  Thanks,

    Rob


    Michael Edwards wrote:
    Do you have the firewall on the head node turned off?

    You can check by doing "iptables -L" or checking under the
    "security level" utility.

    You can also try doing "ssh -vvv [EMAIL PROTECTED] " and see if it
    gives you any clues.

    On 10/25/07, *Robert Wilson Ashcraft* <[EMAIL PROTECTED]
    <mailto:[EMAIL PROTECTED]>> wrote:

        Hi,

        I am attempting to set up an OSCAR cluster.  I have gotten
        through everything
        past step 7, Complete CLuster Setup (which finished
        successfully).

        However, when I run the cluster tests, I get several
        failures, most noticibly
        with the node--> server communication.

        This is also confirmed by the fact that I can SSH to a node,
        but when I am
        logged into the node, I cannot SSH back into the server (it
        just hangs... no
        error message, but I can ctrl-C out of it)

        Do you have any idea why the SSH from the client to server
        would not be working?

        I have a feeling that if this problem is solved, the other
        failed test will work
        themselves out.

        I am attaching the oscarinstall.log file in case that helps.

        Here is my /etc/hosts file if that helps:
        # Do not remove the following line, or various programs
        # that require network functionality will fail.
127.0.0.1 <http://127.0.0.1> localhost.localdomain localhost
        192.168.0.1 <http://192.168.0.1>     pharos.mit.edu
        <http://pharos.mit.edu> pharos oscar_server nfs_oscar pbs_oscar
        18.80.7.242 <http://18.80.7.242>     pharos.mit.edu
        <http://pharos.mit.edu> pharos

        # These entries are managed by SIS, please don't modify them.
192.168.0.100 <http://192.168.0.100> oscarnode01.mit.edu
        <http://oscarnode01.mit.edu>        oscarnode01


        Thanks for your help.

        Rob Ashcraft


        
-------------------------------------------------------------------------

        This SF.net email is sponsored by: Splunk Inc.
        Still grepping through log files to find problems?  Stop.
        Now Search log events and configuration files using AJAX and
        a browser.
        Download your FREE copy of Splunk now >> http://get.splunk.com/
        _______________________________________________
        Oscar-users mailing list
        Oscar-users@lists.sourceforge.net
        <mailto:Oscar-users@lists.sourceforge.net>
        https://lists.sourceforge.net/lists/listinfo/oscar-users



    ------------------------------------------------------------------------

    -------------------------------------------------------------------------
    This SF.net email is sponsored by: Splunk Inc.
    Still grepping through log files to find problems?  Stop.

    Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
    ------------------------------------------------------------------------

    _______________________________________________
    Oscar-users mailing list

    Oscar-users@lists.sourceforge.net <mailto:Oscar-users@lists.sourceforge.net>
    https://lists.sourceforge.net/lists/listinfo/oscar-users
     <https://lists.sourceforge.net/lists/listinfo/oscar-users>


    -------------------------------------------------------------------------
    This SF.net email is sponsored by: Splunk Inc.
    Still grepping through log files to find problems?  Stop.
    Now Search log events and configuration files using AJAX and a
    browser.
    Download your FREE copy of Splunk now >> http://get.splunk.com/
    _______________________________________________
    Oscar-users mailing list
    Oscar-users@lists.sourceforge.net
    <mailto:Oscar-users@lists.sourceforge.net>
    https://lists.sourceforge.net/lists/listinfo/oscar-users


------------------------------------------------------------------------

-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
------------------------------------------------------------------------

_______________________________________________
Oscar-users mailing list
Oscar-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/oscar-users

--

Robert W. Ashcraft

Ph.D. Candidate

Dept. Chemical Engineering

Massachusetts Institute of Technology

77 Massachusetts Ave.

Room 66-264

Cambridge, MA 02139

Phone: 617-253-6554

E-mail: [EMAIL PROTECTED]

-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
_______________________________________________
Oscar-users mailing list
Oscar-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/oscar-users

Reply via email to