Hi all,

I am trying to set up Lustre using TCP. I have the following in /etc/modprobe.conf:

options lnet networks="tcp0(eth2)"

to use only the third NIC. I have two OSSs and one MDS. They start up and see each other fine. My XML is pasted below.

When I try to have a client start with:

# lconf --node client lustre-fs.xml

it hangs at:

+ mount -t lustre_lite -o osc=lov1,mdc=MDC_compute-0-1.local_mds1_MNT_client lustre-fs /mnt/lustre

If I check its NIDs, I see:

# cat /proc/sys/lnet/nis
nid                      refs peer   max    tx   min
[EMAIL PROTECTED]                        2    0     0     0     0
[EMAIL PROTECTED]           2    8   256   256   255

which is the correct address for this client. If, instead of using lconf, I simply modprobe lnet, run lctl, and try to ping the MDS, it fails:

# lctl
lctl > network up
LNET configured
lctl > network tcp
lctl > ping 192.168.1.250
failed to ping [EMAIL PROTECTED]: Input/output error

Yet I can ping the node from the command line:

# ping -s 9000 192.168.1.250
PING 192.168.1.250 (192.168.1.250) 9000(9028) bytes of data.
9008 bytes from 192.168.1.250: icmp_seq=0 ttl=64 time=0.142 ms

# ping -s 9000 nas-0-0-m
PING nas-0-0-m.local (192.168.1.250) 9000(9028) bytes of data.
9008 bytes from nas-0-0-m.local (192.168.1.250): icmp_seq=0 ttl=64 time=0.111 ms

If I try using lctl on the MDS to ping the client, it fails as well, but I can ping the two OSSs.
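
Since ICMP ping works but lctl ping does not, one further check is whether a TCP connection to port 988 (the <port> value in the XML below) can be opened at all between the client and the MDS. This is only a minimal sketch, assuming Python is available on the node; tcp_port_open is a hypothetical helper written for this post, not part of Lustre or lctl:

```python
import socket


def tcp_port_open(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port can be established."""
    try:
        # create_connection performs a full TCP handshake, unlike ICMP ping.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


# Example call (address and port taken from this post):
#   tcp_port_open("192.168.1.250", 988)
```

A True result would only show that the port is reachable over TCP, not that LNET accepts the peer. A False result while ICMP ping succeeds would point toward something filtering TCP traffic between the two nodes.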

Any suggestions?

Thanks,

Scott


<?xml version='1.0' encoding='UTF-8'?>
<lustre version='2003070801' mtime='1176488849'>
  <ldlm name='ldlm' uuid='ldlm_UUID'/>
  <node uuid='client_UUID' name='client'>
    <profile_ref uuidref='PROFILE_client_UUID'/>
    <network uuid='NET_client_lnet_UUID' nettype='lnet' name='NET_client_lnet'>
      <nid>*</nid>
      <clusterid>0</clusterid>
      <port>988</port>
    </network>
  </node>
  <profile uuid='PROFILE_client_UUID' name='PROFILE_client'>
    <ldlm_ref uuidref='ldlm_UUID'/>
    <network_ref uuidref='NET_client_lnet_UUID'/>
    <mountpoint_ref uuidref='MNT_client_UUID'/>
  </profile>
  <node uuid='storenode1-m_UUID' name='storenode1-m'>
    <profile_ref uuidref='PROFILE_storenode1-m_UUID'/>
    <network uuid='NET_storenode1-m_lnet_UUID' nettype='lnet' name='NET_storenode1-m_lnet'>
      <nid>192.168.1.2</nid>
      <clusterid>0</clusterid>
      <port>988</port>
    </network>
  </node>
  <profile uuid='PROFILE_storenode1-m_UUID' name='PROFILE_storenode1-m'>
    <ldlm_ref uuidref='ldlm_UUID'/>
    <network_ref uuidref='NET_storenode1-m_lnet_UUID'/>
    <osd_ref uuidref='OSD_ost1_storenode1-m_UUID'/>
  </profile>
  <node uuid='storenode2-m_UUID' name='storenode2-m'>
    <profile_ref uuidref='PROFILE_storenode2-m_UUID'/>
    <network uuid='NET_storenode2-m_lnet_UUID' nettype='lnet' name='NET_storenode2-m_lnet'>
      <nid>192.168.1.4</nid>
      <clusterid>0</clusterid>
      <port>988</port>
    </network>
  </node>
  <profile uuid='PROFILE_storenode2-m_UUID' name='PROFILE_storenode2-m'>
    <ldlm_ref uuidref='ldlm_UUID'/>
    <network_ref uuidref='NET_storenode2-m_lnet_UUID'/>
    <osd_ref uuidref='OSD_ost2_storenode2-m_UUID'/>
  </profile>
  <node uuid='nas-0-0-m_UUID' name='nas-0-0-m'>
    <profile_ref uuidref='PROFILE_nas-0-0-m_UUID'/>
    <network uuid='NET_nas-0-0-m_lnet_UUID' nettype='lnet' name='NET_nas-0-0-m_lnet'>
      <nid>192.168.1.250</nid>
      <clusterid>0</clusterid>
      <port>988</port>
    </network>
  </node>
  <profile uuid='PROFILE_nas-0-0-m_UUID' name='PROFILE_nas-0-0-m'>
    <ldlm_ref uuidref='ldlm_UUID'/>
    <network_ref uuidref='NET_nas-0-0-m_lnet_UUID'/>
    <mdsdev_ref uuidref='MDD_mds1_nas-0-0-m_UUID'/>
  </profile>
  <mds uuid='mds1_UUID' name='mds1'>
    <active_ref uuidref='MDD_mds1_nas-0-0-m_UUID'/>
    <lovconfig_ref uuidref='LVCFG_lov1_UUID'/>
    <filesystem_ref uuidref='FS_fsname_UUID'/>
  </mds>
  <mdsdev uuid='MDD_mds1_nas-0-0-m_UUID' name='MDD_mds1_nas-0-0-m'>
    <fstype>ldiskfs</fstype>
    <devpath>/var/run/lustre/mds</devpath>
    <autoformat>yes</autoformat>
    <devsize>5000000</devsize>
    <journalsize>0</journalsize>
    <inodesize>0</inodesize>
    <node_ref uuidref='nas-0-0-m_UUID'/>
    <target_ref uuidref='mds1_UUID'/>
  </mdsdev>
  <lov stripesize='4194304' stripecount='-1' stripepattern='0' uuid='lov1_UUID' name='lov1'>
    <mds_ref uuidref='mds1_UUID'/>
    <obd_ref uuidref='ost1_UUID'/>
    <obd_ref uuidref='ost2_UUID'/>
  </lov>
  <lovconfig uuid='LVCFG_lov1_UUID' name='LVCFG_lov1'>
    <lov_ref uuidref='lov1_UUID'/>
  </lovconfig>
  <ost uuid='ost1_UUID' name='ost1'>
    <active_ref uuidref='OSD_ost1_storenode1-m_UUID'/>
  </ost>
  <osd osdtype='obdfilter' uuid='OSD_ost1_storenode1-m_UUID' name='OSD_ost1_storenode1-m'>
    <target_ref uuidref='ost1_UUID'/>
    <node_ref uuidref='storenode1-m_UUID'/>
    <fstype>ldiskfs</fstype>
    <devpath>/dev/sda1</devpath>
    <autoformat>no</autoformat>
    <devsize>0</devsize>
    <journalsize>0</journalsize>
    <inodesize>0</inodesize>
  </osd>
  <ost uuid='ost2_UUID' name='ost2'>
    <active_ref uuidref='OSD_ost2_storenode2-m_UUID'/>
  </ost>
  <osd osdtype='obdfilter' uuid='OSD_ost2_storenode2-m_UUID' name='OSD_ost2_storenode2-m'>
    <target_ref uuidref='ost2_UUID'/>
    <node_ref uuidref='storenode2-m_UUID'/>
    <fstype>ldiskfs</fstype>
    <devpath>/dev/sda1</devpath>
    <autoformat>no</autoformat>
    <devsize>0</devsize>
    <journalsize>0</journalsize>
    <inodesize>0</inodesize>
  </osd>
  <filesystem uuid='FS_fsname_UUID' name='FS_fsname'>
    <mds_ref uuidref='mds1_UUID'/>
    <obd_ref uuidref='lov1_UUID'/>
  </filesystem>
  <mountpoint uuid='MNT_client_UUID' name='MNT_client'>
    <filesystem_ref uuidref='FS_fsname_UUID'/>
    <path>/mnt/lustre</path>
  </mountpoint>
</lustre>
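
To cross-check the NIDs in the configuration above against the addresses each node actually reports, the per-node network entries can be pulled out of the XML. This is a rough sketch using only the Python standard library; list_node_nids is a hypothetical helper, and it assumes the layout shown above, where each <node> holds a <network> with <nid> and <port> children:

```python
import xml.etree.ElementTree as ET


def list_node_nids(xml_text: str):
    """Return (node name, nid, port) for every <network> under a <node>."""
    root = ET.fromstring(xml_text)
    rows = []
    for node in root.findall("node"):
        for net in node.findall("network"):
            # findtext returns the element text, or the default if absent.
            nid = net.findtext("nid", default="")
            port = net.findtext("port", default="")
            rows.append((node.get("name"), nid, port))
    return rows


# Example use, assuming the config was saved as lustre-fs.xml:
#   with open("lustre-fs.xml") as f:
#       for name, nid, port in list_node_nids(f.read()):
#           print(name, nid, port)
```

For the file above this would list the client with a wildcard NID (*) and the three servers with 192.168.1.2, 192.168.1.4, and 192.168.1.250, all on port 988.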

_______________________________________________
Lustre-discuss mailing list
[EMAIL PROTECTED]
https://mail.clusterfs.com/mailman/listinfo/lustre-discuss
