Multumesc pentru raspuns,

2011/12/23 Catalin Muresan <[email protected]>

> 2011/12/23 Bogdan Popescu <[email protected]>
>
> > Salutare tuturor,
> >
> > Urmaresc de foarte mult timp lista dar nu am indraznit sa postez pana
> > astazi, o scurta descriere... lucrez pentru o companie care se ocupa cu
> > editari video, au nevoie de foarte mult storage, in momentul de fata
> > folosind aproximativ 500TB in diferite clustere NAS accesate de
> > servere/statii de lucru din diferite locatii ...
> >
> > De aproximativ 2 luni intampin o problema care imi afecteaza fiecare NIC
> > din retea (cele mai multe fiind conectate pe 1Gbit/10G), statiile mai
> vechi
> > cu placi de 100 ajung sa aiba peste 50% loss si marea majoritate a
> > masinilor conectate la retea primesc pachete eronate...(tcpdump arata
> > pachete trimise intre storage (nfs) si diferite servere sau statii de
> lucru
> > si contin reply ERR/reply ok), cateva linii din tcpdump ar arata cam asa:
> >
> > 12:48:33.752052 IP f1nfs.mydomain.local.nfs >
> > mine.mydomain.local.518062048: reply ERR 1448
> > 12:48:33.752053 IP f1nfs.mydomain.local.nfs >
> > mine.mydomain.local.3130183266: reply ERR 1448
> > 12:48:33.752060 IP f1nfs.mydomain.local.nfs >
> > mine.mydomain.local.3980622089: reply ERR 1448
> > 12:48:33.752181 IP f1nfs.mydomain.local.nfs >
> > mine.mydomain.local.1209215430: reply ERR 1448
> > .....................
> > 107453 packets captured
> > 275218 packets received by filter
> > 167431 packets dropped by kernel
> > (10 secunde, "mine.mydomain.local" este o masina diferita fata de cea pe
> > care a fost rulat tcpdump)
> >
> >
> > Switchul principal este un Brocade BigIron RX16 care a inceput sa o ia
> > razna, mai exact arata pe foarte multe porturi output utilization peste
> 45%
> > la porturile 1Gbit si 98% la porturile 100Mbit (am avut si cazuri in care
> > porturile respective aveau 0 pachete primite prin interfata
> respectiva)...
> >
> > Am ramas fara idei si nu stiu ce anume ar putea ajuta la depistarea
> > problemei, sper sa ma puteti ajuta cu cateva idei/sugestii.
> >
>
> daca esti pe linux, ruleaza comenzile:
>

> ip -s -s link                # sa se vada daca sunt erori pe interfata

# ip -s -s link
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 16436 qdisc noqueue
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    RX: bytes  packets  errors  dropped overrun mcast
    3555113667 615577465 0       0       0       0
    RX errors: length  crc     frame   fifo    missed
               0        0       0       0       0
    TX: bytes  packets  errors  dropped carrier collsns
    3555113667 615577465 0       0       0       0
    TX errors: aborted fifo    window  heartbeat
               0        0       0       0
2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast qlen
1000
    link/ether bc:30:5b:d7:92:41 brd ff:ff:ff:ff:ff:ff
    RX: bytes  packets  errors  dropped overrun mcast
    2778616109 1482044776 0       0       0       6
    RX errors: length  crc     frame   fifo    missed
               0        0       0       0       0
    TX: bytes  packets  errors  dropped carrier collsns
    712229372  1260796863 0       0       0       0
    TX errors: aborted fifo    window  heartbeat
               0        0       0       0

> ethtool -S eth0         # statistics
>
# ethtool -S eth0
NIC statistics:
     rx_bytes: 277657215586
     rx_error_bytes: 0
     tx_bytes: 1061569387441
     tx_error_bytes: 0
     rx_ucast_packets: 1031747549
     rx_mcast_packets: 3936897
     rx_bcast_packets: 446363653
     tx_ucast_packets: 1260625834
     tx_mcast_packets: 6
     tx_bcast_packets: 173087
     tx_mac_errors: 0
     tx_carrier_errors: 0
     rx_crc_errors: 0
     rx_align_errors: 0
     tx_single_collisions: 0
     tx_multi_collisions: 0
     tx_deferred: 0
     tx_excess_collisions: 0
     tx_late_collisions: 0
     tx_total_collisions: 0
     rx_fragments: 0
     rx_jabbers: 0
     rx_undersize_packets: 0
     rx_oversize_packets: 0
     rx_64_byte_packets: 327021748
     rx_65_to_127_byte_packets: 816309288
     rx_128_to_255_byte_packets: 156755780
     rx_256_to_511_byte_packets: 70100602
     rx_512_to_1023_byte_packets: 30381647
     rx_1024_to_1522_byte_packets: 81479034
     rx_1523_to_9022_byte_packets: 0
     tx_64_byte_packets: 3115156
     tx_65_to_127_byte_packets: 519513182
     tx_128_to_255_byte_packets: 56840902
     tx_256_to_511_byte_packets: 8982146
     tx_512_to_1023_byte_packets: 13718740
     tx_1024_to_1522_byte_packets: 658628803
     tx_1523_to_9022_byte_packets: 0
     rx_xon_frames: 0
     rx_xoff_frames: 0
     tx_xon_frames: 1
     tx_xoff_frames: 1
     rx_mac_ctrl_frames: 0
     rx_filtered_packets: 3367232996
     rx_ftq_discards: 0
     rx_discards: 0
     rx_fw_discards: 0


> ethtook -k eth0         # status offloading
>
# ethtool -k eth0
Offload parameters for eth0:
Cannot get device udp large send offload settings: Operation not supported
rx-checksumming: on
tx-checksumming: on
scatter-gather: on
tcp segmentation offload: on
udp fragmentation offload: off
generic segmentation offload: off
generic-receive-offload: off


> ethtook -a eth0         # status pause
>
ethtool -a eth0
Pause parameters for eth0:
Autonegotiate:  on
RX:             on
TX:             on


>
> poti obtine informatii similare din celalalt capat (switch) ? adica daca ai
> linuxul X legat in portul Y in switch, output-ul comenzilor pe linux +
> informatiile similare de pe portul corespondent din switch ajuta.
>

show interfaces ethernet 11/23
GigabitEthernet11/23 is up, line protocol is up
  Hardware is GigabitEthernet, address is 0012.f23f.3b00 (bia
0012.f23f.3b00)
  Configured speed auto, actual 1Gbit, configured duplex fdx, actual fdx
  Configured mdi mode AUTO, actual MDIX
  Member of L2 VLAN ID 102, port is untagged, port state is Forwarding
  STP configured to ON, Priority is level0, flow control enabled
  Force-DSCP disabled
  mirror disabled, monitor disabled
  Not member of any active trunks
  Not member of any configured trunks
  No port name
  MTU 1518 bytes, encapsulation ethernet
  300 second input rate: 579553 bits/sec, 291 packets/sec, 0.06% utilization
  300 second output rate: 93822066 bits/sec, 24187 packets/sec, 9.61%
utilization
  352571352 packets input, 92636820039 bytes, 0 no buffer
  Received 54059 broadcasts, 0 multicasts, 352517293 unicasts
  0 input errors, 0 CRC, 0 frame, 0 ignored
  0 runts, 0 giants, DMA received 352571352 packets
  43102015231 packets output, 37874584621589 bytes, 0 underruns
  Transmitted 123807465 broadcasts, 15879877 multicasts, 42962327889
unicasts
  0 output errors, 0 collisions, DMA transmitted 43102015231 packets

>> 89mbits/sec output rate pe interfata direct conectata, pe cand iptraf
arata 100kbits/sec


>
> Spor.
>
>
> >
> > Va multumesc si va doresc Sarbatori Fericite.
> >
> > ----------------
> > Bogdan
> >
> _______________________________________________
> RLUG mailing list
> [email protected]
> http://lists.lug.ro/mailman/listinfo/rlug
>
_______________________________________________
RLUG mailing list
[email protected]
http://lists.lug.ro/mailman/listinfo/rlug

Raspunde prin e-mail lui