2011/12/23 Bogdan Popescu <[email protected]>

> Thanks for the reply,
>
> 2011/12/23 Catalin Muresan <[email protected]>
>
> > 2011/12/23 Bogdan Popescu <[email protected]>
> >
> > > Hello everyone,
> > >
> > > I have been following the list for a very long time but did not dare
> > > to post until today. A short description: I work for a company that
> > > does video editing and needs a great deal of storage, currently about
> > > 500TB across several NAS clusters accessed by servers/workstations in
> > > different locations ...
> > >
> > > For about 2 months I have been facing a problem that affects every NIC
> > > on the network (most of them connected at 1Gbit/10G). The older
> > > workstations with 100Mbit cards end up with over 50% loss, and the
> > > vast majority of machines connected to the network receive erroneous
> > > packets... (tcpdump shows packets sent between the storage (nfs) and
> > > various servers or workstations, containing reply ERR/reply ok). A few
> > > lines from tcpdump look roughly like this:
> > >
> > > 12:48:33.752052 IP f1nfs.mydomain.local.nfs > mine.mydomain.local.518062048: reply ERR 1448
> > > 12:48:33.752053 IP f1nfs.mydomain.local.nfs > mine.mydomain.local.3130183266: reply ERR 1448
> > > 12:48:33.752060 IP f1nfs.mydomain.local.nfs > mine.mydomain.local.3980622089: reply ERR 1448
> > > 12:48:33.752181 IP f1nfs.mydomain.local.nfs > mine.mydomain.local.1209215430: reply ERR 1448
> > > .....................
> > > 107453 packets captured
> > > 275218 packets received by filter
> > > 167431 packets dropped by kernel
> > > (10 seconds; "mine.mydomain.local" is a different machine from the one
> > > on which tcpdump was run)
> > >
> > >
> > > The main switch is a Brocade BigIron RX16 which has started to
> > > misbehave. More precisely, on very many ports it shows output
> > > utilization above 45% on the 1Gbit ports and 98% on the 100Mbit ports
> > > (I have also had cases where those ports showed 0 packets received on
> > > the respective interface)...
> > >
> > > I have run out of ideas and do not know what could help track down the
> > > problem. I hope you can help me with a few ideas/suggestions.
> > >
> >
> > if you are on linux, run the commands:
> >
>
> > ip -s -s link                # to see whether there are errors on the interface
>
> # ip -s -s link
> 1: lo: <LOOPBACK,UP,LOWER_UP> mtu 16436 qdisc noqueue
>    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
>    RX: bytes  packets  errors  dropped overrun mcast
>    3555113667 615577465 0       0       0       0
>    RX errors: length  crc     frame   fifo    missed
>               0        0       0       0       0
>    TX: bytes  packets  errors  dropped carrier collsns
>    3555113667 615577465 0       0       0       0
>    TX errors: aborted fifo    window  heartbeat
>               0        0       0       0
> 2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast qlen 1000
>    link/ether bc:30:5b:d7:92:41 brd ff:ff:ff:ff:ff:ff
>    RX: bytes  packets  errors  dropped overrun mcast
>    2778616109 1482044776 0       0       0       6
>    RX errors: length  crc     frame   fifo    missed
>               0        0       0       0       0
>    TX: bytes  packets  errors  dropped carrier collsns
>    712229372  1260796863 0       0       0       0
>    TX errors: aborted fifo    window  heartbeat
>               0        0       0       0
>
> > ethtool -S eth0         # statistics
> >
> # ethtool -S eth0
> NIC statistics:
>     rx_bytes: 277657215586
>     rx_error_bytes: 0
>     tx_bytes: 1061569387441
>     tx_error_bytes: 0
>     rx_ucast_packets: 1031747549
>     rx_mcast_packets: 3936897
>     rx_bcast_packets: 446363653
>     tx_ucast_packets: 1260625834
>     tx_mcast_packets: 6
>     tx_bcast_packets: 173087
>     tx_mac_errors: 0
>     tx_carrier_errors: 0
>     rx_crc_errors: 0
>     rx_align_errors: 0
>     tx_single_collisions: 0
>     tx_multi_collisions: 0
>     tx_deferred: 0
>     tx_excess_collisions: 0
>     tx_late_collisions: 0
>     tx_total_collisions: 0
>     rx_fragments: 0
>     rx_jabbers: 0
>     rx_undersize_packets: 0
>     rx_oversize_packets: 0
>     rx_64_byte_packets: 327021748
>     rx_65_to_127_byte_packets: 816309288
>     rx_128_to_255_byte_packets: 156755780
>     rx_256_to_511_byte_packets: 70100602
>     rx_512_to_1023_byte_packets: 30381647
>     rx_1024_to_1522_byte_packets: 81479034
>     rx_1523_to_9022_byte_packets: 0
>     tx_64_byte_packets: 3115156
>     tx_65_to_127_byte_packets: 519513182
>     tx_128_to_255_byte_packets: 56840902
>     tx_256_to_511_byte_packets: 8982146
>     tx_512_to_1023_byte_packets: 13718740
>     tx_1024_to_1522_byte_packets: 658628803
>     tx_1523_to_9022_byte_packets: 0
>     rx_xon_frames: 0
>     rx_xoff_frames: 0
>     tx_xon_frames: 1
>     tx_xoff_frames: 1
>     rx_mac_ctrl_frames: 0
>     rx_filtered_packets: 3367232996
>     rx_ftq_discards: 0
>     rx_discards: 0
>     rx_fw_discards: 0
>
>
> > ethtool -k eth0         # offload status
> >
> # ethtool -k eth0
> Offload parameters for eth0:
> Cannot get device udp large send offload settings: Operation not supported
> rx-checksumming: on
> tx-checksumming: on
> scatter-gather: on
> tcp segmentation offload: on
> udp fragmentation offload: off
> generic segmentation offload: off
> generic-receive-offload: off
>
>
> > ethtool -a eth0         # pause (flow control) status
> >
> ethtool -a eth0
> Pause parameters for eth0:
> Autonegotiate:  on
> RX:             on
> TX:             on
>
>
> >
> > can you get similar information from the other end (the switch)? That
> > is, if linux box X is connected to port Y on the switch, the output of
> > the commands on linux plus the equivalent information for the
> > corresponding switch port would help.
> >
>
> show interfaces ethernet 11/23
> GigabitEthernet11/23 is up, line protocol is up
>  Hardware is GigabitEthernet, address is 0012.f23f.3b00 (bia 0012.f23f.3b00)
>  Configured speed auto, actual 1Gbit, configured duplex fdx, actual fdx
>  Configured mdi mode AUTO, actual MDIX
>  Member of L2 VLAN ID 102, port is untagged, port state is Forwarding
>  STP configured to ON, Priority is level0, flow control enabled
>  Force-DSCP disabled
>  mirror disabled, monitor disabled
>  Not member of any active trunks
>  Not member of any configured trunks
>  No port name
>  MTU 1518 bytes, encapsulation ethernet
>  300 second input rate: 579553 bits/sec, 291 packets/sec, 0.06% utilization
>  300 second output rate: 93822066 bits/sec, 24187 packets/sec, 9.61% utilization
>  352571352 packets input, 92636820039 bytes, 0 no buffer
>  Received 54059 broadcasts, 0 multicasts, 352517293 unicasts
>  0 input errors, 0 CRC, 0 frame, 0 ignored
>  0 runts, 0 giants, DMA received 352571352 packets
>  43102015231 packets output, 37874584621589 bytes, 0 underruns
>  Transmitted 123807465 broadcasts, 15879877 multicasts, 42962327889 unicasts
>  0 output errors, 0 collisions, DMA transmitted 43102015231 packets
>
> >> 89mbits/sec output rate on the directly connected interface, whereas
> >> iptraf shows 100kbits/sec
>

you can install 'dstat' and run it; personally I trust dstat more than
iptraf

dstat -n -N eth0
or
dstat -N eth0
except that the latter also shows cpu load/disk/swap.
also run 'ethtool eth0', since it is not clear what speed linux sees (I saw
that on the switch it is 1g, but wth, let's check everything)
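
As a rough cross-check that does not depend on either dstat or iptraf, the
kernel's own byte counters can be sampled twice and turned into a rate. The
sketch below assumes a Linux box exposing /sys/class/net/<iface>/statistics;
the function names are made up for illustration. Note that what the switch
reports as output on the port should show up as rx on the linux side.

```python
import time
from pathlib import Path

SYSFS_NET = Path("/sys/class/net")  # assumed: standard Linux sysfs location


def read_counter(iface, name, root=SYSFS_NET):
    """Read one cumulative counter, e.g. name='rx_bytes' or 'tx_bytes'."""
    return int((Path(root) / iface / "statistics" / name).read_text())


def rate_bps(before, after, seconds):
    """Turn a byte-counter delta into bits per second."""
    return (after - before) * 8 / seconds


def utilization_pct(bps, link_mbps):
    """Share of the nominal link speed; per-frame overhead is ignored."""
    return 100.0 * bps / (link_mbps * 1_000_000)


def sample(iface="eth0", seconds=10, root=SYSFS_NET):
    """Sample the rx/tx byte counters twice; return (rx_bps, tx_bps)."""
    rx0 = read_counter(iface, "rx_bytes", root)
    tx0 = read_counter(iface, "tx_bytes", root)
    time.sleep(seconds)
    rx1 = read_counter(iface, "rx_bytes", root)
    tx1 = read_counter(iface, "tx_bytes", root)
    return rate_bps(rx0, rx1, seconds), rate_bps(tx0, tx1, seconds)
```

Calling sample("eth0") and passing the rx figure to utilization_pct(rx, 1000)
gives a number to hold against the switch's 300 second output rate. For the
93822066 bits/sec quoted above this works out to about 9.38% of 1Gbit, a bit
below the 9.61% the switch prints, possibly because the switch counts some
per-frame overhead.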

and if dstat shows 89mbps, then you need to identify with tcpdump where the
traffic is
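
One possible way to do that last step is to count packets per source and
destination pair in tcpdump's text output. This is only a sketch: it assumes
one packet per line (the wrapped lines in the mail above rejoined), and the
helper names are hypothetical. For NFS traffic tcpdump prints the RPC
transaction id where the port would normally be, so the last dot-separated
field is dropped in either case.

```python
import re
from collections import Counter

# "<timestamp> IP <src>.<port> > <dst>.<port>: ..." as printed by tcpdump
LINE_RE = re.compile(r"^\S+ IP6? (\S+) > (\S+?):(?: |$)")


def strip_port(endpoint):
    """Drop the trailing .port (or .xid / .nfs) from a tcpdump endpoint."""
    host, _, _last = endpoint.rpartition(".")
    return host or endpoint


def top_talkers(lines, n=5):
    """Return the n most frequent (src, dst) pairs with packet counts."""
    counts = Counter()
    for line in lines:
        m = LINE_RE.match(line)
        if m:  # summary lines such as "... packets captured" are skipped
            counts[(strip_port(m.group(1)), strip_port(m.group(2)))] += 1
    return counts.most_common(n)
```

Feeding it the output of something like 'tcpdump -n -i eth0 -c 100000' (for
example via top_talkers(sys.stdin)) should show which pairs dominate. Keep in
mind that the earlier capture reported roughly 60% of packets dropped by the
kernel, so the counts are a sample, not an exact tally.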


> >
> > Good luck.
> >
> >
> > >
> > > Thank you, and Happy Holidays.
> > >
> > > ----------------
> > > Bogdan
> > >
> > _______________________________________________
> > RLUG mailing list
> > [email protected]
> > http://lists.lug.ro/mailman/listinfo/rlug
> >