2011/12/23 Bogdan Popescu <[email protected]>

> Thanks for the reply,
>
> 2011/12/23 Catalin Muresan <[email protected]>
>
> > 2011/12/23 Bogdan Popescu <[email protected]>
> >
> > > Hello everyone,
> > >
> > > I have been following the list for a very long time but had not dared
> > > to post until today. A short description: I work for a company that
> > > does video editing. They need a lot of storage, currently using about
> > > 500TB across various NAS clusters accessed by servers/workstations
> > > from different locations ...
> > >
> > > For about 2 months I have been running into a problem that affects
> > > every NIC on the network (most of them connected at 1Gbit/10G). The
> > > older workstations with 100Mbit cards end up with over 50% loss, and
> > > the vast majority of machines connected to the network receive
> > > erroneous packets (tcpdump shows packets sent between the storage
> > > (nfs) and various servers or workstations that contain
> > > reply ERR/reply ok). A few lines from tcpdump look like this:
> > >
> > > 12:48:33.752052 IP f1nfs.mydomain.local.nfs > mine.mydomain.local.518062048: reply ERR 1448
> > > 12:48:33.752053 IP f1nfs.mydomain.local.nfs > mine.mydomain.local.3130183266: reply ERR 1448
> > > 12:48:33.752060 IP f1nfs.mydomain.local.nfs > mine.mydomain.local.3980622089: reply ERR 1448
> > > 12:48:33.752181 IP f1nfs.mydomain.local.nfs > mine.mydomain.local.1209215430: reply ERR 1448
> > > .....................
> > > 107453 packets captured
> > > 275218 packets received by filter
> > > 167431 packets dropped by kernel
> > > (10 seconds; "mine.mydomain.local" is a different machine from the
> > > one tcpdump was run on)
> > >
> > >
> > > The main switch is a Brocade BigIron RX16 that has started to go
> > > haywire. More precisely, on a great many ports it shows output
> > > utilization above 45% on the 1Gbit ports and 98% on the 100Mbit ports
> > > (I have also had cases where the ports in question had 0 packets
> > > received on that interface)...
> > >
> > > I have run out of ideas and do not know what could help track down
> > > the problem. I hope you can help me with a few ideas/suggestions.
> > >
> >
> > if you are on linux, run the commands:
> >
> > ip -s -s link # to see whether there are errors on the interface
>
> # ip -s -s link
> 1: lo: <LOOPBACK,UP,LOWER_UP> mtu 16436 qdisc noqueue
>     link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
>     RX: bytes  packets   errors  dropped overrun mcast
>     3555113667 615577465 0       0       0       0
>     RX errors: length    crc     frame   fifo    missed
>                0         0       0       0       0
>     TX: bytes  packets   errors  dropped carrier collsns
>     3555113667 615577465 0       0       0       0
>     TX errors: aborted   fifo    window  heartbeat
>                0         0       0       0
> 2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast qlen 1000
>     link/ether bc:30:5b:d7:92:41 brd ff:ff:ff:ff:ff:ff
>     RX: bytes  packets    errors  dropped overrun mcast
>     2778616109 1482044776 0       0       0       6
>     RX errors: length     crc     frame   fifo    missed
>                0          0       0       0       0
>     TX: bytes  packets    errors  dropped carrier collsns
>     712229372  1260796863 0       0       0       0
>     TX errors: aborted    fifo    window  heartbeat
>                0          0       0       0
>
> > ethtool -S eth0 # statistics
>
> # ethtool -S eth0
> NIC statistics:
>      rx_bytes: 277657215586
>      rx_error_bytes: 0
>      tx_bytes: 1061569387441
>      tx_error_bytes: 0
>      rx_ucast_packets: 1031747549
>      rx_mcast_packets: 3936897
>      rx_bcast_packets: 446363653
>      tx_ucast_packets: 1260625834
>      tx_mcast_packets: 6
>      tx_bcast_packets: 173087
>      tx_mac_errors: 0
>      tx_carrier_errors: 0
>      rx_crc_errors: 0
>      rx_align_errors: 0
>      tx_single_collisions: 0
>      tx_multi_collisions: 0
>      tx_deferred: 0
>      tx_excess_collisions: 0
>      tx_late_collisions: 0
>      tx_total_collisions: 0
>      rx_fragments: 0
>      rx_jabbers: 0
>      rx_undersize_packets: 0
>      rx_oversize_packets: 0
>      rx_64_byte_packets: 327021748
>      rx_65_to_127_byte_packets: 816309288
>      rx_128_to_255_byte_packets: 156755780
>      rx_256_to_511_byte_packets: 70100602
>      rx_512_to_1023_byte_packets: 30381647
>      rx_1024_to_1522_byte_packets: 81479034
>      rx_1523_to_9022_byte_packets: 0
>      tx_64_byte_packets: 3115156
>      tx_65_to_127_byte_packets: 519513182
>      tx_128_to_255_byte_packets: 56840902
>      tx_256_to_511_byte_packets: 8982146
>      tx_512_to_1023_byte_packets: 13718740
>      tx_1024_to_1522_byte_packets: 658628803
>      tx_1523_to_9022_byte_packets: 0
>      rx_xon_frames: 0
>      rx_xoff_frames: 0
>      tx_xon_frames: 1
>      tx_xoff_frames: 1
>      rx_mac_ctrl_frames: 0
>      rx_filtered_packets: 3367232996
>      rx_ftq_discards: 0
>      rx_discards: 0
>      rx_fw_discards: 0
>
> > ethtool -k eth0 # offloading status
>
> # ethtool -k eth0
> Offload parameters for eth0:
> Cannot get device udp large send offload settings: Operation not supported
> rx-checksumming: on
> tx-checksumming: on
> scatter-gather: on
> tcp segmentation offload: on
> udp fragmentation offload: off
> generic segmentation offload: off
> generic-receive-offload: off
>
> > ethtool -a eth0 # pause status
>
> ethtool -a eth0
> Pause parameters for eth0:
> Autonegotiate: on
> RX: on
> TX: on
>
> > can you get similar information from the other end (the switch)? That
> > is, if you have linux box X connected to port Y on the switch, the
> > output of the commands on linux plus the similar information from the
> > corresponding switch port helps.
>
> show interfaces ethernet 11/23
> GigabitEthernet11/23 is up, line protocol is up
>   Hardware is GigabitEthernet, address is 0012.f23f.3b00 (bia 0012.f23f.3b00)
>   Configured speed auto, actual 1Gbit, configured duplex fdx, actual fdx
>   Configured mdi mode AUTO, actual MDIX
>   Member of L2 VLAN ID 102, port is untagged, port state is Forwarding
>   STP configured to ON, Priority is level0, flow control enabled
>   Force-DSCP disabled
>   mirror disabled, monitor disabled
>   Not member of any active trunks
>   Not member of any configured trunks
>   No port name
>   MTU 1518 bytes, encapsulation ethernet
>   300 second input rate: 579553 bits/sec, 291 packets/sec, 0.06% utilization
>   300 second output rate: 93822066 bits/sec, 24187 packets/sec, 9.61% utilization
>   352571352 packets input, 92636820039 bytes, 0 no buffer
>   Received 54059 broadcasts, 0 multicasts, 352517293 unicasts
>   0 input errors, 0 CRC, 0 frame, 0 ignored
>   0 runts, 0 giants, DMA received 352571352 packets
>   43102015231 packets output, 37874584621589 bytes, 0 underruns
>   Transmitted 123807465 broadcasts, 15879877 multicasts, 42962327889 unicasts
>   0 output errors, 0 collisions, DMA transmitted 43102015231 packets
>
> >> 89mbits/sec output rate on the directly connected interface, whereas
> >> iptraf shows 100kbits/sec
>
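A rough way to cross-check the two readings above (switch port counter versus iptraf) is to sample the kernel's own byte counters and compute the rate by hand. The sketch below is only an illustration, not a tool from this thread: it assumes a Linux host that exposes /sys/class/net/<iface>/statistics/, and the helper names (read_counter, rate_bits_per_sec, sample) and the 5-second interval are made up for this example.

```python
import time
from pathlib import Path


def read_counter(iface: str, name: str) -> int:
    """Read one kernel interface counter, e.g. rx_bytes or tx_bytes."""
    path = Path(f"/sys/class/net/{iface}/statistics/{name}")
    return int(path.read_text())


def rate_bits_per_sec(before: int, after: int, seconds: float) -> float:
    """Convert the growth of a byte counter over an interval into bits/sec."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return (after - before) * 8 / seconds


def sample(iface: str = "eth0", seconds: float = 5.0) -> dict:
    """Sample rx_bytes and tx_bytes twice and return both rates in bits/sec."""
    rx0 = read_counter(iface, "rx_bytes")
    tx0 = read_counter(iface, "tx_bytes")
    start = time.monotonic()
    time.sleep(seconds)
    elapsed = time.monotonic() - start
    rx1 = read_counter(iface, "rx_bytes")
    tx1 = read_counter(iface, "tx_bytes")
    return {
        "rx_bps": rate_bits_per_sec(rx0, rx1, elapsed),
        "tx_bps": rate_bits_per_sec(tx0, tx1, elapsed),
    }
```

Calling sample("eth0") on the host behind port 11/23 gives an rx_bps figure to set against the switch's output rate for that port. The switch figure is a 300-second average, so a short sample will only roughly agree with it.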
you can install 'dstat' and run it; personally I trust dstat more than
iptraf:

dstat -n -N eth0

or

dstat -N eth0

except that the second form also shows cpu load/disk/swap.

also post the output of

ethtool eth0

because it is not clear what speed linux sees (I saw that on the switch it
is 1G, but wth, let's check everything).

and if dstat shows 89mbps, then you need to use tcpdump to identify where
the traffic is.

> > Good luck.
> >
> > > Thank you, and I wish you Happy Holidays.
> > >
> > > ----------------
> > > Bogdan

_______________________________________________
RLUG mailing list
[email protected]
http://lists.lug.ro/mailman/listinfo/rlug
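For the last step suggested above (using tcpdump to find where the traffic is), one possible approach is to count captured lines per source/destination pair. This is a minimal sketch under stated assumptions: it only understands text lines shaped like the ones quoted earlier in the thread ("time IP src > dst: ..."), it treats the last dot-separated component of each endpoint as a port, service name, or RPC id and drops it, and the names LINE_RE, host_part, and top_talkers are invented for this example.

```python
import re
from collections import Counter

# Matches "<timestamp> IP <src> > <dst>: ..." as printed by tcpdump.
LINE_RE = re.compile(r"^\S+ IP6? (\S+) > (\S+?): ")


def host_part(endpoint: str) -> str:
    """Drop the trailing ".port" (or service name / RPC id) from an endpoint."""
    return endpoint.rsplit(".", 1)[0]


def top_talkers(lines, n=10):
    """Return the n most frequent (source host, destination host) pairs."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line)
        if match:
            pair = (host_part(match.group(1)), host_part(match.group(2)))
            counts[pair] += 1
    return counts.most_common(n)


# The four lines quoted earlier in the thread, used as sample input.
SAMPLE = [
    "12:48:33.752052 IP f1nfs.mydomain.local.nfs > mine.mydomain.local.518062048: reply ERR 1448",
    "12:48:33.752053 IP f1nfs.mydomain.local.nfs > mine.mydomain.local.3130183266: reply ERR 1448",
    "12:48:33.752060 IP f1nfs.mydomain.local.nfs > mine.mydomain.local.3980622089: reply ERR 1448",
    "12:48:33.752181 IP f1nfs.mydomain.local.nfs > mine.mydomain.local.1209215430: reply ERR 1448",
]

for pair, count in top_talkers(SAMPLE):
    print(count, pair[0], "->", pair[1])
```

On a real capture, the same function could be fed the saved text output of tcpdump instead of SAMPLE. It counts packets rather than bytes, so it shows who is chatty, not necessarily who fills the link.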
