On 1/21/14, 10:01 PM, Tobias Oetiker wrote: > Hi Nld, > > Today Narayan Desai wrote: > >> Sorry, I should have given the requisite "yes, I know that this is a recipe >> for sadness, for I too have experienced said sadness". >> >> That said, we've seen this kind of problem when there was a device in a >> vdev that was dying a slow death. There wouldn't necessarily be any sign, >> aside from insanely high service times on an individual device in the pool. >> From this, I assume that ZFS is still sensitive to variation in underlying >> drive performance. >> >> Tobi, what do your drive service times look like? >> -nld > > the drives seem fine, smart is not reporting anything out of the > ordinary and also iostat -En shows 0 on all counts > > I don't think it is a disk issue, but rather something connected > with the network ... > > On times the machine becomes unreachable for some time, and then it > is possible to login via console and all seems well internally. > setting the network interface offline and then online again using > the dladm tool brings the connectivity back immediatly. waiting > helps as well ... since the problem sorts itself out after a few > seconds to minutes ... > > we just had another 'off the net' periode for 30 minutes > > unfortunately omnios itself does not seem to realize that something > is off, at least dmesg does not show any kernel messages about this > problem ... > > we have several systems running on the S2600CP MB ... this is the > only one showing problems ... > > the next thing I intend todo is to upgrade the MB firmware since I > found that this box has an older version than the other ones ... > > System Configuration: Intel Corporation S2600CP > BIOS Configuration: Intel Corp. SE5C600.86B.01.06.0002.110120121539 11/01/2012 > > other ideas, most welcome !
You mentioned a couple of e-mails back that you're using Intel I350s. Can you verify that your kernel has: commit 43ae55058ad99c869a9ae39d039490e8a3680520 Author: Dan McDonald <[email protected]> Date: Thu Feb 7 19:27:18 2013 -0500 3534 Disable EEE support in igb for I350 Reviewed by: Robert Mustacchi <[email protected]> Reviewed by: Jason King <[email protected]> Reviewed by: Marcel Telka <[email protected]> Reviewed by: Sebastien Roy <[email protected]> Approved by: Richard Lowe <[email protected]> I guess you can check for this string at runtime: $ strings /kernel/drv/amd64/igb | grep _eee_support If it is missing, then it could be the buggy EEE support that's throwing your link out of whack here. Cheers, -- Saso _______________________________________________ OmniOS-discuss mailing list [email protected] http://lists.omniti.com/mailman/listinfo/omnios-discuss
