Hi,

Want to let everyone know what we found out. I worked with the SAN vendor to determine that the SAN was running fine although we did find one setting incorrect on the hosts. If you run iSCSI you need to make sure you disable the Nagle Algorithm on your hosts as it can cause delays. A concise description is:

http://social.technet.microsoft.com/wiki/contents/articles/7636.iscsi-and-the-nagle-algorithm.aspx

So we did this and still no real change. Then we started looking deeper at the data and noticed that the read delays happened every 10 minutes h:00, h:10, h:20... and the rest of the time the were no reads at all on the transaction log directory. Hmm what process could be reading from the logs directory every 10 minutes. A few minutes later we figured out that a process was set up to copy the transaction logs off to a backup server every 10 minutes.

The reason it looked like we had very bad read response times on a daily basis was that the monitoring tool took measurements every 10 minutes for daily readings and just picked up the high points ignoring all the low points in between.

End result is that the response time issue is an artifact of the monitoring tool data collection methodology not a real problem. Lesson learned is that when I see strangeness I need to really understand how I collected the data showing the strangeness.

Thanks to everyone who replied and I hope this helps someone else.

cheers,

skib

--
"When we try to pick out anything by itself, we find it
  connected to the entire universe"            John Muir

Chris "Ski" Kacoroski, [email protected], 206-501-9803
or ski98033 on most IM services
_______________________________________________
Discuss mailing list
[email protected]
https://lists.lopsa.org/cgi-bin/mailman/listinfo/discuss
This list provided by the League of Professional System Administrators
http://lopsa.org/

Reply via email to