We're seeing MDS timeouts on 4.3.x when opening large checkpoint replicas. I
noticed that a ticket (#770) was opened for this, and it says the problem was
fixed in 4.5. However, when I look at the commit for that ticket,
MAX_SYNC_TRANSFER_SIZE was not changed. Can anyone confirm that the timeout
problem when transferring replicas has been fixed?

thanks
--
tony

#770

The checkpoint service has some major performance problems when a checkpoint
uses a large number of sections (more than 5k).

Attached is a patch which addresses the following problems:

1) The section id database is implemented as a linked list, so searching for a
section id takes a long time (make it a C++ STL map)
2) MAX_SYNC_TRANSFER_SIZE is too large and causes MDS timeouts
3) The SectionCreate message should be asynchronous when ACTIVE_REPLICA is
specified
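
To illustrate items 1 and 2, here is a rough sketch, not the actual patch. The names Section, SectionTable and split_transfer are hypothetical and do not come from the OpenSAF source. The sketch assumes that section ids can be treated as strings and that a replica sync can be split into pieces no larger than some maximum transfer size. The map gives O(log n) lookup by section id instead of an O(n) walk of a linked list, and the helper shows how a smaller transfer limit turns one large message into several shorter ones.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a checkpoint section; not an OpenSAF type.
struct Section {
    std::string id;
    std::vector<std::uint8_t> data;
};

// Section table keyed by section id. std::map lookup is O(log n),
// whereas searching a linked list is O(n) per lookup.
class SectionTable {
public:
    // Inserts a section; returns false if the id is already present.
    bool add(Section s) {
        std::string key = s.id;  // copy the key before moving the section
        return sections_.emplace(std::move(key), std::move(s)).second;
    }

    // Returns a pointer to the section, or nullptr if the id is unknown.
    const Section* find(const std::string& id) const {
        auto it = sections_.find(id);
        return it == sections_.end() ? nullptr : &it->second;
    }

    std::size_t size() const { return sections_.size(); }

private:
    std::map<std::string, Section> sections_;
};

// Splits a transfer of total_size bytes into (offset, length) pieces,
// each at most max_chunk bytes long. max_chunk plays the role that a
// limit such as MAX_SYNC_TRANSFER_SIZE would play (assumption).
std::vector<std::pair<std::size_t, std::size_t>>
split_transfer(std::size_t total_size, std::size_t max_chunk) {
    assert(max_chunk > 0);
    std::vector<std::pair<std::size_t, std::size_t>> chunks;
    for (std::size_t off = 0; off < total_size; off += max_chunk) {
        chunks.emplace_back(off, std::min(max_chunk, total_size - off));
    }
    return chunks;
}
```

With these definitions, split_transfer(10, 4) yields three pieces covering offsets 0, 4 and 8, the last one 2 bytes long. A smaller limit means more messages, but each one is shorter and so less likely to run into a per-message timeout.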
_______________________________________________
Opensaf-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-users
