We spent several months moving to a database-replicated system. There
are MANY issues that make it more complicated than just a
dsconfig.ini type of setup, although it would seem that way.
The biggest issue is replication latency. Some apps need access to
data immediately after an insert or update. For that reason, you
cannot just say: do all writes on DBMain and all reads on the
other(s). You have to either architect all your applications to allow
for this latency, or, as in our case, choose which functions of your
application(s) will read from the replicated servers.
The goal of replication is to gain performance by load balancing
interaction with a database, or to gain redundancy.
We accomplish both goals in completely different ways.
1. Redundancy: we can quickly switch all DSNs on the Witango machines
to do all reads and writes to a slave database. This should only be
done in a complete-failure scenario, because afterwards you have
serious re-synchronization issues.
2. Performance: we went through our applications, chose the areas
that would not suffer from the latency issue but put a heavy read
load on the database server, and moved only those functions to read
from the slaves. This works very well.
To be more specific: on eventpix, we moved all of the event/image
VIEWING to the slaves. This is the most-used function of eventpix, it
is hammered constantly, and moving it completely frees up the main db
server. Also, it is not an issue if a user looks for an event and a
slave is a minute behind the main server.
Keep in mind that you also have to incorporate the user session into
your equation.
So in a database, we store a record for each slave along with an
integer for its priority. If there are 3 slaves, db2, db3, and db4,
with priorities of 100, 100, and 200 respectively, then db4 gets 2x
the traffic of db2 or db3; I think you get the picture. When a server
is started, these values are loaded into a custom global scope. When
a user hits one of our balanced functions, it calls a TCF method to
"checkout" a DSN. The method hands back the DSN, and it also knows to
keep giving the SAME DSN to the same user; you don't want a single
user switching from one slave to another.
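The checkout logic can be sketched like this. This is a minimal
illustration in Python, not the actual TCF code; all the names here
(SLAVE_PRIORITIES, checkout_dsn, _assignments) are assumptions, and
the in-memory dict stands in for the custom global scope.

```python
import random

# Cached slave priorities, as loaded from the database at server start.
# db4 at 200 gets twice the traffic of db2 or db3 at 100.
SLAVE_PRIORITIES = {"db2": 100, "db3": 100, "db4": 200}

# Remembers which DSN each user session was handed, for stickiness.
_assignments = {}

def checkout_dsn(session_id, priorities=SLAVE_PRIORITIES):
    """Return a slave DSN for this session, weighted by priority."""
    # Sticky: keep giving the SAME DSN to the same user, as long as
    # that slave still has a nonzero priority.
    dsn = _assignments.get(session_id)
    if dsn is not None and priorities.get(dsn, 0) > 0:
        return dsn
    # Otherwise pick a live slave, weighted by its priority value.
    live = {d: p for d, p in priorities.items() if p > 0}
    dsn = random.choices(list(live), weights=list(live.values()), k=1)[0]
    _assignments[session_id] = dsn
    return dsn
```

Because the lookup is a pure in-memory dict read, repeated checkouts
for the same session cost essentially nothing, which matches the
point below about the method never hitting the DB.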
If there is a failure on a slave, we can go to our custom server
administration console and change the priority entries in the
database. Say db3 goes down: its priority is set to 0. Then we trip a
function that reloads this db data into the global scope. In this
case the TCF method WILL give the user process a new DSN, because db3
is now down. The TCF NEVER hits the DB for the values, only the
global scope in memory, so it adds only about 1ms of latency to the
entire process. When db3 is back up, we change the values in the db,
reload that data into the global scope, and all of the applications
do the right thing on the fly.
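The failover behavior can be sketched as follows. Again this is an
illustrative Python sketch under assumed names, not the real TCF: the
dicts stand in for the global scope, and zeroing a priority stands in
for the admin-console change plus reload.

```python
import random

# In-memory copies of the DB priority rows (the "global scope").
priorities = {"db2": 100, "db3": 100, "db4": 200}
assignments = {"alice": "db3"}  # alice has been reading from db3

def checkout(session_id):
    """Sticky weighted checkout that reroutes away from dead slaves."""
    dsn = assignments.get(session_id)
    if dsn is None or priorities.get(dsn, 0) == 0:
        # Assigned slave is down (priority 0) or unassigned: pick a
        # live slave, weighted by priority, with no DB round trip.
        live = {d: p for d, p in priorities.items() if p > 0}
        dsn = random.choices(list(live), weights=list(live.values()))[0]
        assignments[session_id] = dsn
    return dsn

# db3 goes down: the admin console sets its priority to 0 in the DB,
# and the reload function copies that into the in-memory scope.
priorities["db3"] = 0
# alice's next checkout transparently moves her to a live slave.
```

Restoring db3 is the same operation in reverse: set its priority back
to a nonzero value and reload, and new checkouts start including it
again.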
As a footnote, we have been absolutely blown away by the performance
of PrimeBase in this type of situation. The main database is
accepting over 20k images a day and replicating to 2 slaves, and the
slaves are never more than a few seconds behind, even during the most
trafficked periods of the day.
This works with DirectDBMS actions and regular search actions.
I hope that all makes sense; I am typing this while on a boring
support call...
Lastly, I did not make any feature requests for v6 because we are in
the process of moving everything to Zend/PHP and have no intention of
migrating to v6. The logic will be the same.
--
Robert Garcia
President - BigHead Technology
VP Application Development - eventpix.com
13653 West Park Dr
Magalia, Ca 95954
ph: 530.645.4040 x222 fax: 530.645.4040
[EMAIL PROTECTED] - [EMAIL PROTECTED]
http://bighead.net/ - http://eventpix.com/
On Jan 13, 2006, at 10:27 AM, William M Conlon wrote:
ok, I see how you would abstract this and replace DirectDBMS, for
example, with a method call specifying a DSN, query, and pass a
variable to receive the resultset. It seems like you also need to
tell the object whether it's a read or write query so it doesn't
have to parse the request.
Or maybe you only use this for the slaves (reading)? Because part
of the complexity is working around the inability of <@BIND> to
insert/update columns larger than 32k. So far I can only get this
to work with Insert/Update actions.
Since you've implemented this, have you suggested any feature
enhancements for version 6? It really seems to me that the app
should be insulated from the db servers, so in the ideal case,
witango would manage a DSN pool, each with a connection pool.
bill
On Jan 13, 2006, at 10:13 AM, Robert Garcia wrote:
We have this setup using PrimeBase replication. We wrote our own code
in Witango to manage it; it is held in a custom global scope and
allows us to change the reads on the fly across applications in case
a slave dies.
On Jan 13, 2006, at 9:41 AM, William M Conlon wrote:
Yeah, I was thinking I would have to set up two DSNs and have my code
explicitly choose the master for writing. But selecting a slave for
reading should be handled by something that manages DSN connections.
Maybe it shouldn't be Witango, but rather an enhanced DB connector
(J/ODBC), so that when there are many slaves it's transparent to the
application -- kind of like multiple witango servers.
On Jan 13, 2006, at 9:29 AM, John McGowan wrote:
I was thinking about using my L4 load balancer to handle this type of
stuff, but of course the load balancer can't really tell the
difference between a read and a write. So I was thinking that I might
have to set up two DSNs if I wanted this type of support: one that
would be load balanced, and another that would only go to the master.
Of course that would require me to do some rewriting or
search-and-replacing of my existing code.
/John
William M Conlon wrote:
I was reading up on MySQL database replication (master and slaves),
and was curious whether Witango had any facilities to support
replicated db clusters. I was thinking of something in the
dsconfig.ini, maybe, that would specify the master (for writes) and
the slaves (for reads).
Then we could just refer to our DSN and let Witango figure out where
to connect.
________________________________________________________________________
TO UNSUBSCRIBE: Go to http://www.witango.com/developer/maillist.taf
Bill
William M. Conlon, P.E., Ph.D.
To the Point
345 California Avenue Suite 2
Palo Alto, CA 94306
vox: 650.327.2175 (direct)
fax: 650.329.8335
mobile: 650.906.9929
e-mail: mailto:[EMAIL PROTECTED]
web: http://www.tothept.com