Re: Derby Transaction Log Shipping

Duncan Groenewald Wed, 13 Feb 2008 16:01:44 -0800

Thanks for that explanation - it pretty much confirms what I expected, which was that booting the secondary means that things get changed and that logs can no longer be loaded.


Duncan


On 08/02/2008, at 11:33 PM, Jørgen Løland wrote:

Hi Duncan,
First of all, the scenario you describe seems (to me) to be solved by the new replication functionality. However, I think it can be done the hard way with a plan similar to what you describe. Here goes :)
Log files can be found in <database_dir>/log. When you enable log archive mode, the log files will not be deleted. Hence, you do not need to perform a backup on day 2 and 3 - you may simply copy the log files from the <database_dir>/log directory.
So, ideally, the steps would be like this:
Day 1: make a backup, copy it to the secondary location. Boot the secondary db and check that it is all ok
Day 2: copy the log files generated since the backup was made
Day 3: copy the log files generated since the backup was made
Day 4: boot secondary db, which now is in the same state as the primary was in when the log was copied on day 3.
With a few modifications, this should work just fine:
Problem, day 1: Assuming that users are allowed access to the primary database when you make the first backup (as indicated by your scenario), the data pages and log files will contain information from uncommitted transactions. When you boot the secondary to check that everything is ok, Derby will go through the same steps as when doing crash recovery. That means going through a redo phase (redoing operations in the log that are not reflected in the data pages) and an undo phase (basically aborting transactions that were active at the time the backup was made). The undo phase is key here because Derby does operations on the data pages of the secondary that were not done on the primary. This is fine if you want to use the secondary, but not if you want to keep sending it log files.
Solution: Don't allow any active transactions when you make the initial backup or (probably better in your scenario) don't boot the secondary database to check if it is ok. Wait until the primary has failed before booting it.
Problem, day 2 and 3: The log file with the highest number copied on day 1 (say logN.dat) may have been modified since you copied it.
Solution: Overwrite the secondary log file logN.dat with logN.dat from the primary database.
I think that should do it, but if you do not require this NOW, I would rather wait for replication in 10.4.
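The copy step for days 2 and 3 could be scripted roughly as follows. This is a minimal sketch, not a tested procedure: the function names `log_number` and `ship_logs` are hypothetical, it assumes the log files are named logN.dat as in the example above, and it ignores any other files in the log directory. It copies the highest-numbered file already on the secondary again, since that file may have changed on the primary.

```python
import re
import shutil
from pathlib import Path

# Assumption: log files are named log<N>.dat, as in the logN.dat example.
LOG_NAME = re.compile(r"^log(\d+)\.dat$")


def log_number(path):
    """Return N for a file named logN.dat, or None for any other file."""
    match = LOG_NAME.match(Path(path).name)
    return int(match.group(1)) if match else None


def ship_logs(primary_log_dir, secondary_log_dir):
    """Copy logN.dat files from the primary log directory to the secondary.

    Files numbered below the highest one already on the secondary are
    skipped. The highest one itself is overwritten, and all newer files
    are copied. Returns the copied file names in numeric order.
    """
    primary = Path(primary_log_dir)
    secondary = Path(secondary_log_dir)
    secondary.mkdir(parents=True, exist_ok=True)

    # Highest log number already shipped (0 if nothing has been shipped).
    shipped = [log_number(p) for p in secondary.iterdir()]
    shipped = [n for n in shipped if n is not None]
    start = max(shipped) if shipped else 0

    # Index the primary's log files by number so they sort numerically.
    numbered = {}
    for path in primary.iterdir():
        n = log_number(path)
        if n is not None:
            numbered[n] = path

    copied = []
    for n in sorted(numbered):
        if n >= start:
            src = numbered[n]
            shutil.copy2(src, secondary / src.name)
            copied.append(src.name)
    return copied
```

The sketch does not deal with copying a file while Derby is still writing to it, and it relies on the secondary staying unbooted between runs, as described above.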
Good luck,
Jørgen


Duncan Groenewald wrote:
I still don't know if I really understand the Derby model as it seems the transaction logs are archived when a database backup is run. So here is a scenario:
Day 1: Backup Primary Derby (enabling logging), copy backup database to secondary server and boot secondary server to check it is all OK.
Day 2: Backup Primary Derby DB and copy archived log files to secondary server.
Day 3: Backup Primary Derby DB and copy new archived log files to secondary server.
Day 4: Boot secondary Derby DB to check it's OK... In theory then the boot process will replay all the log files and the database should be in the same state as the Primary was on Day 3?
Somehow I don't think this would actually work - but I will give it a try...
Here is the scenario I am trying to cater for:
24x7 realtime system needs to be relocated to another site (or needs to have a warm standby system that can be enabled in 15 minutes or less).
Basic approach is to have two databases running and logs from the primary are loaded on the secondary within a couple of minutes of them being written.
Transaction dumps on primary database are written to timestamped files and file is renamed TRXDUMP20080206091545212_DONE.DAT once dump write process has completed. A script checks for presence of *_DONE.DAT files every 30 seconds and copies file to remote server's file system (or this gets done by the dump process as well). Script on the remote server checks for presence of *_DONE.DAT files every 30 seconds and runs a Transaction Load process on remote database to load the dump files. At any given point in time the remote site is always within a few minutes of the primary site.
It seems unlikely one could do this with Derby because there are no commands to periodically dump the transaction logs or to load the transaction logs.
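The file-shipping half of this scheme does not depend on the database. Below is a minimal sketch of one polling pass, assuming the *_DONE.DAT naming described above. The function names and the `.part` temporary suffix are hypothetical. The dump and load steps themselves are not shown because, as noted, Derby has no such commands.

```python
import shutil
import time
from pathlib import Path


def ship_done_files(outbox, inbox):
    """Copy every *_DONE.DAT file from outbox to inbox, oldest first.

    Files still being written (not yet renamed to *_DONE.DAT) are left
    alone, and files already present in inbox are skipped. Returns the
    names copied during this pass.
    """
    outbox, inbox = Path(outbox), Path(inbox)
    inbox.mkdir(parents=True, exist_ok=True)

    copied = []
    # Fixed-width timestamps in the names make lexical order chronological.
    for src in sorted(outbox.glob("*_DONE.DAT")):
        dest = inbox / src.name
        if dest.exists():
            continue
        # Copy under a temporary name, then rename, so a reader polling
        # inbox for *_DONE.DAT never sees a half-copied file.
        tmp = dest.with_name(dest.name + ".part")
        shutil.copy2(src, tmp)
        tmp.rename(dest)
        copied.append(src.name)
    return copied


def poll(outbox, inbox, interval_seconds=30, rounds=None):
    """Call ship_done_files repeatedly; rounds=None means run forever."""
    done = 0
    while rounds is None or done < rounds:
        ship_done_files(outbox, inbox)
        done += 1
        if rounds is None or done < rounds:
            time.sleep(interval_seconds)
```

Calling `poll(outbox, inbox)` would repeat the pass every 30 seconds. The rename-on-completion trick is the same one the dump process above uses to signal that a file is ready.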
Cheers
On 08/02/2008, at 7:05 PM, Knut Anders Hatlen wrote:
Duncan Groenewald <[EMAIL PROTECTED]> writes:
Thanks - the specification looks like it's close to what I would like.
The model I work from is one used by Sybase (and possibly others)
where you can specify a database dump and a separate transaction log
dump at defined intervals using a script or some other programmatic
method. From what I can tell it's not possible to do this with Derby,
since you can only dump the database and not the logs. It's also
unclear how you would load a log file on its own.

What I would like to see is two additional commands added to dump
transaction logs to specified directory or file name and another
command to load a transaction log file from a specified location/
file.  Ideally a transaction log file load should function much the
same way a normal user does to allow concurrent user access while
loading a transaction log file.
Not exactly what you want (it won't allow concurrent user access while
loading the transaction log), but you may achieve something similar with
log archiving and roll-forward recovery, combined with some creative
scripts. I haven't tried it myself, but you may get some ideas here:
http://db.apache.org/derby/docs/dev/adminguide/cadminrollforward.html
--
Knut Anders
Duncan Groenewald
mobile: +61406291205
email: [EMAIL PROTECTED]
--
Jørgen Løland


Duncan Groenewald
mobile: +61406291205
email: [EMAIL PROTECTED]
