Re: [Drizzle-discuss] Improving the Engine API

Paul McCullagh Wed, 09 Dec 2009 02:04:26 -0800

Hi Jay,

On Dec 8, 2009, at 5:23 PM, Jay Pipes wrote:

Whatcha say? :)
Absolutely, it will work. And maybe it is the best solution.
I would have suggested that Drizzle assumes that DDL works withtransactions, and that the engine handles the situation by doingone of the following:
- Execute the DDL normally, because it supports DDL in transactions,
- Return an error because it does not handle DLL in a transaction, or
- Silently commit the transaction, and begin it again at the end ofthe DML statement, without Drizzle even knowing about it.But, I guess the problem is replication. If Drizzle is not aware ofa transaction end, then it would be replicated as such, and we mayend up with a different result on the slave.The problem of DDL in a transaction only occurs when auto-commit isdisabled, or an explicit BEGIN is used, so lets look at a quickexample:
BEGIN;
INSERT t1 VALUES (1, 1);
INSERT t1 VALUES (2, 2);
ALTER TABLE t1 ADD COLUMN c3 INT;
INSERT t1 VALUES (3, 3, 3);
INSERT t1 VALUES (4, 4, 4);
COMMIT;
A silent COMMIT on DML will lead to the following:
BEGIN;
INSERT t1 VALUES (1, 1);
INSERT t1 VALUES (2, 2);
COMMIT;
ALTER TABLE t1 ADD COLUMN INT c3;
BEGIN;
INSERT t1 VALUES (3, 3, 3);
INSERT t1 VALUES (4, 4, 4);
COMMIT;
While I am no fan of a silent COMMIT, it may be the best solution,because at least this sequence of statements will be compatiblewith engines that support DDL in transactions and those that don't(much like MyISAM happily ignores BEGIN TRANSACTION).The alternative would be to return an error. This would prevent thesurprise affect that I get when the server crashes and I discovermy transaction was not atomic after all.
Well, I'm not a huge fan of implicit anything, as you know, but inthis case, since engines do have a certain leeway in how they advisethe kernel that they will handle a statement, I'm OK with continuingthe existing MySQL behaviour of implicitly committing transactionsbefore DDL statements are executed -- but in Drizzle's case, only ifthe engine advises it is unable to include the DDL in the currenttransaction.


Yup, I can live with that too...

I guess this is a question for the DBA's on the list ... inputplease! :)
++


Last chance to object... :)

Also, one other thing we need to discuss is the following, which youalluded to in an earlier email:
Suppose PBXT can handle ADD INDEX in an optimized fashion, but PBXTdoes not implement the remainder of the ALTER TABLE statement andprefers Drizzle's kernel to handle the other operations. In thisparticular case, we need a way of allowing the engine to communicatethat it would like to handle *some part* of a statement internally,and let the kernel handle other parts. This is an interestingproblem, and I can see at least three possible solutions. Let meknow what you think of either of these:
1) Establish two more flags for the StatementExecutionIntent:

INTENT_INTERNAL_AFTER_KERNEL
INTENT_KERNEL_AFTER_INTERNAL
In the first flag, the engine is telling the kernel that it wishesto execute some part of the Statement *after* the kernel hasfinished executing the statement. In the second flag, the engine istelling the kernel it wants first crack at the statement.
I can see this solution being of medium-difficulty to implement, aslots of edge cases would have to be tested...
2) Don't add new flags to StatementExecutionIntent, but instead havethe engine "do its thing" (e.g. optimizally implement the ADD INDEXpart of an ALTER TABLE) in the call to StorageEngine::endStatement().
3) Create a new plugin type: plugin::PostKernelStatementExecute:

namespace drizzled {
namespace plugin {

/**
* Modules implement a subclass of this class and register
* an instance of the class as a "listener" for when the
* kernel has completed execution of a Statement.
*
* For example, a storage engine might implement a subclass
* called OptimizedAddIndex which would listen for the kernel's
* completed execution of an ALTER TABLE statement and execute
* an optimal ADD INDEX clause for the ALTER TABLE statement.
*/
class PostKernelStatementExecute
{
public:
 bool operator()
   (Session &session, const message::Statement &statement);
};
}
}
Have the engine register a PostKernelStatementExecute trigger/hook.This PostKernelStatementExecute would be a subclass of the aboveplugin interface class and would allow the engine to react tocertain types of statements that it asks the kernel to execute"normally" but wants to add some optimized path for...
Thoughts?



I like (2) above, because it is simple. I see 2 variations:

A) As per (2) above, let the engine do the extra stuff instartStatement() and/or endStatement().


B) Add the following call sequence:

startStatement();
preStatement();
.... standard execution of the statement
postStatement();
endStatement();

Unless there is a reason why (A) or (B) will not work, then I think itis better than adding additional flags or method registration to theAPI.

Everyone knows how to override a method, so the documentation forpreStatement(), for example, is simple:

This method is called after startStatement() and before the standardexecution of the statement. Override it if you need to do something atthis point.



--
Paul McCullagh
PrimeBase Technologies
www.primebase.org
www.blobstreaming.org
pbxt.blogspot.com




_______________________________________________
Mailing list: https://launchpad.net/~drizzle-discuss
Post to     : [email protected]
Unsubscribe : https://launchpad.net/~drizzle-discuss
More help   : https://help.launchpad.net/ListHelp

Re: [Drizzle-discuss] Improving the Engine API

Reply via email to