I'm having similar thoughts to Ryan's. Let me ramble a bit; I'm not really sure how well this fits with the direction jOOQ is heading. I hope it helps to step back from the drawing board and take a look from a broader perspective - then again, maybe it doesn't. There's also the caveat that I have been unable to follow all the details of the discussion, so I may be missing important points or beating a dead horse. My apologies for that. That said, I hope it's more useful than distracting. Feel free to ignore.
Tailoring the store operation to a specific use case makes me very nervous; I've found time and again that my transaction model doesn't quite fit the model the library designer had in mind. I suspect this is partly because as soon as you have a cache for the purpose of avoiding unnecessary UPDATEs, you also get optimistic locking for free, so the entire transaction handling ends up modeled around optimistic locking. Which has the unfortunate side effect that all transaction models that do not use optimistic locking get less support.

So... I'm thinking about the transactional models I have been seeing and using.

One model is the classic pessimistic locking (PL) model. It's the one that databases support out of the box; other transactional models are built on top of it. It's unsuitable for online processing if a transaction can run longer than a handful of seconds. It's also surprisingly unsuitable for anything that runs longer than about four hours, because it (a) accumulates a gigantic heap of rollback log and (b) if you need to roll back, you usually end up with a failed batch run and not enough time to retry. The gist of all this is that while PL is the foundation, people have been splitting PL transactions into smaller, shorter transactions almost since RDBMSes have existed.

Then there's the idempotent transaction model. You run mostly nontransactionally, but you make sure that each processing step can be repeated without ill effect, and that a failure at any point won't damage the data. Essentially, you read the data without worrying about concurrent updates, and only the insert/update/delete operations need to run in a PL transaction (and sometimes even that isn't strictly required: the RDBMS's internal data structures still need transactional protection, but inconsistent application data can simply be corrected by rerunning the transaction).
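To make the idempotence point concrete, here's a minimal sketch (plain Java, not jOOQ API; the class and the in-memory "table" are made-up stand-ins for a real database write). The key property is that the step overwrites a derived row rather than incrementing it, so rerunning it after a crash leaves the data in exactly the same state:

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Sketch of an idempotent processing step: recompute a derived row from
// constant input and overwrite it. Repeating the step (e.g. after a failed
// batch run is restarted) produces the same final state.
public class IdempotentStep {
    // Stand-in for a database table of derived totals (orderId -> total).
    static final Map<Long, Long> derivedTotals = new ConcurrentHashMap<>();

    // The step: sum the line amounts and overwrite the target row.
    // Overwriting ("UPDATE totals SET total = ? WHERE order_id = ?"
    // semantics) rather than accumulating is what makes a rerun harmless.
    static void processOrder(long orderId, long[] lineAmounts) {
        long total = 0;
        for (long a : lineAmounts) {
            total += a;
        }
        derivedTotals.put(orderId, total); // short write, safe to repeat
    }
}
```

Only the final `put` (the write) would need a PL transaction in a real system; the read and the computation run outside any transaction.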
This works well for all processing steps that transform constant data into other, non-overlapping data. You need tight control over caching for this model: some data you want to keep cached from the beginning of the batch run until it finishes (configuration data that doesn't change except under administrator supervision); some data you don't want cached at all (the bulk rows that are streamed through the update process); some data you want to cache for a while and then throw away (information about a group of records - say, all data relating to a specific building while you iterate over the wares stored there and collect statistics).

Another model is read-process-write with optimistic locking. You read everything in one PL transaction, process the data, and write in a different PL transaction; you apply some clever tricks to fail when the write would overwrite data that has been modified by others. This is a very hairy approach: sometimes you want to fail only when the fields being overwritten have been modified elsewhere (e.g. the customer name); sometimes you want to fail if other fields in the same record were modified (don't overwrite just the house number if somebody else modified the street name); sometimes you want to fail if a record in a different table was updated (don't update the new price sum in the order record if some other order detail was modified - you might have to apply a different discount scheme now). Ultimately, it's the application developer's call which kinds of changes have to be checked before an update can go through. Calling UPDATE with the expected old values in the WHERE condition, then counting how many records were actually modified, is an excellent trick, but it doesn't cover all needs, just the most basic one.

Then there's the read-process-read-process-write model with optimistic locking.
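The WHERE-old-values trick amounts to a compare-and-set on the row. A minimal in-memory sketch of the idea (plain Java, not jOOQ API; a real version would issue `UPDATE customer SET name = ? WHERE id = ? AND name = ?` and check the row count that JDBC's `executeUpdate` returns):

```java
import java.util.HashMap;
import java.util.Map;

// Sketch of optimistic locking via an UPDATE whose WHERE clause includes
// the expected old value: the write succeeds only if the value we read
// earlier is still in place. The return value mimics the modified-row
// count a database reports.
public class OptimisticUpdate {
    // Stand-in for a customer table (id -> name).
    static final Map<Long, String> customers = new HashMap<>();

    // Returns the number of rows modified: 1 on success, 0 if the row was
    // changed (or deleted) since we read expectedOldName.
    static int updateName(long id, String expectedOldName, String newName) {
        String current = customers.get(id);
        if (current == null || !current.equals(expectedOldName)) {
            return 0; // concurrent modification: caller must fail or retry
        }
        customers.put(id, newName);
        return 1;
    }
}
```

A count of 0 is exactly the "somebody else changed this" signal - but, as noted above, it only detects changes to the fields you put into the WHERE clause, not to other fields or other tables.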
It doesn't impose any additional requirements on the database library, I think; it's just that the application programmer needs to be aware that the data he's working with might be outdated.

Then there's the cache-indefinitely case. You read some data, such as the list of company locations, which is never modified except under the control of an administrator, who'll manually flush any cache and tell J2SE application users to restart their clients so they get a fresh copy. Automation is possible but runs through extra channels that the DB library isn't aware of.

Then there's the permanent-sync case. The application side gets notified of changes to any records that it holds locally. This isn't really different from optimistic locking, except that the optimistic locking exception might fly earlier; in the read-process-read-process-write case, it might avoid some exceptions. The downside is that it's entirely nonstandardized and very hard to implement. On the plus side, the entire data movement between application and database is automated and can be optimized; essentially, you're running (or faking, if you will) a database cluster where the application keeps some records in the form of Java objects. It's the thing every developer wants and nobody gets :-)

What I'm after is that I wouldn't want jOOQ to support one specific transactional model preferentially. That runs the risk of weaker support for the other models. It's my main gripe about Hibernate, which works really well for read-process-write-with-optimistic-lock but sucks at every other model. I'm really worried that jOOQ is about to step onto the same slippery slope, though maybe with a different transactional model.

Just my $0.02.

Regards,
Jo

--
You received this message because you are subscribed to the Google Groups "jOOQ User Group" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.
