Re: [Sqlalchemy-users] Many to many problem?

Michael Bayer Sun, 04 Dec 2005 17:22:02 -0800

get ready for a long one...(this has a lot of unedited thoughtprocess in it)

Just to start off, the example in my previous email was just thestate of what it is *now*. so at the moment, the issue is not dealtwith very well. Theres probably going to be a lot of issues where Imgoing to come out with the ugly unfinished truth of it, but thatdoesnt mean thats how its going to stay..its just until we figure outhow it should really work. The criterion for a first release is, wethink we've nailed most of these issues and the API has stabilized,and all the way at the bottom of this email is a potential API changeto deal with this, so thats good.

when you have an object A that has a collection of B, and you changethe contents of the collection on A, then you do a commit, thedatabase is updated to reflect the changes of the collection on A.In that case, everything is fine, A is already up to date before thecommit even happened, since you modified its contents programmatically.

the problem arises when you have B sitting around which also containsa relationship, in the other direction, to one or more objects oftype A. Even though B was attached to A, B wasnt modified, but nowB's reference to A, which may be a list (many-to-many) or just ascalar (one-to-many from A's direction), is potentially in an invalidstate. At the moment, SQLAlchemy didnt even put "B" into thetransaction, since nothing was changed on it, so nothing happens to it.

I had figured that this issue would work itself out mostly in thatapplications would put object modifications within anobjectstore.begin()/objectstore.commit() pair, placed at the end of asession's lifecycle, which would be reset the next time a usersession started. This is how the ZBlog application gets around thewhole issue.

ZBlog also gets around the issue in another way; while it has two-way relationships, it only modifies those relationships in onedirection. So while a Blog has its owning User attached to it beforesaving, it never takes a User and appends a Blog to its 'blogs'attribute. The 'blogs' attribute is set up with a flag "live=True",which means that its a lazyload attribute that *always* loads,everytime you access it. So its not useful to write to it. The"live=True" idea represents my first attempt at managing this issue.

But now, both of you want to have two way relationships, *and* youwant to set them in both directions, *and* you want the objects to bein a completely valid state after commit. I tried to think of everypossible contingency before putting this thing out publically, butthere you go, only a week later and ive already been hit with like, adozen.

My first instinct on this, is for "B"s reference to A, or collectionof "A"'s, to be cleared out and reset with a lazy loader, so that itrefreshes from the database the next time it is loaded. Which Ididnt rush into doing, since it means I would take some in-memorylists, assume they are up-to-date with regards to the database, clearthem out and reset their loading. Also I would need to figure outsome way to detect that a B being added to A means that theres an Athat needs to be attached to B, which is not terribly straightforward(until the end of this email...).

Consider if class A has an attribute "listofB" and class B has anattribute "listofA", if you do something like this:


Aobj = A.mapper.select(...)
Bobj = B.mapper.select(...)

Bobj.listofA.append(new A())

objectstore.begin()
Aobj.listofB.append(Bobj)
objectstore.commit()

the way thats supposed to work is A's list of B gets saved to thedatabase, but B's list of A, which has a totally different A insideof it and was not in the scope of the transaction, should stillremain "pending".

So what should the application do with Bobj's listofA ? Heres awhole lot of options, and this is sort of just me thinking here:

1. Should it clear it out and reset it to re-lazy load ? That wouldremove the pending change of the new A() sitting in the list.2. Should it do #1, but if the list has pending changes raise anexception ? hm, maybe.3. Should it just append Aobj into the list, and not affect anycurrent changes to the list ? That might work but then, is theordering of the list correct ? Maybe thats the way to do it.4. Sort of like #2, should there be a flag on the attribute thatsays, "yes its OK to clear this out and re-load if the data changesin a commit" ? it seems like an application would *always* want thisto happen though.5. how about, it will re-lazy-load Bobj's listofA, but *not* clearout the existing contents, it will add the database results in,skipping those that were deleted from the list and maintaining thosethat were added. this is actually not too different from #3.6. should we use magic ! when you append Bobj to Aobj.listofB, it*automatically* sets the A on Bobj, either blowing away a previousscalar value, or appending to Bobj's listofA. The wholerelationship would be maintained without even touching the database.This is actually like #3 but has a more explicit contract, in that Bgets yanked into the transaction upon commit as well.

So, while this whole thing seems obvious, I havent decided how todeal with it yet.

#6 is very interesting. it basically means theres a "backreference"handler function associated with a relationship, which knows how topopulate the "backreference" upon a change. There is an example"examples/adjacencytree/byroot_tree.py" which does something similarto this, since it is representing a hierarchy of TreeNodes which allcontain a backreference to the root node. This example suggests thatthe "backreference" handler would have to be customizable.

I am going to ponder the "auto-backreference" attribute idea a littlefurther.


It might look something like:

Course.mapper = mapper(Course, courseTbl)
Student.mapper = mapper(Student, studentTbl, properties={

'courses': relation(Course.mapper, enrolTbl, lazy=False,backreference='students')

})

hey wow, guess what...the backreference right there can automaticallyadd the property 'students' to the Course.mapper as well. theproperty would be a lazy load and also have a backreference to'courses'.

backreference can also be a function, like this one, which receivesevents for "parent.children.append(child)" and "child.parent=parent"


def childappended(parent, child):
        child.root = parent.root
        child.parent = parent
def parentset(parent, child):
        parent.children.append(child)
        child.root = parent.root

TreeNode.mapper=mapper(TreeNode, treetable, properties=dict(
    id=treetable.c.node_id,

parent=relation(TreeNode,primaryjoin=treetable.c.parent_node_id==treetable.c.node_id,foreignkey=treetable.c.node_id, uselist=True, backreference=parentset),children=relation(TreeNode,primaryjoin=treetable.c.parent_node_id==treetable.c.node_id,uselist=True, private=True, backreference=childappended),

))

hey wow, this *might* work and not even require much code atall....basically, if it works in the ZBlog app, it'll probably workanywhere.


On Dec 4, 2005, at 6:13 PM, Robert Leftwich wrote:

Michael Bayer wrote:
nah, its working by design, but you might not like it. try theselines instead:
I'm not sure I understand the relationship between this and theoriginal Student/Courses issue/question. Are you saying that wehave to *manually* manage *all* m:n mapped relationships in theobjectstore by using delete+reload when the relationship is changedin any way?
Robert



-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep throughlog files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the web. DOWNLOADSPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
_______________________________________________
Sqlalchemy-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/sqlalchemy-users




-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
_______________________________________________
Sqlalchemy-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/sqlalchemy-users

Re: [Sqlalchemy-users] Many to many problem?

Reply via email to