Re: [sqlalchemy] Re: ORM: read from view but write into separate table

Stanislav Lobanov Fri, 07 Dec 2018 07:12:10 -0800

That is great, i hope it will help someone as it helped me :)

P.S.: i have played a bit with your POC implementation and it works very 
well. I'm thinking about adding MapperOptions to add low-level control on 
how specific query entity should be selected (for example, i want latest 
version of this entity and all versions of that entity).


пятница, 7 декабря 2018 г., 17:59:58 UTC+3 пользователь Mike Bayer написал:
>
> I'm going to add this to the examples at 
>
> https://docs.sqlalchemy.org/en/latest/orm/examples.html#module-examples.versioned_rows
>  
> since it has this new twist of doing both UPDATE and INSERT, if that's 
> OK with you 
> On Fri, Dec 7, 2018 at 1:20 AM Stanislav Lobanov <[email protected] 
> <javascript:>> wrote: 
> > 
> > Thank you so much, Mike! 
> > 
> > Did not tried it yet, but it looks like you nailed it :) I'll check the 
> sources on holidays. 
> > 
> > With the deepest respect, 
> > Stanislav. 
> > 
> > четверг, 6 декабря 2018 г., 22:41:20 UTC+3 пользователь Mike Bayer 
> написал: 
> >> 
> >> given that it looks like a new version for you means an UPDATE of the 
> >> old row and an INSERT of the new, here is that, which is again 
> >> basically what we see at 
> >> 
> https://docs.sqlalchemy.org/en/latest/_modules/examples/versioned_rows/versioned_rows.html
>  
> >> with some extra steps to emit the UPDATE for the old row.     this 
> >> recipe combines both the versioned event handler and the query 
> >> handler. 
> >> 
> >> from sqlalchemy import ( 
> >>     create_engine, Integer, String, event, ForeignKey, Column, 
> DateTime, 
> >>     inspect, func, select, cast 
> >> ) 
> >> from sqlalchemy.orm import ( 
> >>     make_transient, Session, relationship, attributes, backref, 
> >>     make_transient_to_detached, Query, selectinload 
> >> ) 
> >> from sqlalchemy.ext.declarative import declarative_base 
> >> import datetime 
> >> import time 
> >> 
> >> Base = declarative_base() 
> >> 
> >> # this will be the current time as the test runs 
> >> now = None 
> >> 
> >> 
> >> class VersionedStartEnd(object): 
> >>     def __init__(self, **kw): 
> >>         # reduce some verbosity when we make a new object 
> >>         kw.setdefault("start", now - datetime.timedelta(days=3)) 
> >>         kw.setdefault("end", now + datetime.timedelta(days=3)) 
> >>         super(VersionedStartEnd, self).__init__(**kw) 
> >> 
> >>     def new_version(self, session): 
> >> 
> >>         # our current identity key, which will be used on the "old" 
> >>         # version of us to emit an UPDATE. this is just for assertion 
> purposes 
> >>         old_identity_key = inspect(self).key 
> >> 
> >>         # make sure self.start / self.end are not expired 
> >>         self.id, self.start, self.end 
> >> 
> >>         # turn us into an INSERT 
> >>         make_transient(self) 
> >> 
> >>         # make the "old" version of us, which we will turn into an 
> >>         # UPDATE 
> >>         old_copy_of_us = self.__class__( 
> >>             id=self.id, start=self.start, end=self.end) 
> >> 
> >>         # turn old_copy_of_us into an UPDATE 
> >>         make_transient_to_detached(old_copy_of_us) 
> >> 
> >>         # the "old" object has our old identity key (that we no longer 
> have) 
> >>         assert inspect(old_copy_of_us).key == old_identity_key 
> >> 
> >>         # now put it back in the session 
> >>         session.add(old_copy_of_us) 
> >> 
> >>         # now update the 'end' - SQLAlchemy sees this as a PK switch 
> >>         old_copy_of_us.end = now 
> >> 
> >>         # fun fact!  the new_version() routine is *not* called for 
> >>         # old_copy_of_us!  because we are already in the before_flush() 
> hook! 
> >>         # this surprised even me.   I was thinking we had to guard 
> against 
> >>         # it.  Still might be a good idea to do so. 
> >> 
> >>         self.start = now 
> >>         self.end = now + datetime.timedelta(days=2) 
> >> 
> >> 
> >> @event.listens_for(Session, "before_flush") 
> >> def before_flush(session, flush_context, instances): 
> >>     for instance in session.dirty: 
> >>         if not isinstance(instance, VersionedStartEnd): 
> >>             continue 
> >>         if not session.is_modified(instance, passive=True): 
> >>             continue 
> >> 
> >>         if not attributes.instance_state(instance).has_identity: 
> >>             continue 
> >> 
> >>         # make it transient 
> >>         instance.new_version(session) 
> >>         # re-add 
> >>         session.add(instance) 
> >> 
> >> 
> >> @event.listens_for(Query, "before_compile", retval=True) 
> >> def before_compile(query): 
> >>     """ensure all queries for VersionedStartEnd include criteria """ 
> >> 
> >>     for ent in query.column_descriptions: 
> >>         entity = ent['entity'] 
> >>         if entity is None: 
> >>             continue 
> >>         insp = inspect(ent['entity']) 
> >>         mapper = getattr(insp, 'mapper', None) 
> >>         if mapper and issubclass(mapper.class_, VersionedStartEnd): 
> >>             query = query.enable_assertions(False).filter( 
> >>                 func.now().between(ent['entity'].start, 
> ent['entity'].end) 
> >>             ) 
> >> 
> >>     return query 
> >> 
> >> 
> >> class Parent(VersionedStartEnd, Base): 
> >>     __tablename__ = 'parent' 
> >>     id = Column(Integer, primary_key=True, autoincrement=True) 
> >>     start = Column(DateTime, primary_key=True) 
> >>     end = Column(DateTime, primary_key=True) 
> >>     data = Column(String) 
> >> 
> >>     child_n = Column(Integer) 
> >> 
> >>     child = relationship( 
> >>         "Child", 
> >>         primaryjoin=( 
> >>             "Child.id == foreign(Parent.child_n)" 
> >>         ), 
> >> 
> >>         # note the primaryjoin can also be: 
> >>         # 
> >>         #  "and_(Child.id == foreign(Parent.child_n), " 
> >>         #  "func.now().between(Child.start, Child.end))" 
> >>         # 
> >>         # however the before_compile() above will take care of this for 
> us in 
> >>         # all cases except for joinedload.  You *can* use the above 
> primaryjoin 
> >>         # as well, it just means the criteria will be present twice for 
> most 
> >>         # parent->child load operations 
> >>         # 
> >> 
> >>         uselist=False, 
> >>         backref=backref('parent', uselist=False) 
> >>     ) 
> >> 
> >> 
> >> class Child(VersionedStartEnd, Base): 
> >>     __tablename__ = 'child' 
> >> 
> >>     id = Column(Integer, primary_key=True, autoincrement=True) 
> >>     start = Column(DateTime, primary_key=True) 
> >>     end = Column(DateTime, primary_key=True) 
> >>     data = Column(String) 
> >> 
> >>     def new_version(self, session): 
> >> 
> >>         # expire parent's reference to us 
> >>         session.expire(self.parent, ['child']) 
> >> 
> >>         # create new version 
> >>         VersionedStartEnd.new_version(self, session) 
> >> 
> >>         # re-add ourselves to the parent 
> >>         self.parent.child = self 
> >> 
> >> times = [] 
> >> 
> >> 
> >> def time_passes(s): 
> >>     """keep track of timestamps in terms of the database and allow time 
> to 
> >>     pass between steps.""" 
> >> 
> >>     # close the transaction, if any, since PG time doesn't increment in 
> the 
> >>     # transaction 
> >>     s.commit() 
> >> 
> >>     # get "now" in terms of the DB so we can keep the ranges low and 
> >>     # still have our assertions pass 
> >>     if times: 
> >>         time.sleep(1) 
> >>     times.append(s.scalar(select([cast(func.now(), DateTime)]))) 
> >>     if len(times) > 1: 
> >>         assert times[-1] > times[-2] 
> >>     return times[-1] 
> >> 
> >> e = create_engine("postgresql://scott:tiger@localhost/test", 
> echo='debug') 
> >> Base.metadata.drop_all(e) 
> >> Base.metadata.create_all(e) 
> >> 
> >> s = Session(e) 
> >> 
> >> now = time_passes(s) 
> >> 
> >> c1 = Child(data='child 1') 
> >> p1 = Parent(data='c1', child=c1) 
> >> 
> >> s.add(p1) 
> >> s.commit() 
> >> 
> >> # assert raw DB data 
> >> assert s.query(Parent.__table__).all() == [ 
> >>     (1, 
> >>      times[0] - datetime.timedelta(days=3), 
> >>      times[0] + datetime.timedelta(days=3), 
> >>      'c1', 1) 
> >> ] 
> >> assert s.query(Child.__table__).all() == [ 
> >>     (1, 
> >>      times[0] - datetime.timedelta(days=3), 
> >>      times[0] + datetime.timedelta(days=3), 
> >>      'child 1') 
> >> ] 
> >> 
> >> 
> >> now = time_passes(s) 
> >> 
> >> p1_check = s.query(Parent).first() 
> >> assert p1_check is p1 
> >> assert p1_check.child is c1 
> >> 
> >> p1.child.data = 'elvis presley' 
> >> 
> >> s.commit() 
> >> 
> >> p2_check = s.query(Parent).first() 
> >> assert p2_check is p1_check 
> >> c2_check = p2_check.child 
> >> 
> >> # same object 
> >> assert p2_check.child is c1 
> >> 
> >> # new data 
> >> assert c1.data == 'elvis presley' 
> >> 
> >> # new end time 
> >> assert c1.end == now + datetime.timedelta(days=2) 
> >> 
> >> # assert raw DB data 
> >> assert s.query(Parent.__table__).all() == [ 
> >>     (1, 
> >>      times[0] - datetime.timedelta(days=3), 
> >>      times[0] + datetime.timedelta(days=3), 
> >>      'c1', 1) 
> >> ] 
> >> assert s.query(Child.__table__).order_by(Child.end).all() == [ 
> >>     (1, 
> >>      times[0] - datetime.timedelta(days=3), 
> >>      times[1], 
> >>      'child 1'), 
> >>     (1, 
> >>      times[1], 
> >>      times[1] + datetime.timedelta(days=2), 
> >>      'elvis presley') 
> >> ] 
> >> 
> >> now = time_passes(s) 
> >> 
> >> p1.data = 'c2 elvis presley' 
> >> 
> >> s.commit() 
> >> 
> >> # assert raw DB data.  now there are two parent rows. 
> >> assert s.query(Parent.__table__).order_by(Parent.end).all() == [ 
> >>     (1, 
> >>      times[0] - datetime.timedelta(days=3), 
> >>      times[2], 
> >>      'c1', 1), 
> >>     (1, 
> >>      times[2], 
> >>      times[2] + datetime.timedelta(days=2), 
> >>      'c2 elvis presley', 1) 
> >> ] 
> >> assert s.query(Child.__table__).order_by(Child.end).all() == [ 
> >>     (1, 
> >>      times[0] - datetime.timedelta(days=3), 
> >>      times[1], 
> >>      'child 1'), 
> >>     (1, 
> >>      times[1], 
> >>      times[1] + datetime.timedelta(days=2), 
> >>      'elvis presley') 
> >> ] 
> >> 
> >> # add some more rows to test that these aren't coming back for 
> >> # queries 
> >> s.add(Parent(data='unrelated', child=Child(data='unrelated'))) 
> >> s.commit() 
> >> 
> >> 
> >> # Query only knows about one parent for id=1 
> >> p3_check = s.query(Parent).filter_by(id=1).one() 
> >> 
> >> assert p3_check is p1 
> >> assert p3_check.child is c1 
> >> 
> >> # and one child. 
> >> c3_check = s.query(Child).filter(Child.parent == p3_check).one() 
> >> assert c3_check is c1 
> >> 
> >> # one child one parent.... 
> >> c3_check = s.query(Child).join(Parent.child).filter( 
> >>     Parent.id == p3_check.id).one() 
> >> 
> >> # try selectinload eager loading across multiple parents 
> >> for parent in s.query(Parent).options(selectinload(Parent.child)): 
> >>     if parent.data == 'unrelated': 
> >>         assert parent.child.data == 'unrelated' 
> >>     elif parent.data == 'c2 elvis presley': 
> >>         assert parent.child.data == 'elvis presley' 
> >>     else: 
> >>         assert False 
> >> 
> >> 
> >> 
> >> 
> >> On Thu, Dec 6, 2018 at 1:40 PM Mike Bayer <[email protected]> 
> wrote: 
> >> > 
> >> > On Thu, Dec 6, 2018 at 3:53 AM Stanislav Lobanov <[email protected]> 
> wrote: 
> >> > > 
> >> > > Example business case is: 
> >> > > 
> >> > > Parent and child are added to the system (current date is 
> 2018-01-01) 
> >> > > 
> >> > > Parent 
> >> > > id  |  start         | end         | data    | child_id 
> >> > > 1   |  2018-01-01    | 2018-01-11  | c1      | 1           # just 
> pointer to child with some id (now points to first child record) 
> >> > > 
> >> > > Child 
> >> > > id  |  start         | end         | data       | 
> >> > > 1   |  2018-01-01    | 2018-01-11  | Elvis P.   | # this is current 
> version 
> >> > > 
> >> > > Then on 2018-01-02 children's name is changed from "Elvis P." to 
> "Elvis Presley". That change creates second version of child with ID=1: 
> >> > > 
> >> > > Parent 
> >> > > id  |  start         | end         | data    | child_id 
> >> > > 1   |  2018-01-01    | 2018-01-11  | c1      | 1           # just 
> pointer to child with some id (now logically points to second child 
> version) 
> >> > > 
> >> > > Child  # please notice that id is not changed as this is same child 
> >> > > id  |  start         | end         | data           | 
> >> > > 1   |  2018-01-01    | 2018-01-02  | Elvis P.       | # this is not 
> current (latest) version anymore 
> >> > > 1   |  2018-01-02    | 2018-01-11  | Elvis Presley  | # but this is 
> >> > > 
> >> > > 
> >> > > See? Parent does not care about what changes were made to child, it 
> is just cares to relate to the latest version of child, so there are no 
> composite foreign key to child table (this restriction comes from legacy 
> system and i can not add "child_start" and "child_end" columns to form full 
> composite FK to child table). 
> >> > > 
> >> > > When child is updated (effectively UPDATE is converted to INSERT) 
> then only child table is modified, so there are no cascades to parent, 
> because parent just targets to the row where Child.id == 1. 
> >> > 
> >> > from the data above, this is not strictly the case.  the row with 
> >> > "Elvis P.", the "end" date has been UPDATED from 2018-01-11 to 
> >> > 2018-01-02.  is that correct?  So you need an UPDATE *and* an INSERT. 
> >> >  Am seeing if I can make that happen. 
> >> > 
> >> > 
> >> > > 
> >> > > And this is my problem, because i do not know to to make such 
> "implicit" relationships in sqlalchemy. By implicit i mean a situation when 
> relation is made by child.id AND latest date range for related object. 
> >> > > 
> >> > > Currently we would be using following query to retreive information 
> about parent and it's child (in it's latest state): 
> >> > > 
> >> > > select * from parent, child 
> >> > > where parent.child_id=child.id 
> >> > > and now() between parent.start and parent.end -- gives us latest 
> parent state 
> >> > > and now() between child.start and child.end -- gives us latest 
> child state 
> >> > > 
> >> > > As you can see such a queries is hard to write, it is repetitive 
> and error prone. 
> >> > > 
> >> > > Also, i'm using versioned approach in one of the projects (using 
> your example). Everything related to data consistency, data integrity, data 
> querying must be done by hands, because i did not found a way to provide 
> cascades and correct relationship behaviour without full composite FK. 
> >> > > 
> >> > > Thanks! 
> >> > > 
> >> > > 
> >> > > вторник, 4 декабря 2018 г., 15:36:42 UTC+3 пользователь Stanislav 
> Lobanov написал: 
> >> > >> 
> >> > >> Hello. 
> >> > >> 
> >> > >> I have a table with schema: 
> >> > >> 
> >> > >> name: users 
> >> > >> fields:  id int, name text, start datetime, end datetime 
> >> > >> primary key: id, start, end 
> >> > >> 
> >> > >> This is kind of a historical table, where each row defines 
> separate "historical version" of an object. There are a single business 
> User entity (model) with possibly many historical versions. 
> >> > >> 
> >> > >> Such table structure makes it very hard to define relationships 
> and work with related objects. Also it is hard to work with "current" 
> version of User entity, because to retreive it we need to query it with 
> "now() between start and end" constraint. 
> >> > >> 
> >> > >> So i thought that maybe i can create a view for that table that 
> will hold only current versions and map that view onto User entity to hide 
> all historical complexities and compound PK from sqlalchemy. 
> >> > >> 
> >> > >> The question: is it possible to implement a mapping that will read 
> from view but write into real table? 
> >> > >> 
> >> > >> For example, view can have fields id (pk) and name. 
> >> > >> 
> >> > >> I know that there are great examples of versioning with sqlalchemy 
> but i want to hide non-functional implementation details from my business 
> code/entities with view. 
> >> > >> 
> >> > >> Thanks! 
> >> > > 
> >> > > -- 
> >> > > SQLAlchemy - 
> >> > > The Python SQL Toolkit and Object Relational Mapper 
> >> > > 
> >> > > http://www.sqlalchemy.org/ 
> >> > > 
> >> > > To post example code, please provide an MCVE: Minimal, Complete, 
> and Verifiable Example. See http://stackoverflow.com/help/mcve for a full 
> description. 
> >> > > --- 
> >> > > You received this message because you are subscribed to the Google 
> Groups "sqlalchemy" group. 
> >> > > To unsubscribe from this group and stop receiving emails from it, 
> send an email to [email protected]. 
> >> > > To post to this group, send email to [email protected]. 
> >> > > Visit this group at https://groups.google.com/group/sqlalchemy. 
> >> > > For more options, visit https://groups.google.com/d/optout. 
> > 
> > -- 
> > SQLAlchemy - 
> > The Python SQL Toolkit and Object Relational Mapper 
> > 
> > http://www.sqlalchemy.org/ 
> > 
> > To post example code, please provide an MCVE: Minimal, Complete, and 
> Verifiable Example. See http://stackoverflow.com/help/mcve for a full 
> description. 
> > --- 
> > You received this message because you are subscribed to the Google 
> Groups "sqlalchemy" group. 
> > To unsubscribe from this group and stop receiving emails from it, send 
> an email to [email protected] <javascript:>. 
> > To post to this group, send email to [email protected] 
> <javascript:>. 
> > Visit this group at https://groups.google.com/group/sqlalchemy. 
> > For more options, visit https://groups.google.com/d/optout. 
>

-- 
SQLAlchemy - 
The Python SQL Toolkit and Object Relational Mapper

http://www.sqlalchemy.org/

To post example code, please provide an MCVE: Minimal, Complete, and Verifiable 
Example.  See  http://stackoverflow.com/help/mcve for a full description.
--- 
You received this message because you are subscribed to the Google Groups 
"sqlalchemy" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/sqlalchemy.
For more options, visit https://groups.google.com/d/optout.

Re: [sqlalchemy] Re: ORM: read from view but write into separate table

Reply via email to