Tokens are, for the most part, immutable: once they are written, they
don't change unless they are revoked. Revocation is a fairly rare
occurrence, but it does happen.
Deleting tokens based on age should be fairly straightforward, and
locks should not need to be held for a significant amount of time.
My guess, however, is that the problem is SQLAlchemy:
# bulk delete through the ORM query API; synchronize_session=False
# tells SQLAlchemy not to update objects already loaded in the session
query = session.query(TokenModel)
query = query.filter(TokenModel.expires < timeutils.utcnow())
query.delete(synchronize_session=False)
If it is doing a fetch and then the delete, locks on those rows would
be held for a short period of time.
Direct SQL might be a better approach: prepare a statement,
"delete from token where expires < $1"
and then bind and execute it in one command.
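A minimal sketch of that through SQLAlchemy's text() construct,
assuming the same session and timeutils as in the snippet above:

from sqlalchemy import text

# one parameterized statement, bound and executed in a single call
session.execute(text("DELETE FROM token WHERE expires < :cutoff"),
                {"cutoff": timeutils.utcnow()})
session.commit()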
However, it seems to me that the conflict detection is the problem. I
don't know if there is a way to state "ignore any future queries that
would match these criteria." It does seem to me that even doing this
degree of conflict detection is somewhat violating the principle of
Isolation.
There might be an approach using table partitioning as well, where you
write only to partition one, delete only from partition two, and then swap.
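A rough sketch of the partition idea, in MySQL syntax issued through
the same session. This is illustrative only: the partition names and
boundary date are made up, and MySQL requires the partitioning column
to appear in every unique key, so the real token schema would need
changes before this could work.

from sqlalchemy import text

# range-partition the token table on the expiry column (MySQL syntax)
session.execute(text("""
    ALTER TABLE token PARTITION BY RANGE (TO_DAYS(expires)) (
        PARTITION p_old VALUES LESS THAN (TO_DAYS('2013-08-01')),
        PARTITION p_new VALUES LESS THAN MAXVALUE
    )
"""))
# dropping a whole partition is a metadata operation, so expired rows
# go away without any row-level locking
session.execute(text("ALTER TABLE token DROP PARTITION p_old"))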
On 07/08/2013 09:13 PM, Robert Collins wrote:
On 9 July 2013 12:32, Adam Young <[email protected]> wrote:
* I am asking about MySQL... presumably a "real" database.
I have to admit I am a bit of a PostgreSQL bigot. I don't really
consider MySQL a real database, although it has improved a lot
over the years. I am not up to speed on "InnoDB's gap locking
behavior," but it is not something I would expect to be a problem
in PostgreSQL.
PostgreSQL has similar but different characteristics, particularly the
latest iteration of isolation behaviour, where locks are held on *the
result of a query*, not on 'specific rows returned' - the difference
being that adding a new row that matches the query for rows to delete
would encounter a conflict. You also need to delete small numbers of
rows at a time, though the reason in the plumbing is different. There
are some nasty interlocks you can cause with very large deletes and
autovacuum too - if you trigger deadlock detection it still takes
/minutes/ to detect and clean up, whereas we want sub-second liveness.
Once every second would be strange indeed. I would think maybe
once every five minutes or so. Schedule your cleanup in accordance
with your deployment and usage.
Five-minute intervals exacerbate the issue until it's solved. If the
cleanup deletes no more than (say) 1000 rows per iteration, it could
run every five minutes but, when run, keep going until the db is clean.
Deleting a chunk of tokens in bulk would be preferable to doing
client-side iteration; I can't see how that would not be the case.
Right, so I think Clint prefers that too; the question is how to get
SQLAlchemy to output the appropriate SQL for PostgreSQL and MySQL,
which is different.
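One way to get there is to check the dialect at runtime and emit the
matching statement. This is only a sketch, assuming a session bound to
an engine, the timeutils helper from earlier, and an arbitrary
1000-row chunk size:

from sqlalchemy import text

CHUNK = 1000

if session.bind.dialect.name == 'mysql':
    # MySQL allows LIMIT directly on DELETE
    stmt = text("DELETE FROM token WHERE expires < :cutoff LIMIT :n")
else:
    # PostgreSQL has no DELETE ... LIMIT, so bound it with a subquery
    stmt = text("DELETE FROM token WHERE id IN "
                "(SELECT id FROM token WHERE expires < :cutoff LIMIT :n)")

while True:
    result = session.execute(stmt, {"cutoff": timeutils.utcnow(),
                                    "n": CHUNK})
    session.commit()
    if result.rowcount < CHUNK:
        break  # table is clean; stop until the next scheduled run

Committing between chunks keeps each transaction short, which is the
point of bounding the delete in the first place.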
-Rob
--
Robert Collins <[email protected] <mailto:[email protected]>>
Distinguished Technologist
HP Cloud Services
_______________________________________________
OpenStack-dev mailing list
[email protected]
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev