[openstack-dev] [oslo][all] The lock files saga (and where we can go from here)

Joshua Harlow Mon, 30 Nov 2015 10:47:57 -0800

Hi all,

I just wanted to bring up an issue, possible solution and get feedbackon it from folks because it seems to be an on-going problem that showsup not when an application is initially deployed but as on-goingoperation and running of that application proceeds (ie after running fora period of time).


The jist of the problem is the following:

A <<pick your favorite openstack project>> has a need to ensure that noapplication on the same machine can manipulate a given resource on thatsame machine, so it uses the lock file pattern (acquire a *local* lockfile for that resource, manipulate that resource, release that lockfile) to do actions on that resource in a safe manner (note this doesnot ensure safety outside of that machine, lock files are *not*distributed locks).


The api that we expose from oslo is typically accessed via the following:

oslo_concurrency.lockutils.synchronized(name, lock_file_prefix=None,external=False, lock_path=None, semaphores=None, delay=0.01)

or via its underlying library (that I extracted from oslo.concurrencyand have improved to add more usefulness) @http://fasteners.readthedocs.org/

The issue though for <<your favorite openstack project>> is that each ofthese projects now typically has a large amount of lock files that existor have existed and no easy way to determine when those lock files canbe deleted (afaik no? periodic task exists in said projects to clean uplock files, or to delete them when they are no longer in use...) so whathappens is bugs like https://bugs.launchpad.net/cinder/+bug/1432387appear and there is no a simple solution to clean lock files up (sinceoslo.concurrency is really not the right layer to know when a lock canor can not be deleted, only the application knows that...)


So then we get a few creative solutions like the following:

- https://review.openstack.org/#/c/241663/
- https://review.openstack.org/#/c/239678/
- (and others?)

So I wanted to ask the question, how are people involved in <<yourfavorite openstack project>> cleaning up these files (are they at all?)


Another idea that I have been proposing also is to use offset locks.

This would allow for not creating X lock files, but create a *single*lock file per project and use offsets into it as the way to lock. Forexample nova could/would create a 1MB (or larger/smaller) *empty* filefor locks, that would allow for 1,048,576 locks to be used at the sametime, which honestly should be way more than enough, and then therewould not need to be any lock cleanup at all... Is there any reason thiswasn't initially done back way when this lock file code was created?(https://github.com/harlowja/fasteners/pull/10 adds this functionalityto the underlying library if people want to look it over)


In general would like to hear peoples thoughts/ideas/complaints/other,

-Josh

__________________________________________________________________________
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: [email protected]?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

[openstack-dev] [oslo][all] The lock files saga (and where we can go from here)

Reply via email to