Re: [AOLSERVER] Data "corruption" with fastpath caching

Jim Davidson Thu, 21 Aug 2008 15:52:52 -0700

Hi,

Yes -- the original reason for dev/inode on Unix instead of filenamewas to reduce memory consumed in the case of a large # of symlinks orhardlinks to the same file. This was the case for AOL'sdigitalcity.com back in 1999. For better or worse, the "AOL" inAOLserver means AOL was used as the model and we never had an exampleof dynamically creating files to be returned by fastpath (this isn't100% true -- see below).

Now, had we known 10 years ago that inodes could be rapidly reused asJohn pointed out, we would have never written the code to use dev/inode as opposed to filename keys. It simply cannot be strictlycorrect given the non-atomic nature of the stat() before the open().It may have been available as an option, but certainly off by defaultand likely undocumented.


So, here's what I'd suggest:

-- Cache by filename key should be the default. This is technicallythe correct fix to enable temporary, uniquely named files, to bereturned via ns_returnfile.-- John's "grace period" code is a clever optimization if fastpath isbeing used in this way and could also be an option, default off.

To clarify one point: There is no technical solution to creating tempfiles with the same name and avoiding the race condition withoutadditional synchronization. This is expecting a "private per-threadview" of the global filesystem namespace which isn't strictly possibleeven if some sort of heuristics (such as the grace period trick above)nearly eliminate the possibility in practice. The Unix file systemhas always had atomic features which have traditionally been used tosolve for race conditions using the uniqueness of filenames (viaO_EXCL open/creat flag and atomic rename). It's reasonable to assumeprogrammers understand and leverage that fact; it's not reasonable toattempt to solve for cases they don't. You can see some carefultempfile creation code in Ns_GetTemp in fd.c if interested.

Finally, for those still awake and sober: The "not 100% true" commentabove was that there was one solution at AOL where ADP code would on-demand create new ADP code. The idea was to do some "hard" work(query database, fetch from remote, etc.) while leaving other ADP codeto do "easy" stuff on the fly (select a promo, etc.). AOLserver nowhas direct support for this with it's new caching stuff in 4.5 butback in '99 that didn't exists. So, technically this was a case wherewe dynamically created code which was later read by ADP (which had thesame dev/inode cache stuff as fastpath). However, this was donecarefully:

-- Tcl-level mutex/condition variables to ensure only one thread didthe "hard" work even if several were interested in the result

-- Careful write to a non .adp extension, unique temp file
-- Atomic rename in place when ready

It was a combination of traditional atomic Unix filesystem semanticsand newer thread synchronization at the Tcl level used to avoid evergetting some mutant result.

BTW: If you're still interested and awake, check the ADP cache/parsecode -- it has some code to detect modification in place during parseeven though at AOL we would have never allowed that possibility (we'duse the atomic rename technique describe above). That's one case werecode was carefully written to mitigate poor programming practice weourselves would never have allowed (a programmer who had written suchcode would have ended up in my office for an uncomfortable chat). Iseem to remember trying for a very long time to exercise the failcases as things rarely would trip up even under high load but we madesure the code was in there anyway because it was closer to correct andblindly writing a file in place is sadly quite common.


-Jim






On Aug 21, 2008, at 3:24 PM, Rusty Brooks wrote:

I don't have any opinion on the fix, but I think the actualobjection to using the filename in the fix is that this would causehard links to files, which are for all intents and purposes The SameFile, to be considered different files by fastpath. (Hard linkshave different names, but the same inode)
Rusty

Titi Alailima wrote:
I agree that John's patch is worth doing. It satisfies both hisrequirements and the stated design goals of fastpath.The remaining issue is whether something called "ns_returnfile"which takes a pathname as a parameter should have some guaranteethat you will return what at least at some point was the contentsof a file with that pathname. It's perfectly acceptable in dealingwith caching systems that the cached value could be out of sync,but not that the cached value could be for something entirelydifferent from what you were looking for. Even with the mtime fixthere's no guarantee that systems which muck around with mtime(such as tar) won't cause separate files to collide. For acontrived example:1. tar xf foo.tar (creating two files "a" and "b" with the samesize and same mtime)
2. ns_returnfile b
3. Delete files "a" and "b"
4. tar xf foo.tar
5. ns_returnfile b (this could return the contents of "a" becausethe inode was reused)I don't think this example violates any of the stated principles ofusing ns_returnfile for only "static" data. Both "a" and "b" couldhave completely stable contents and due to some minor issue ofsystem administration (for example) their inodes could end upswapped and the cache poisoned.So I think we need both fixes, one to eliminate caching unless acertain criterion of "static-ness" has been met, and the other toprevent the cache from returning completely unrelated data. Othercaveats about ns_returnfile use still apply, and the documentationshould reflect them.Now the only people this wouldn't satisfy are those who areconcerned about pathnames taking up space in the cache or slowingit down. The option has been suggested to make pathname inclusionoptional, though I would advise against it unless the configurationoption is named in such a way as to indicate its "unsafe"-ness.
Titi Ala'ilima
Lead Architect
MedTouch LLC
1100 Massachusetts Avenue
Cambridge, MA 02138
617.621.8670 x309
-----Original Message-----
From: AOLserver Discussion [mailto:[EMAIL PROTECTED] On
Behalf Of Tom Jackson
Sent: Thursday, August 21, 2008 12:25 PM
To: AOLSERVER@LISTSERV.AOL.COM
Subject: Re: [AOLSERVER] Data "corruption" with fastpath caching

On Thu, 2008-08-21 at 11:14 -0400, Dossy Shiobara wrote:
4) I see the simplest (best?) solution here being a configurable
parameter that controls fastpath's cache key generation.  As Jim
points
out, one can quickly test whether this would solve the problem at
hand
by temporarily #define'ing _WIN32 in the appropriate place. Ifthis
proves successful, we change it from using #ifdef's to regular if()
statements and define a new configuration parameter.  End of
discussion.
I have responded twice to John's newest patch idea, which is a oneline
patch. It appears to completely eliminate any problem with cache
poisoning. It is simple, it doesn't change the semantics of thecommandor anything else. It simply works around a known limitation of thestat
mtime granularity.

The only security issue that was exposed was the misuse of
ns_returnfile. All of the data put into cache were entirely underthe
control of the AOLserver process. The developer / maintainer of that
process is responsible for everything the process does.ns_returnfile
is
an inherently dangerous API, there is no handholding involved. Youhave
to understand what it is doing and why it exists.
In fact, John even pointed out that the original code which wroteoutthe contents of the file reused the same name over and over.Assuming
that you can know that the contents of a file have not changed just
because it has the same name, same mtime and same size is an invalid
assumption, it will always be invalid. All caches have the same
limitation. By definition they are not in sync with the true copy.
Anyone who uses a cache needs to understand this.
So, this is important, John is not interested in the cache, heactuallywants to avoid the cache. So talking about how stuff is stored inthe
cache, and under what key, is unimportant for John. He wants to keep
his
newly created file from ever getting into the cache.

And this is where he has a point, a very good one. Why put newly
created
files into a cache, if the point of the cache is to handle static
files?
We can wait for evidence that it is static. In this case, we canwaituntil it is a few seconds old, at least. John's patch does exactlythis
and nothing more. It is actually a very ingenious change.
There is no difference between the inode and the filename underunix.
Both offer equal opportunity to screw up due to a race condition. It
can
still happen even in the patched ns_returnfile. Jim mentioned this.
After a file is stat'ed, the open might find a different (maybe
truncated) file. There is no guarantee that you won't get something
else, especially if you have multiple processes/threads creatingfiles
in an non-synchronized way. It is not part of ns_returnfile to
guarantee
that the contents/age of a file remains unchanged during thecourse of
execution, and when you throw in an external process it is nearly
impossible to come up with any code which can provide thatguarantee.
If
data integrity is really important to you, don't try to provide it
using
named files as temporary storage.

tom jackson


--
AOLserver - http://www.aolserver.com/

To Remove yourself from this list, simply send an email to
<[EMAIL PROTECTED]> with the
body of "SIGNOFF AOLSERVER" in the email message. You can leave the
Subject: field of your email blank.
--
AOLserver - http://www.aolserver.com/
To Remove yourself from this list, simply send an email to <[EMAIL PROTECTED]> with thebody of "SIGNOFF AOLSERVER" in the email message. You can leave theSubject: field of your email blank.
--
AOLserver - http://www.aolserver.com/
To Remove yourself from this list, simply send an email to <[EMAIL PROTECTED]> with thebody of "SIGNOFF AOLSERVER" in the email message. You can leave theSubject: field of your email blank.



--
AOLserver - http://www.aolserver.com/

To Remove yourself from this list, simply send an email to <[EMAIL PROTECTED]> 
with the
body of "SIGNOFF AOLSERVER" in the email message. You can leave the Subject: 
field of your email blank.

Re: [AOLSERVER] Data "corruption" with fastpath caching

Reply via email to