Firstly, thanks for the in-depth reply! Some thoughts interleaved below...
Rocco Caputo wrote:
On Aug 22, 2006, at 14:06, Nick Williams wrote:
Rocco Caputo wrote:
Do you have a use case where it's impossible to do something under
the new behavior? I'm working under the assumption that a session
can always find a way to call sig(YOUR_SIGNAL_HERE => undef) when
it's ready to be destroyed.
"When it's ready to be destroyed" is the key. The new behaviour
means that sessions need to track that themselves and (in effect)
manage their own reference counting independent of POE's. Before,
they could just rely on POE to let them know via _stop. Now, _stop
is no longer called unless the session clears the sig first. This is
just a catch-22. And it leaves a race condition whereby the session
exists but has declared it no longer wants the signal.
Sessions already do need to manage their reference counts, at least
in the sense that they won't exit until they stop watching for
events. Signal events are just another kind of event, and the
semantics were a bit exceptional for some good reasons that don't
necessarily apply anymore.
If I understand the catch-22 correctly, it's that a session can't
clear its signal watchers from _stop because those watchers prevent
_stop from executing. If that's the case, I'd like to point out that
it's not very useful to clear any resources from _stop to begin
with. POE will automatically reclaim its resources from the session
after _stop returns, so any explicit POE cleanup in _stop is an
expensive no-op.
In my existing code, I'm not cleaning up POE resources - it's *my* resources I'm cleaning up in _stop. The point is that _stop no longer gets called because of the signal handlers, so I can no longer use _stop as a garbage-cleanup mechanism similar to DESTROY: by definition I now always have to know when the session is about to be destroyed (in order to remove the signal handlers first), which makes _stop itself superfluous.
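To make that concrete, here's roughly the pattern I mean (an untested sketch; the handler names and the log file are just stand-ins):

    use POE;

    POE::Session->create(
        inline_states => {
            _start => sub {
                my ($kernel, $heap) = @_[KERNEL, HEAP];
                # *My* resource, not POE's - something I want torn down
                # when the session goes away.
                open $heap->{log}, '>>', '/tmp/example.log' or die $!;
                # Handler "in case" a HUP ever arrives.
                $kernel->sig(HUP => 'got_hup');
                # The session's actual work.
                $kernel->delay(tick => 5);
            },
            tick    => sub { },   # the real work finishes here
            got_hup => sub { $_[KERNEL]->sig_handled() },
            _stop   => sub {
                # DESTROY-style cleanup of my own resources. Under the old
                # semantics this ran once the delay had fired; under the new
                # semantics the sig() watcher keeps the session alive, so we
                # never get here unless something clears the watcher first.
                close $_[HEAP]->{log};
            },
        },
    );
    POE::Kernel->run();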
To be fair regarding discussions on this list, Jonathan Steinert
announced the intent to make sig() hold sessions alive in his 19
October 2005 message titled "Nastiness, and wrapping up signal
reforms". I replied that day with:
Big change. I don't mind this; the old semantics of not holding
a reference count were tied to _signal, which delivered signals
without sessions explicitly asking for them. _signal is gone
now, so we can tie the explicit interest of sig() into a
reference count to keep the session alive.
Nobody else responded. 17 days later I replied with a public go-
ahead to make the change.
Yes, I realise that there was this discussion previously. However, speaking purely for myself, I didn't understand the impact of the change at the time, since I wasn't cognizant of the internals of session reference counting. Now that I've looked at it, I can't see how the new implementation makes sense.
I can understand how the implementation might be confusing. The
released versions since last December have flaws, especially
regarding reference counting. In fact I recently committed fixes for
them while portability testing some of Benjamin Smith's new tests.
My issue is that the bugzilla that Jonathan was attempting to
fix is just trivial to fix using existing POE mechanisms of aliases,
since there's an easy point at which you know you want to start the
persistence of the session, and there's a well-defined point at
which you can release the persistence. However, by making the
behaviour of persistence implicit within signals, there is simply no
way to achieve the opposite effect (automatic garbage collection).
The user (the application) must decide at which point it has no more
work to do and at that point it can then clear the signal. And only
then will POE do its garbage collection and call back to the
application. This just doesn't make sense. Especially when you
compare signals in POE with signals in other dispatchers. Having a
handler configured for a signal should not make that process
persistent.
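For comparison, the alias route I'm talking about looks roughly like this (sketch; the alias and event names are made up):

    use POE;

    POE::Session->create(
        inline_states => {
            _start => sub {
                my $kernel = $_[KERNEL];
                # Explicit start of persistence: "I have work outstanding".
                $kernel->alias_set('term_waiter');
                $kernel->sig(TERM => 'got_term');
            },
            got_term => sub {
                my $kernel = $_[KERNEL];
                $kernel->sig_handled();
                # Well-defined end of persistence:
                $kernel->sig('TERM');                  # clear the watcher
                $kernel->alias_remove('term_waiter');  # release the session
            },
            _stop => sub { },   # reached normally once the alias is gone
        },
    );
    POE::Kernel->run();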
The point that persistence shouldn't be tied to signals for
flexibility's sake is the start of a slippery slope. What would then
stop us from asserting that some timers should not imply
persistence? Input timeouts, for example. They don't contribute to
a session's lifespan since they're only relevant as long as there's
an I/O watcher. Why then should delay() keep sessions alive?
We're really talking semantics. A delay says, in effect (and is documented as), "call me back at time T". Time T will always arrive, by definition, so it makes sense with those semantics that the session should stay alive until at least time T. A signal, by contrast, might never happen.
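In code, the contrast I'm drawing is simply this (sketch):

    use POE;

    POE::Session->create(
        inline_states => {
            _start => sub {
                # "Call me back at time T" - T will arrive, so it makes
                # sense for this to keep the session alive until then.
                $_[KERNEL]->delay(timed_out => 30);

                # "Call me back IF this ever arrives" - it may never
                # arrive, so (to my mind) it shouldn't pin the session
                # on its own.
                $_[KERNEL]->sig(USR1 => 'got_usr1');
            },
            timed_out => sub { },
            got_usr1  => sub { $_[KERNEL]->sig_handled() },
        },
    );
    POE::Kernel->run();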
One solution might be to expand the Kernel's APIs for different
semantic variants of each watcher. The endgame for that strategy is
ugly at best.
And after reading your comments, I'll agree with you that many variants
of the API aren't a good idea.
Another solution might be to drop implicit reference counting
altogether. Every session would then need to explicitly hold
something as long as it wanted to live. That's the "alias" idea,
although it should probably be something with a stronger reference
count. Then there's the converse: No session will die unless it
explicitly asks to be killed. Each has a certain charm, although
both have their pitfalls.
I think none of the solutions in the previous paragraph will satisfy
a significant portion of POE's users, so I'd rather just not go there
at this point.
Agreed.
Your last point is a good one. Where possible, POE has used Unix as
a model for its behavior, and signal handlers don't contribute to a
process' lifetime. The new sig() semantics are therefore incongruous
with the base model. Before I trash them, though, I'd like to learn
more about the other side. Are the new sig() semantics necessary? I
seem to recall yes, but I don't remember the details. If Jonathan
Steinert doesn't explain it, I'll need to make time to grovel through
my logs (and the source) to refresh my memory.
The logs imply that the reason it was changed was to deal with a bugzilla entry about there being no simple way to set up a session to wait for UI_DESTROY. This is interesting, because given the earlier distinction between delays (time passing will always happen) and signals (which may never happen), we can see that UI_DESTROY is a very special signal that WILL ALWAYS happen (if requested), which is very different from the rest of the signals. The API was changed to deal with UI_DESTROY, without taking into account that it's a different beast.
And it's after 04.30 here (it's the only time I could make to answer
this message). I'm not committing to anything until after some sleep.
Again, much thanks for taking the time to put some notes onto the
mailing list for us IRC-deprived people :-).
Maybe I'm thinking about it wrong. To me, "explicit interest of sig()" does not mean that I (this session) want to stay around until that sig arrives; it means merely putting a handler in place IN CASE of that sig. It's a really, really important distinction, and the huge differentiator is that the latter behaviour - 'in-case-of' handlers - CANNOT be achieved in the new POE signal world without careful application coding and forgoing the benefits of POE's internal garbage collection.
As a compromise, I've also proposed implicitly that maybe we should
have a new function wait_for_sig() as well as just sig(), so that we
can make the difference in semantics explicit to users. I don't mind
which way around the functions and semantics are achieved, so long
as there is a way of doing this.
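Roughly what I'm picturing - and to be clear, wait_for_sig() is purely hypothetical, it doesn't exist in POE today:

    # Hypothetical sketch of the two semantics side by side.

    # "In case of": install a handler, but don't hold the session alive.
    $_[KERNEL]->sig(USR1 => 'got_usr1');

    # "Wait for" (proposed, not a real call): install a handler AND hold
    # a reference count until the signal arrives or the watcher is
    # explicitly cleared.
    $_[KERNEL]->wait_for_sig(UI_DESTROY => 'ui_gone');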
People can interpret it either way depending on requirements and
point of view. In the past I've had to explain why sig(USR1 =>
"event") won't keep a daemon alive.
I'm still opposed to methods for semantic variants, though. I'd
rather avoid having both semantics at once if that's possible.
Fair enough.
Nick.