Re: memory (session) leaks in perl

Rocco Caputo Wed, 23 Aug 2006 01:46:33 -0700

On Aug 22, 2006, at 14:06, Nick Williams wrote:

Rocco Caputo wrote:
Do you have a use case where it's impossible to do somethingunder the new behavior? I'm working under the assumption that asession can always find a way to call sig(YOUR_SIGNAL_HERE =>undef) when it's ready to be destroyed.
"When it's ready to be destroyed" is the key. The new behaviourmeans that sessions need to track that themselves and (in effect)manage their own reference counting independent of POE's. Before,they could just rely on POE to let them know via _stop. Now, _stopis no longer called unless the session clears the sig first. Thisis just a catch-22. And it leaves a race condition whereby thesession exists but has declared it no longer wants the signal.

Sessions already do need to manage their reference counts, at leastin the sense that they won't exit until they stop watching forevents. Signal events are just another kind of event, and thesemantics were a bit exceptional for some good reasons that don'tnecessarily apply anymore.

If I understand the catch-22 correctly, it's that a session can'tclear its signal watchers from _stop because those watchers prevent_stop from executing. If that's the case, I'd like to point out thatit's not very useful to clear any resources from _stop to beginwith. POE will automatically reclaim its resources from the sessionafter _stop returns, so any explicit POE cleanup in _stop is anexpensive no-op.

To be fair regarding discussions on this list, Jonathan Steinertannounced the intent to make sig() hold sessions alive in his 19October 2005 message titled "Nastiness, and wrapping up signalreforms". I replied that day with:
Big change. I don't mind this; the old semantics of not holdinga reference count were tied to _signal, which delivered signalswithout sessions explicitly asking for them. _signal is gonenow, so we can tie the explicit interest of sig() into areference count to keep the session alive.
Nobody else responded. 17 days later I replied with a public go-ahead to make the change.
Yes, I realise that there was this discussion previously. However,speaking purely for myself, I didn't understand the impact of thisat the time, since I wasn't cognizant of the internals of sessionreference counting at the time. Now I've looked at this, and Ican't see how the new implementation makes sense.

I can understand how the implementation might be confusing. Thereleased versions since last December have flaws, especiallyregarding reference counting. In fact I recently committed fixes forthem while portability testing some of Benjamin Smith's new tests.

My issue is that that the bugzilla that Jonathan was attempting tofix is just trivial to fix using existing POE mechanisms ofaliases, since there's an easy point at which you know you want tostart the persistence of the session, and there's a well-definedpoint at which you can release the persistence. However, by makingthe behaviour of persistence implicit within signals, there issimply no way to achieve the opposite effect (automatic garbagecollection). The user (the application) must decide at which pointit has no more work to do and at that point it can then clear thesignal. And only then will POE do it's garbage collection and callback to the application. This just doesn't make sense. Especiallywhen you compare signals in POE with signals in other dispatchers.Having a handler configured for a signal should not make thatprocess persistent.

The point that persistence shouldn't be tied to signals forflexibility's sake is the start of a slippery slope. What would thenstop us from asserting that some timers should not implypersistence? Input timeouts, for example. They don't contribute toa session's lifespan since they're only relevant as long as there'san I/O watcher. Why then should delay() keep sessions alive?

One solution might be to expand the Kernel's APIs for differentsemantic variants of each watcher. The endgame for that strategy isugly at best. Another solution might be to drop implicit referencecounting altogether. Every session would then need to explicitlyhold something as long as it wanted to live. That's the "alias"idea, although it should probably be something with a strongerreference count. Then there's the converse: No session will dieunless it explicitly asks to be killed. Each has a certain charm,although both have their pitfalls.

I think none of the solutions in the previous paragraph will satisfya significant portion of POE's users, so I'd rather just not go thereat this point.

Your last point is a good one. Where possible, POE has used Unix asa model for its behavior, and signal handlers don't contribute to aprocess' lifetime. The new sig() semantics are therefore incongruouswith the base model. Before I trash them, though, I'd like to learnmore about the other side. Are the new sig() semantics necessary? Iseem to recall yes, but I don't remember the details. If JonathanSteinert doesn't explain it, I'll need to make time to grovel throughmy logs (and the source) to refresh my memory.

And it's after 04.30 here (it's the only time I could make to answerthis message). I'm not committing to anything until after some sleep.

Maybe I'm thinking about it wrong. To me, "explicit interest of sig()" does not mean to me that I (this session) want to stay arounduntil that sig, it means to me that it's merely putting in place ahandler IN CASE of that sig. It's a really really importantdistinction. With the huge differentiator that the latter behaviourof 'in-case-of-handlers' CANNOT be achieved in the new POE signalworld without careful application coding and ignoring any of thebenefits of POE's internal garbage collection.
As a compromise, I've also proposed implicitly that maybe we shouldhave a new function wait_for_sig() as well as just sig(), so thatwe can make the difference in semantics explicit to users. I don'tmind which way around the functions and semantics are achieved, solong as there is a way of doing this.

People can interpret it either way depending on requirements andpoint of view. In the past I've had to explain why sig(USR1 =>"event") won't keep a daemon alive.

I'm still opposed to methods for semantic variants, though. I'drather avoid having both semantics at once if that's possible.


--
Rocco Caputo - [EMAIL PROTECTED]

Re: memory (session) leaks in perl

Reply via email to