On Jan 28, 2007, at 2:22 PM, Steve wrote:
Hi everyone,
As a coding exercise, just to "see if I could do it," I decided to
make a chat server using UDP.
Up to this point, you're doing pretty well! Coding things just to
see if you can do them is excellent practice and lots of fun, to boot.
A major part of my design is the ability to scale up without slowing
down much, so I decided to break my server design into three major
component objects:
Listener, Sender, Core.
This isn't too bad of a goal, but it is often wise to make a fairly
simple prototype that performs the core features as simply as
possible and see if it works well enough for you. If it slows down
too much as you perform scale testing, you can see exactly why it
happens and know precisely what needs to change.
The listener is pretty simple: we just create a non-blocking socket
on a port and poll it periodically.
Now we're really off in the weeds. Periodic polling of a non-blocking
socket is almost never what you want, at least not polling by
hand. If the listener is in its own thread, just block on your read
call. If you need to do other things in the thread while waiting for
input, there's always the select() or poll() system calls, which will
block until input arrives on any of the file descriptors you tell
them to watch or a timeout of your choice occurs.
In fact, by building your application out of an event dispatch loop
centered on a select() call, you can avoid dealing with pthreads
altogether. As far as I'm concerned, users of C and C++ should avoid
threading whenever it is feasible to, because threading introduces
nondeterminism into your code and opens the door to all sorts of
hard-to-find errors, many of which won't appear until the application
is under really heavy load.
The Core handles processing of the information coming in from the
listener, i.e. reading the buffer, creating new sender objects if a
client has never been seen before, and cleaning up sender objects if
a client has gone too long without a response.
The Sender(s) are where I'm having difficulty, but it seems to me
this shouldn't be so hard. Basically, a sender is a self-contained
"machine". It needs its own thread because it runs in an infinite
loop: it checks the main chat buffer in the Core, sends any changes
to the client, and then sleeps for 250ms.
Let me get this straight here. You want to write a scalable
application, and you are assigning a thread to each client? Those
are seriously conflicting design features. Each new thread (assuming
you're using Linux) allocates a new process structure in the kernel,
with a freshly allocated chunk of memory for stack space and a
pointer to the same heap as the process it belongs to. That is not a
particularly cheap data structure compared to non-threaded
alternatives. Get into the hundreds or thousands of concurrent
connections, or take a DoS attack of hundreds of thousands of
incoming 'new users', and your server will fall right over.
Now, I know that in a typical implementation all clients are
contained in a list, and when the buffer has changed the server
iterates through all the clients and sends out the changes. But I
don't really like that design; the whole point of mine is to do it
without iterating through a list.
That implementation is typical because 1) it is easy, and 2) it is
efficient. What's not to like about it? If you want to dress it up,
call it the Listener Pattern and create the appropriate objects.
So, as I was saying, the sender class basically has a public method
called "void run()". This method is the function that wakes up,
checks the buffer, sends if needed, and then goes back to sleep.
... [ pthread and C++ stuff snipped ] ...
And it works, but it feels very wrong to me. Having to cast the
object to void *, then cast it back to its original type, seems like
a lot of overhead as well as being dangerous. And it has to occur
every 250 ms, which seems like a lot of recasting to me.
Well, it feels pretty wrong to me, too, but just about every
combination of C/C++ and pthreads feels wrong to me. Casting to and
from void * isn't particularly dangerous if you always know exactly
what you're casting, and it certainly doesn't add any runtime
overhead. It looks ugly, but considering the lousy type system C
has, it's sometimes necessary. It's just subverting the type system,
after all, not actually *doing* anything. What does have a lot of
overhead is waking up every 250ms (causing context switches and
interrupting something else) whether there's any reason to or not.
Worker threads like that really should stay blocked waiting for an
event, not polling.
Thoughts? Ideas? Concerns?
Thanks in advance!
Well, it's cool that you're trying to build a scalable system as a
learning project, and I think this is a reasonable sort of project to
start with. I think you're getting a bit ahead of yourself, though,
and that you ought to take a couple of steps back and start with
something simpler. If you *really* want to use threads, I would
suggest reading about them in a bit more depth before trying another
design, because your current one is fundamentally broken. If you
just want to build a scalable system, I suggest avoiding threads
altogether and building on top of a select()-based event loop.
--Levi
/*
PLUG: http://plug.org, #utah on irc.freenode.net
Unsubscribe: http://plug.org/mailman/options/plug
Don't fear the penguin.
*/