On 16 Jul 2008, at 11:34, Martijn Dashorst wrote:

> On Wed, Jul 16, 2008 at 10:33 AM, Daan van Etten <[EMAIL PROTECTED]> wrote:
>> Can you elaborate a bit on your first statement? You need a lot of
>> data-juggling for many clients, so I'd love to learn why it gives higher
>> performance at the server.

> What is this data juggling you talk of? If you use sticky sessions
> (which is really necessary for any serious web application IMO) there
> should not be any data juggling.
But if you need failover, you need a buddy system, because sharing sessions across the whole cluster is neither desirable nor fast. You only eliminate the data-juggling by settling for an inferior form of failover.

And because you need extra memory, CPU and LAN traffic to keep the state synchronized, you simply need more iron to serve the same load, so you have to scale out to larger clusters earlier. It's cheaper and faster to keep the state at the client.
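To illustrate one way of "keeping the state at the client" without trusting the client: serialize the state into an HMAC-signed token, much like a signed cookie. Any server in the cluster can validate it with nothing but the shared secret, so no session replication is needed. This is a minimal sketch, not something from this thread; the class name, state format and key handling are all illustrative.

```java
import javax.crypto.Mac;
import javax.crypto.spec.SecretKeySpec;
import java.nio.charset.StandardCharsets;
import java.util.Base64;

public class ClientStateToken {

    // Sign the serialized state with HMAC-SHA256, so the client can
    // carry it around but cannot alter it undetected.
    static String sign(String state, byte[] key) {
        try {
            Mac mac = Mac.getInstance("HmacSHA256");
            mac.init(new SecretKeySpec(key, "HmacSHA256"));
            byte[] sig = mac.doFinal(state.getBytes(StandardCharsets.UTF_8));
            Base64.Encoder enc = Base64.getUrlEncoder().withoutPadding();
            return enc.encodeToString(state.getBytes(StandardCharsets.UTF_8))
                 + "." + enc.encodeToString(sig);
        } catch (Exception e) {
            throw new RuntimeException(e);
        }
    }

    // Verify and extract the state; returns null if the token was tampered with.
    static String verify(String token, byte[] key) {
        int dot = token.lastIndexOf('.');
        if (dot < 0) return null;
        String state = new String(
                Base64.getUrlDecoder().decode(token.substring(0, dot)),
                StandardCharsets.UTF_8);
        // Recompute the full token and compare: same state + same key
        // must yield the same signature.
        return sign(state, key).equals(token) ? state : null;
    }

    public static void main(String[] args) {
        byte[] key = "server-side-secret".getBytes(StandardCharsets.UTF_8);
        String token = sign("cart=3;user=42", key);
        System.out.println(verify(token, key));       // cart=3;user=42
        System.out.println(verify(token + "x", key)); // null (tampered)
    }
}
```

The trade-off Martijn points out still applies: the token travels with every request, so it only pays off while the state stays small.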


>> In my opinion it depends on your use case, but in high-load environments I'd
>> suggest to keep the state at the client.

> As long as you have 1 page that uses the server as an RPC service,
> this works. When you have to transfer state between different pages
> you'd rather keep the state on the server. Imagine posting 1MB of
> client side state to the server, and then sending it back (happened
> with a .net app).
So don't use multiple pages. Build your application without complete page refreshes: replace parts of the page, like panel replacing in Wicket. Once you've fetched that panel, cache it at the client (done right, browsers do this for you automatically). The server then only serves data or static JS/HTML.

Transfer client-fetched data in a format like JSON, which compresses well and can be fetched statelessly. The server just 'dumbly' returns data, much like a session-less DB server. It's even possible to cache these responses aggressively: in the browser, on downstream servers and/or on the server itself.

This is exactly what GMail does after your initial session, and it seems to work for them :-) GMail is one of the snappiest web apps out there. Google search does complete page refreshes, but probably only because the page is 90% search-result data. Full page refreshes can be cached too, but only if they're stateless.
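A sketch of how such stateless, cacheable data responses can work (an assumption about the mechanics, not GMail's actual implementation): derive an ETag from the JSON body, so browsers and downstream caches can revalidate with If-None-Match and get a cheap 304 instead of the full payload. Class and method names are illustrative.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.util.Base64;

public class StatelessJson {

    // An ETag derived purely from the body: no session needed, any
    // server in the cluster computes the same tag for the same data.
    static String etagFor(String body) {
        try {
            byte[] digest = MessageDigest.getInstance("SHA-256")
                    .digest(body.getBytes(StandardCharsets.UTF_8));
            return "\"" + Base64.getUrlEncoder().withoutPadding()
                    .encodeToString(digest) + "\"";
        } catch (Exception e) {
            throw new RuntimeException(e);
        }
    }

    // Status to send: 304 when the client's cached copy is still
    // current (If-None-Match matches), 200 with the body otherwise.
    static int respond(String body, String ifNoneMatch) {
        return etagFor(body).equals(ifNoneMatch) ? 304 : 200;
    }

    public static void main(String[] args) {
        String json = "{\"unread\":3}";
        String etag = etagFor(json);
        System.out.println(respond(json, null)); // 200: first fetch
        System.out.println(respond(json, etag)); // 304: cache still valid
    }
}
```

Because the response depends only on the data and not on any session, the same 304 logic works identically on the origin server and on any downstream cache.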


>> Sessions stored server-side not
>> only make it more expensive to scale out, but you're going to hit the
>> performance ceiling much sooner than with sessions at the client.

> What performance ceiling are you talking about? There is no
> performance penalty for storing state on the server, as long as you
> don't have to synchronize it with other servers. Otherwise it is just
> using memory, and functions as a good cache. To minimize clustering
> overhead most folks choose sticky sessions with a buddy system for
> failover.
As long as you don't have many visitors, you will never hit the performance ceiling. But as soon as you need more than one server, you have to solve the session problem, either by limiting the synchronization of state or with other workarounds.

With a buddy system, you only get limited failover. It doesn't work across two independent data centers, unless you continuously want to push session state around. Really, with 1 million users and (guessing) 10,000-20,000 concurrent visitors, you really don't want to store all that session state. It will slow things down.
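To put rough numbers on that: the 20,000 concurrent visitors come from the mail above, but the 100 KB of state per session is my own assumption for illustration. With buddy failover every session exists twice, so:

```java
public class SessionMemory {

    // Heap consumed by session state alone: one copy per replica.
    static long sessionHeapBytes(long concurrent, long bytesPerSession, int replicas) {
        return concurrent * bytesPerSession * replicas;
    }

    public static void main(String[] args) {
        long concurrent = 20_000;      // upper guess from the mail
        long perSession = 100 * 1024;  // assumed 100 KB of state each
        // Buddy failover keeps one extra copy of every session.
        long bytes = sessionHeapBytes(concurrent, perSession, 2);
        System.out.println(bytes / (1024 * 1024) + " MB"); // 3906 MB
    }
}
```

Roughly 4 GB of heap before the application itself has allocated anything, plus the LAN traffic to keep the buddy copies current.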

An interesting link: 
http://davidvancouvering.blogspot.com/2007/09/session-state-is-evil.html

Regards,

Daan van Etten


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]