excellent, thanks for the info guys.

yeah ... 9000 blogs is a lot and in truth i'm not even sure any system could really handle much more than that in a truly dynamic fashion. rss feeds are pretty easy to cache because they aren't as complex, but the pages themselves are more complicated and there are a number of possible views of the page data which makes caching even harder.

i am particularly intrigued that you both commented about the use of static html pages. i think this would be a great option for Roller and it would be very cool if we could be pretty sneaky about it and actually use the same url structure that exists now, but just map to raw html files on the backend. we should do some investigations on this, but i certainly like the idea of having the option to use static html pages.

Matt, if you are curious about your cache performance then you can turn up the debugging on the LRUCacheHandler2 class (i believe that's the right one). This will flood your logs with lots of messages about cache hits and misses so make sure and watch your log files sizes, but it'll give you the info you need. Another good idea is to turn on garbage collection debugging messages so you can see how much of your heap you are using. With 9000 blogs my guess is that your caches are pretty overwhelmed, but I would also guess that if you check your the garbage collection after a Full GC that you probably have a little more room in your heap to increase the sizes.

-- Allen


Matthew P. Schmidt wrote:

I'll share. We have about 9000 blogs with rapid growth. Its running on one dual xeon, on MySQL and Resin and uses about a 1.6G heap with 3 3000 item caches (page, rss, last modified). I'm not sure how much they're actually being used. Load is generally pretty manageable, especially with the latest version of Roller. As for hits, most of it is RSS, with several million hits of that per month. There are also a million or more blog views per month and the server doesn't generally have to restart that often. Before merging our fork with Roller 1.2, we were restarting every night due to a memory leak. Our biggest problem is probably the amount of referrer spam, even with a healthy blacklist of dirty words. I think static HTML for the pages (which they basically are now if your cache is big enough) and a better referrer filter would be two big helpers for us. Matthew P. Schmidt
Vice President of Technology
Javalobby.org
Email: [EMAIL PROTECTED]
Phone: 919.678.0300



Elias Torres wrote:

We have also around 1800 blogs and it's growing rapidly. Also, around
12K people make use of the system in total and this we know because we
don't allow anonymous comments. You need to be authenticated for
someone to comment/post.

I wonder why you are not allowed to give out server info. Maybe I'll
hold off on that too for now.

I'm sure others have asked this before, but is there a plan of turning
Roller blogs into static HTML? I'd be interested in hearing your
thoughts on this. I'm sure this would alleviate many of the caching
performance problems.

Elias

On 8/9/05, Allen Gilliland <[EMAIL PROTECTED]> wrote:
I am diverging from the deployments discussion for a second because Elias comment sparked a question. I'm interested in anything that anyone wants to share about their roller installation ...

how many blogs does it have?
what is your performance like?
what are your cache size settings?
how good is your caching efficiency on average?
any numbers on how much activity the site gets? hits/visits? load?
server info?  processors?  ram?  OS?  webserver?  database?
how is stability?  does the server require restarts often?

anything that can be shared would be cool. i'd like to keep some info on who is running roller and in what kind of environments so that we can hopefully make sure we are keeping roller well suited for various situations.

blogs.sun.com currently has almost 1600 blogs on it and our stability is quite good. i would give out server info, but i'm not allowed to. probably our biggest performance concern is page caching, which has gotten worse and worse as more people start blogging. i think out cache size is 4000 right now and that is plenty for the rss cache, but the page cache is still overwhelmed :/

anyways ... how about others?

-- Allen


On Tue, 2005-08-09 at 13:59, Elias Torres wrote:
I'm also not an official part of the project, but I might be running
the second or third largest Roller-based website ;-) and my opinion if
it counts at all, is that if Dave/Allen can handle the heat in the
kitchen, let them stay in the kitchen. I'm sure that's why they pay
them the big bucks.


Reply via email to