David Nuescheler wrote:
we recently ran a test using jackrabbit and cqfs,
populating roughly 5m items (~500k nodes), and
even without using an rdbms back end we did not
run into issues. the performance of the persistence
layer degraded over time though.
Don't you mean you got good performance because you were NOT using a
database? Although I've been a proponent of DB storage, I also know
that there will always be an overhead compared to raw file access. There
are other advantages though (as you've summarized here:
http://www.day.com/site/en/index/products/content-centric_infrastructure/content_repository/crx_faq.html
:) )
Are there any efforts to make Jackrabbit clusterable for a load-sharing
scenario (no session failover at the repository layer)?
i think there are a couple of caches that need to be made
clusterable (or at least pluggable) in the jackrabbit core for
that to happen efficiently. it has to be done very carefully,
but it should not be too much work i think.
this is definitely on the roadmap, and investigations in that
direction have already happened.
From what I have seen, making the cache implementation pluggable would
be a necessary first step. It would then become possible to plug in OSCache,
JBossTreeCache or Tangosol Coherence, which all support clustered caches.
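To make that concrete, here is a rough sketch of what such a pluggable
cache interface might look like. The names are made up for illustration,
this is not existing Jackrabbit API:

    // A hypothetical cache SPI -- it just illustrates the seam a
    // clustered implementation could plug into.
    public interface ItemCache<K, V> {
        V get(K key);              // returns null on a miss
        void put(K key, V value);  // a clustered impl would replicate or
                                   // invalidate its peers here
        void evict(K key);         // explicit invalidation, e.g. after a
                                   // write on another cluster node
    }

    // A trivial local default, roughly what a non-clustered core would use.
    class LocalItemCache<K, V> implements ItemCache<K, V> {
        private final java.util.Map<K, V> map =
                java.util.Collections.synchronizedMap(new java.util.HashMap<K, V>());
        public V get(K key)             { return map.get(key); }
        public void put(K key, V value) { map.put(key, value); }
        public void evict(K key)        { map.remove(key); }
    }

A Coherence- or TreeCache-backed implementation would then simply be
another ItemCache plugged in behind the same interface.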
- implementing/extending an ORM layer (Hibernate with shared caching for
performance). The persistence implementation should be aware of the
node types and allow a type-specific mapping to tables, so that node
types with many instances can be mapped to their own tables while
keeping the flexibility for new "simple" node types (see the sketch
below).
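To illustrate the table-per-node-type idea, here is a minimal sketch of
how node types might be routed to tables. All names below are
hypothetical; this is not the current ORM code:

    import java.util.HashMap;
    import java.util.Map;

    // Hypothetical routing of node types to tables: high-volume types
    // get a dedicated table, everything else falls back to a generic one.
    class NodeTypeTableMapper {
        private static final String GENERIC_TABLE = "NODE"; // for "simple" node types
        private final Map<String, String> typeToTable = new HashMap<String, String>();

        // register a dedicated table for a node type with many instances
        void mapType(String nodeTypeName, String tableName) {
            typeToTable.put(nodeTypeName, tableName);
        }

        // resolve which table a node of the given type is stored in
        String tableFor(String nodeTypeName) {
            String table = typeToTable.get(nodeTypeName);
            return table != null ? table : GENERIC_TABLE;
        }
    }

So, for example, mapType("my:article", "ARTICLE_NODE") would send all
my:article nodes to their own table, while a newly registered ad-hoc
node type would still land in the generic NODE table, and a shared
(second-level) cache could sit in front of either mapping.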
One quick note about the current ORM implementation: the one I've worked
on for Jackrabbit can certainly be improved. Feel free to have a look
and contribute! But what David is saying is true: for performance, the
higher you can cache, the better!
Regards,
Serge Huber.