Re: Is JDBC persistence manager supported by jackrabbit?

Vadim Gritsenko Fri, 02 Sep 2005 05:49:22 -0700

Marcel Reutegger wrote:

Vadim Gritsenko wrote:
Edgar Poce wrote:
when I decided to write the jdbc pm proposed in jcr-91 I wanted:

1 - a mature, transactional and scalable persistence storage
2 - use rdbms administrative tools, like scheduled backups, etc.
3 - rdbms referential integrity
4 - avoid redundancy. PMs store the NodeReferences twice.
5 - a storage that allows to modify the data easily, just in case.
I need at least 1, 2, and clustering on top of that... None ofexisting PMs will work in cluster environment (OJB and Hibernate donot count).
Please note that clustering Jackrabbit is not just about the persistencemanager. It also involves many other areas that we need to take care of.

I know. But having transactional clustered PM will enable me to create a clusterof Level 1 repository instances to run them on app servers. Next step can beenabling flushing/synchronization of caches on those Level 1 instances. Andafter all that is done, full clustering (with distributed locking, etc) will beeasier to tackle.

See: http://issues.apache.org/jira/browse/JCR-169 for a starting pointon discussions about this topic.


Thanks for the pointer.

Why wait release? :-) Isn't code in contrib meant to be grounds forexperimental code? :-) Let's bring it up before that - SimpleDB isn'tusable as well:
  * Synchronized to death
  * Stored BLOBs locally
Feel free to provide patches to enhance concurrency.

My first patch than will be port of connection pools from Edgar's JDBC PM. OnceDB PM has access to DB connection pool, there will be no need for anysynchronizations. Would you accept it?

Some enhancements that crossed my mind are:
- use a separate read-only connection for load() and exists() operations
- use a pool of prepared statements for load() and exists()

There are issues with single/double-connection design, beside the fact that(j2ee) applications are discouraged from managing system resources themselves:


  * No transaction isolation - which brings need for synchronizations
  * No keep-alive monitoring
  * No ability to reconnect severed connection

As for statement caching, IIRC driver does this.

With those changes we can then loosen some of the synchronization.
BLOBs are stored locally because many DBs are known for their badperformance when it comes to handling streams. So, speaking ofenhancements, introducing a configuration choice for BLOB handling isprobably another one.

Locally stored BLOBs might be Ok for non-clustered environment. It might be evenOk in some cluster deployments, if there is a replication mechanism.

But I don't think it is a good idea to replicate full set of BLOBs over eachserver (multiple times - if server runs more than one webapp) which happen tohave a need to access the repository. I prefer having all BLOBs in one place,even if it is a bit slower...


Vadim

Re: Is JDBC persistence manager supported by jackrabbit?

Reply via email to