Hi Sean.

I was also looking in to how Jackrabbit behaved for my needs (I had some
specific numbers I was looking for) - the best thing I found was to just
build up some basic test code to load up enough nodes to verify it would do
roughly what I needed (which was a bit more effort then I thought, but I got
there in the end).

The moral of the tale: try it ! hack away and see how it behaves. For a
large flat structure like you are suggesting, it seems that save operations
can be a little slow (but slow is relative, in terms of user experience its
still fast) for large numbers of nodes under the same parent, but otherwise
seemed fine (I was also concerned with versioning a lot, which I am not sure
concerns you).

On 9/4/06, Sean Dynan <[EMAIL PROTECTED]> wrote:

Hi all

I am just starting to investigate Jackrabbit as the content repository
for an application and some advice from the experts would seriously
speed up my evaluation.

The problem domain in question is somewhat similar to a corporate mail
server:
- Large user base
- Frequent user reads and writes, mostly of text and
   images. By 'frequent' I mean usage on a par with an
   email client

Where it differs from the mail server comparison:
- Users can query the repository by keyword and expect
   rapid results (think Google)
- Users can query each other's information stores
- No upper bound to the physical size of the repository

Right now, I don't envisage a deep-noded store. Think of many Items,
each containing content and a little bunch of metadata (e.g. datetime,
list of keywords, etc.). The email analogy would be many email
messages, each with a body and a header.

I am also thinking of implementing each user's content store as a
Workspace. Each workspace would have two top-level nodes hanging from
its root: Private and Public. Then each of those two nodes would
contain many, many content Items.

Is there anything so far that wouldn't be well served by building on
top of Jackrabbit?

Am I right in assuming (given my ideas above) that cross-workspace
queries are perfectly possible, and that all that is required is a
separate, logged-in Session to each one?

Can you envisage any issues (performance or otherwise) with my
pitifully meagre outline? Any sage words of advice on how to best
start architecting my repository?


Many thanks!
--
Sean Dynan

Reply via email to