Re: Proposal for simple LCF deployment model

Jack Krupansky Fri, 28 May 2010 10:49:42 -0700

But for a basic, early evaluation, "test drive", just the file system andweb repository connectors should be sufficient. And if there is a cleandatabase abstraction, a basic database package (e.g., derby) should besufficient for such a basic evaluation.

Are there technical reasons why third-party repository connectors cannot besupported using a Solr-style "plug-in" approach? Or, worst case, as separateprocesses with a clean inter-process API? Maybe not in the near-term, but asa longer-term vision.


-- Jack Krupansky

--------------------------------------------------
From: <karl.wri...@nokia.com>
Sent: Friday, May 28, 2010 11:10 AM
To: <connectors-dev@incubator.apache.org>
Subject: Re: Proposal for simple LCF deployment model

You forget that building lcf in its entirety requires that you supplyproprietary client components from third-party vendors. So i think it isunrealistic to expect canned builds that contain everything that you justdeploy. For lcf i think the build cycle will thus be very common.


Getting rid of the database requirement is also obviously not an option.

Karl

--- original message ---
From: "ext Jack Krupansky" <jack.krupan...@lucidimagination.com>
Subject: Re: Proposal for simple LCF deployment model
Date: May 28, 2010
Time: 10:42:17  AM

A simple deployment ala Solr is a good goal. Integrating Jetty with theLCF

deployment will go a long way towards that goal. The database software

deployment (PostgreSQL) is the other half of the hassle with deployingLCF.


I think there are three distinct goals here: 1) A super-easy Solr-style
deployment for initial evaluation of LCF, 2) deployment of the LCF
components for full-blown application development where app server and
database might need to be different from the initial evaluation, and 3)
deployment of LCF components for production deployment of the full
application.

Right now, evaluation of LCF requires deployment of the source code and

building artifacts - Solr evaluation does not require that step.Eliminated

the source and build step will certainly help simplify the evaluation
process.

Another possible consideration is that although some of us are especially
interested in integration with Solr and doing so easily and robustly, Solr
is just one of the output connections and LCF could be deployed for
applications that do not involve Solr at all. So, maybe there should be an
extra deployment wiki page for Solr guys that focuses on use of LCF with

Solr and related issues. Whether that should be the default presentationin

the doc is a matter for debate. Right now, I see no harm with a Solr bias.
At least it is a convenient way to demonstrate end-to-end use of LCF.

-- Jack Krupansky

--------------------------------------------------
From: <karl.wri...@nokia.com>
Sent: Friday, May 28, 2010 5:48 AM
To: <connectors-dev@incubator.apache.org>
Subject: Proposal for simple LCF deployment model

The current LCF standard deployment model requires a number of moving
parts, which are probably necessary in some cases, but simply introduce
complexity in others.  It has occurred to me that it may be possible to
provide an alternate deployment model involving Jetty, which would reduce
the number of moving parts by one (by eliminating Tomcat).  A simple LCF
deployment could then, in principle, look pretty much like Solr's.

In order for this to work, the following has to be true:

(1) jetty's basic JSP support must be comparable to Tomcat's.
(2) the class loader that jetty uses for webapp's must provide class

isolation similar to Tomcat's. If this condition is not met, we'd needto

build both a Tomcat and a Jetty version of each webapp.

The overall set of changes that would be required would be the following:

(a) An alternative "start" entry point would need to be coded, whichwould

start Jetty running the lcf-crawler-ui and lcf-authority-service webapps
before bringing up the agents engine.
(b) The alternative starting point should probably autocreate the
database, and should also autoregister all connectors.  This will require
a list, somewhere, of the connectors and authorities that are included,
and their preferred UI names for that installation.  This could come from
the configuration information, or from some other place.  Any ideas?
(c) There would need to an additional jar produced by the build process,

which would be the equivalent of the solr start.jar, so as to makerunning

the whole stack trivial.
(d) An "LCF API" web application, which provides access to all of the
current LCF commands, would also be an obvious requirement to go forward
with this model.

What are the disadvantages?  Well, I think that the main problem would be
security.  This deployment model, though simple, does not control access
to LCF is any way.  You'd need to introduce another moving part to do
that.

Bear in mind that this change would still not allow LCF to run using only

one process. There are still separate RMI-based processes needed forsome

connectors (Documentum and FileNet).  Although these could in theory be
started up using Java Activation, a main reason for a separate process in
Documentum's case is that DFC randomly crashes the JVM under which it
runs, and thus needs to be independently restarted if and when it dies.
If anyone has experience with Java Activation and wants to contribute
their time to develop infrastructure that can deal with that problem,
please let me know.

Finally, there is no way around the fact that LCF requires a
well-performing database, which constitutes an independent moving part of
its own.  This proposal does nothing to change that at all.

Please note that I'm not proposing that the current model go away, but
rather that we support both.

Thoughts?
Karl

Re: Proposal for simple LCF deployment model

Reply via email to