Alex Karasulu wrote:
Short abstract:
So we should consider that checking for H/R is mandatory when adding a
new value, but no more than that.
Some more elements about the other parts of your mail:
We can let the Schema interceptor deal with normalization and syntax
checking, instead of asking the EntryAttribute to do the checking. That
means we _must_ put this interceptor very high in the chain.
Right now I think this is split into two interceptors. The first one
which is executed immediately is the Normalization interceptor. It's
really an extension of the schema subsystem. Normalization cannot
occur without schema information and the process of normalization
automatically enforces value syntax. This is because, in order to
normalize, most parsers embedded in a normalizer must validate the
syntax while transforming the value into a canonical representation
using string prep rules.
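To make that concrete, here is a tiny, self-contained sketch (the class
and method names are invented for the example, this is not the actual
Normalizer API) of a normalizer whose canonicalization step doubles as a
syntax check:

// Illustrative only: a normalizer for a case-insensitive directory string.
// Producing the canonical form forces the value through a parse/validate
// step, so a value with an invalid syntax is rejected as a side effect.
public final class CaseIgnoreStringNormalizer
{
    public String normalize( String value )
    {
        if ( value == null )
        {
            // A value we cannot parse has no canonical form: syntax violation
            throw new IllegalArgumentException( "null value has no canonical form" );
        }

        // Very rough stand-in for the string prep rules:
        // trim, collapse inner whitespace, map to lower case
        String canonical = value.trim().replaceAll( "\\s+", " " ).toLowerCase();

        if ( canonical.isEmpty() )
        {
            throw new IllegalArgumentException( "empty value violates the syntax" );
        }

        return canonical;
    }
}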
Those two guys work hand in hand... If we consider the Normalization
Interceptor alone, it is a pretty specific animal, yes. It is run as
soon as possible, to be sure that the elements sent by the client are in
good shape (i.e., comparable). But we may have to normalize values later
too: while searching for a value in an attribute, or when adding some
new attributes through any inner mechanism (a trigger, for instance)...
The big difference that has evolved between the Normalization
interceptor and the Schema interceptor is that the Normalization
interceptor is not designed to fully check schema. It does *ONLY*
what it needs to do to evaluate the validity of a request against the
DIT. For example, the DN and the filter expression are normalized early
to determine if we can short-circuit this process with a rapid return.
This reduces latency and weeds out most incorrect requests. Now, with
normalized parameters, the Exception interceptor can more accurately do
its work to determine whether or not the request makes sense: i.e., does
the entry that is being deleted actually exist? Then the request goes
deeper into the interceptor chain for further processing. The key
concept in terms of normalization and schema checking is lazy execution.
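A rough, self-contained illustration of that "normalize only what you
need, then fail fast" flow (all the names below are made up for the
example, not the real interceptor API):

import java.util.Set;

// Sketch: normalize only the DN, check existence, and reject the
// request before it travels deeper into the chain.
public final class FailFastDelete
{
    private final Set<String> existingNormalizedDns;

    public FailFastDelete( Set<String> existingNormalizedDns )
    {
        this.existingNormalizedDns = existingNormalizedDns;
    }

    // Very crude DN normalization, just enough for the example
    private static String normalizeDn( String dn )
    {
        return dn.toLowerCase().replaceAll( "\\s*,\\s*", "," ).trim();
    }

    public void delete( String requestedDn )
    {
        String normalized = normalizeDn( requestedDn );

        if ( !existingNormalizedDns.contains( normalized ) )
        {
            // Fail fast: the existence check weeds the request out
            // with a rapid return
            throw new IllegalStateException( "No such entry: " + normalized );
        }

        // ... otherwise hand the normalized request to the next interceptor ...
    }
}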
Yes, but failing fast is also a good thing to have.
Lazy execution makes sense most of the time, but from the many
conversations we've had it seems this might actually be harming us,
since we're doing many of the same computations over and over again
while discarding the results, especially where normalization is
concerned.
So true... At some point, we might want to keep the UP form and the
normalized form for values, as we do for DNs. It will cost some more
memory, but:
1) entries are transient, and can be discarded at will,
2) now that we will have StreamedValue, this won't be a big issue anymore,
3) normalizing values over and over may cost much more than storing
twice the size of the data (in the worst case),
4) very often, UP value == normalized value, so we have an easy way to
avoid doubled memory consumption (sketched below).
This needs to be discussed further, in another thread...
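Just to sketch point 4 (the class is invented, not the actual Value
implementation):

// A value keeping both its user-provided (UP) form and its normalized
// form. When the two are equal, the same String reference is reused,
// so memory is not doubled in the common case.
public final class DualFormValue
{
    private final String upValue;
    private final String normValue;

    public DualFormValue( String upValue, String normValue )
    {
        this.upValue = upValue;
        this.normValue = upValue.equals( normValue ) ? upValue : normValue;
    }

    public String getUpValue()
    {
        return upValue;
    }

    public String getNormValue()
    {
        return normValue;
    }
}

Normalizing once, at creation time, and keeping the result would also
avoid recomputing the normalized form again and again.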
Here are the possible checks we can have on a value for an attribute:
H/R: could be done when creating the attribute or adding some value into it
Yes, this will have to happen very early, within the codec I guess, right?
Yes. We will build the ServerEntry objects in the codec, as we are doing
for DNs at the moment. That means we will need access to the registries
in the codec.
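Something along these lines, just to illustrate the H/R check when a
value is added (the map below is a stand-in for the real registries, and
every name is invented):

import java.nio.charset.StandardCharsets;
import java.util.Map;

// When a value is added, look up the attribute's H/R flag and decide
// whether the value is kept as a String (human readable) or as raw bytes.
public final class HumanReadableCheck
{
    private final Map<String, Boolean> humanReadableByAttribute;

    public HumanReadableCheck( Map<String, Boolean> humanReadableByAttribute )
    {
        this.humanReadableByAttribute = humanReadableByAttribute;
    }

    public Object addValue( String attributeId, byte[] rawValue )
    {
        Boolean humanReadable = humanReadableByAttribute.get( attributeId.toLowerCase() );

        if ( humanReadable == null )
        {
            // Unknown attribute: in a real server this would be a schema violation
            throw new IllegalArgumentException( "Unknown attribute: " + attributeId );
        }

        return humanReadable
            ? new String( rawValue, StandardCharsets.UTF_8 )
            : rawValue;
    }
}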
Syntax checking: SchemaInterceptor
Normalization: SchemaInterceptor
Right now, request parameters are normalized within the Normalization
interceptor, and these other aspects (items) are handled in the Schema
interceptor.
<snip/>
It brings to my mind another concern: let's think about what could
happen if we change the schema. We would have to update all the existing
Attributes, which is simply not possible. Thus, storing the
AttributeType within the EntryAttribute does not sound good anymore
(unless we kill all the current requests before we change the schema).
It would be better to store an accessor to the schema subsystem, no?
This is a big concern. For this reason I prefer holding references to
high level service objects which can swap out things like registries
when the schema changes. This is especially important within services
and interceptors that depend in particular on the schema service. I
would rather spend an extra cycle doing more lookups with lazy
resolution, which leads to a more dynamic architecture. Changes to
components are reflected immediately this way and have little impact in
terms of leaving stale objects around which may present problems and
need to be cleaned up.
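For instance, something like this (only a sketch with invented names,
not the actual registries API):

import java.util.concurrent.atomic.AtomicReference;
import java.util.function.Supplier;

// Components hold an accessor to the registries instead of the
// registries (or an AttributeType) themselves. A schema change swaps
// the reference in one place; every later lookup sees the new
// registries, and no stale AttributeType is kept around.
public final class RegistriesHolder<R>
{
    private final AtomicReference<R> current;

    public RegistriesHolder( R initial )
    {
        this.current = new AtomicReference<>( initial );
    }

    // Called by the schema subsystem when the schema changes
    public void swap( R newRegistries )
    {
        current.set( newRegistries );
    }

    // What services, interceptors and entries would hold on to
    public Supplier<R> accessor()
    {
        return current::get;
    }
}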
You are right. I was overlooking this part. We should simply consider
that if the schema changes, then we must 'reboot' the server. At least,
it will work in any case. Schema updates are not really meant to be done
often (we are not designing AD, are we? ;).
The fact is that if we need to keep the server up and running even when
we need to change the schema, then it's a little bit more complex than
simply interacting with the loaded values in the request being processed.
However on the flip side there's a line we need to draw. Where we
draw this line will determine the level of isolation we want. Let me
draw out a couple of specific scenarios to clarify.
Scenario 1
========
A client binds to the server and pulls the schema at version 1, then
before issuing an add operation for a specific objectClass the schema
changes and one of the objectClasses in the entry to be added is no
longer present. The request will fail, and should, since the schema
changed. Incidentally, a smart client should check the
subschemaSubentry timestamps before issuing write operations to see if
it needs to check for schema changes that would make the request invalid.
That won't be enough. Here, we need a kind of two-phase commit, as we
are modifying two sources of data at the same time. Not very simple to
handle. We should also consider that we may have concurrent requests on
the same data...
Scenario 2
========
A client binds to the server and pulls the schema at version 1, then
issues an add request; as the add request is being processed by the
server, the schema changes and one of the objectClasses in the entry to
be added is no longer present.
Scenario 1 is pretty clear and easy to handle. It will be handled
automatically for us anyway, without having to explicitly code the
correct behavior. Scenario 2 is a bit tricky. First of all we have to
determine the correct behavior that needs to be exhibited. Before
confirming with the specifications (which we need to do), my suspicions
would incline me to think that this add request should be allowed, since
it was issued and received before the schema change was committed. In
this case it's OK for the add request to contain handles on schema data
which might be old but consistent with the time at which that request
was issued.
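One possible way to express that per-request view (again only a sketch,
assuming some accessor-style handle to the registries; none of these
names are real):

import java.util.function.Supplier;

// A request resolves the schema once, when it is received, and uses
// that snapshot for the rest of its processing, even if the live schema
// is swapped in the meantime.
public final class RequestSchemaView<R>
{
    private final R schemaSnapshot;

    public RequestSchemaView( Supplier<R> registriesAccessor )
    {
        // Captured at the time the request is issued/received
        this.schemaSnapshot = registriesAccessor.get();
    }

    public R schema()
    {
        return schemaSnapshot;
    }
}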
So to conclude, I think it's OK, preferred and efficient for request
parameters and intermediate derived data structures used to evaluate
requests to have and leverage schema information that is not necessarily
up to date with the last schema change. This brings up a slew of other
problems we have to tackle, btw, but we can talk about this in another
thread.
Oh, yeah... No need to stop and think right now, as the current server
does not handle those problems anyway. First, we have to 'clean' the
Entry code :)
<snipped the rest of the convo, it will bring us far away from my
initial short Q ;) />
--
cordialement, regards,
Emmanuel Lécharny
www.iktek.com
directory.apache.org