[ApacheDS] Separating normalization from parsing

Alex Karasulu Sun, 18 Sep 2005 11:40:18 -0700

Emmanuel, Ersin,

Emmanuel and I had a brief conversation on IRC about the inefficiencies caused by sometimes parsing names twice in order to normalize them. Emmanuel had a good yet simple idea to decouple these two operations so that Name normalization does not require another parse. Incidentally, this solution also solves another problem that Ersin and I had discussed: the need to isolate normalization so that it does not complicate parsing. I just wanted to quickly let you, Ersin, know that Emmanuel had some thoughts on this, and to let you, Emmanuel, know that Ersin was thinking about this stuff :-).

Emmanuel's solution involves producing a NameComponent, rather than a String, for each component used to populate a Name. This way a populating parser separates the attribute type and value into fields within NameComponent objects. Normalization can then occur (if necessary) after the parse, on the type and/or value fields of the NameComponent objects within a DN. This approach would allow normalization to be decoupled from parsing.
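
A rough sketch of that idea, to make it concrete. Everything here is an assumption for illustration: the parse() helper, the SimpleNormalizer class, and the normalization rule itself (lowercasing and collapsing whitespace) are hypothetical stand-ins, not existing ApacheDS code. Real normalization would depend on the schema's matching rules, and a real parser would have to handle escaping and multi-valued components.

```java
import java.util.Locale;

// Hypothetical sketch of a name component holding type and value separately.
final class NameComponent {
    private final String type;
    private final String value;

    NameComponent(String type, String value) {
        this.type = type;
        this.value = value;
    }

    // Simplified populating parse: split "type=value" once at the first '='.
    // Assumption: no escaped '=' and no multi-valued components.
    static NameComponent parse(String component) {
        int eq = component.indexOf('=');
        if (eq < 1) {
            throw new IllegalArgumentException("Not a type=value pair: " + component);
        }
        return new NameComponent(component.substring(0, eq).trim(),
                                 component.substring(eq + 1).trim());
    }

    String getType() { return type; }

    String getValue() { return value; }

    @Override
    public String toString() { return type + "=" + value; }
}

// Hypothetical normalizer: works on the already-parsed fields, so no second parse.
final class SimpleNormalizer {
    private SimpleNormalizer() { }

    // Placeholder rule only: lowercase both fields, collapse runs of whitespace.
    static NameComponent normalize(NameComponent nc) {
        String type = nc.getType().toLowerCase(Locale.ROOT);
        String value = nc.getValue().replaceAll("\\s+", " ").toLowerCase(Locale.ROOT);
        return new NameComponent(type, value);
    }
}
```

The point of the sketch is only that normalize() never looks at the original string: the parser runs once, and the normalizer touches the fields it produced.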

There is, however, a slight problem with this approach, though I think we can get around it. Using a NameComponent instead of a String introduces a problem when mapping to the JNDI Name interface. Name expects a String for name components, as seen in the add() methods and the get() method. Getting around this is easy. The internal representation for name components within LdapName can be a NameComponent object rather than a String. The add() methods can be overloaded to take a NameComponent in addition to a String. The overloads taking a String can generate the NameComponent object before storing it within the internal array of NameComponents. Similarly, the get() method can call toString() to return the String representation of the NameComponent (which, by the way, can and should be cached by NameComponent). A new getNameComponent() method can be added to LdapName to give a normalizer access to the individual name components.
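
Roughly, the LdapName side could look like the sketch below. To keep it short it does not implement the full javax.naming.Name interface; SketchLdapName is a hypothetical class that only mirrors the add(), get() and getNameComponent() methods mentioned above, and the String splitting is a simplified assumption that ignores escaping.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical name component with a lazily cached String form.
final class NameComponent {
    private final String type;
    private final String value;
    private String cached; // built on first toString() call, then reused

    NameComponent(String type, String value) {
        this.type = type;
        this.value = value;
    }

    String getType() { return type; }

    String getValue() { return value; }

    @Override
    public String toString() {
        if (cached == null) {
            cached = type + "=" + value;
        }
        return cached;
    }
}

// Hypothetical stand-in for LdapName; not a full javax.naming.Name implementation.
final class SketchLdapName {
    // Internal representation is NameComponent objects, not Strings.
    private final List<NameComponent> components = new ArrayList<>();

    // New overload: store the component as given.
    SketchLdapName add(NameComponent comp) {
        components.add(comp);
        return this;
    }

    // String overload (mirrors Name.add(String)): build the NameComponent first.
    // Assumption: a plain "type=value" string with no escaped '='.
    SketchLdapName add(String comp) {
        int eq = comp.indexOf('=');
        if (eq < 1) {
            throw new IllegalArgumentException("Not a type=value pair: " + comp);
        }
        return add(new NameComponent(comp.substring(0, eq).trim(),
                                     comp.substring(eq + 1).trim()));
    }

    // Mirrors Name.get(int): returns the (cached) String form.
    String get(int posn) {
        return components.get(posn).toString();
    }

    // New accessor so a normalizer can reach the parsed fields directly.
    NameComponent getNameComponent(int posn) {
        return components.get(posn);
    }

    int size() { return components.size(); }
}
```

Callers that only know the JNDI-style String methods keep working, while a normalizer can call getNameComponent() and never reparse.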

I like Emmanuel's approach very much and think it can lead to some serious optimizations within the server.


Thoughts? Comments?

Alex
